ITSM - Incident/Problem/Outage Management Quick Links and Resources

Quick links and resources for technologists engaging when there is an Incident, Problem with Outage, or suspected Outage of a system for which the technologist is responsible.

What is an Incident? Service Outage?

How to Report an Outage

Communication Guidelines

Service affecting Problems should be reported in a 'live' fashion to SNCC as soon as possible. SNCC can then begin engaging appropriate resources to address the Problem.

  • Call SNCC
    • SysOps (3-2648 option 3)
    • NOC (3-4188 option 1)

Communicating During an Outage

Ongoing communications via the DoIT Operations Support channel is preferred as it allows for simultaneous communication among individuals engaged in the Problem.

    • Call SNCC
      • SysOps (3-2648 option 3)
      • NOC (3-4188 option 1)

    • Out of Band
      • Out of Band is used only when Microsoft Teams is unavailable or SSO (Single Sign On/Shibboleth) is unavailable.
      • Platform to be decided

Accessing the Problem Record

The Problem record is the official record. During the Outage the Problem record is owned and maintained by SNCC.

Additional Resources

Roles During an Outage

Complete role definitions can be found in the DoIT Operational Framework - Section 4.0 - Incident Management.

  • Help Desk - Respond to end-users and record end-user experience during outage. Represent end-user experience to technologists working to resolve the Problem.
  • SNCC - Engage appropriate technologists to resolve Problem. Serve as communications hub during the Problem and maintain the Problem record.
  • Technologists - Work to resolve Problem and keep appropriate stakeholders up to date on resolution progress.
  • Situation Manager - For those services which have Situation Managers identified, serve as a resource for technologist response to the Problem and identify additional resources as needed.
  • Duty Manager - SNCC on-call role which serves the management role in the absence of a Situation Manager. This role is a resource for SNCC staff.


Keywords:
incident management outage major technologist resources reporting problem
Doc ID:
24507
Owned by:
Andy B. in ITSM
Created:
2012-05-30
Updated:
2023-09-08
Sites:
DCTeam-internal, DoITCOOP-internal, DoITHelpDesk-internal, DoITStaff-internal, ITSM-internal, NetworkSrvcs-internal, SEO-internal, SNCC-internal, SysEngineering-internal