Tool Mentor: TNO - Problem Management
TM151 - How to Use Tivoli Netcool OMNIbus to Perform Problem Management
Tool: IBM Tivoli Netcool OMNIbus
Relationships
Main Description

Context

Tool mentors explain how a tool can perform tasks, which are part of ITUP processes and activities. The tasks are listed as Related Elements in the Relationships section.

You can see the details of how processes and activities are supported by this tool mentor, by clicking the links next to the icons:

Details

The IBM® Tivoli® Netcool/OMNIbus product receives alert data from the network and server environment and either directly due to a specific alert or a correlation of related alerts, will identify a probable or root cause event and optionally automatically open an incident record. All current events are stored in the memory resident ObjectServer database and are available to filtered Desktop views for investigation and management

The Netcool/OMNIbus Native Desktop provides two views: the monitor console, showing high-level monitor or summary views, and the event list, showing filtered alarm views. The Desktop allows an operator to immediately visualize the major problem areas of the managed systems. The monitor console shows events grouped into configurable categories such as:

  • Technology - for example, all routers
  • Business - for example, a department or region
  • Service - for example, a Virtual Private Network, end-to-end application, or customer

Screenshot of the Monitor console

The Filter or Association panels show the following:

  • A label - Identifies the filter associated with a panel. If you select the Label button, the SQL definition behind the panel is displayed in the Filter Builder.
  • Notes - optional information that can show the total events for the panel, the highest and lowest severities, and a metric defined by operators and administrators - showing for example the sum of all repeated events for the panel.
  • The lava lamp - proportionally shows the different severities of events. This changes dynamically as new events are received or correlated by the system. Netcool supports the ISO Standard severity levels: Critical, Major, Minor, Warning, Undefined, and Clear, and these are represented by the colors displayed. The lava lamp style may be replaced by a histogram representation.
  • View button - Launches the event list associated with the panel
  • View name - A pull-down menu defining the event view launched by the View button

The Desktop provides a range of configurable options controlling appearance, refresh rates, and the notification behavior of the event list when new events arrive. Notification may be by pop-up and/or audible alarm. New events may be set to flash if required.

A single event consists of 26 standard fields including, for example, affected Node or Device, repeating event count and Problem description or Summary:

Screenshot of event showing affected Node, repeating event count and problem description

The underlying ObjectServer schema can be modified to provide additional information as required. The event may be structured to common standard forms for example x733. All fields of the event can be shown, but it is more likely to show only those fields which are important to an operator at first glance. A pull-down list is available to display all fields in a selected event.

The event list provides the operator with a personalized view of the active events, together with a range of customizable tools supporting alarm management.

Screenshot of the event list with a personalized view of the active events

Events can be ordered and re-sorted by operator interaction. Events can be selected, acknowledged, assigned to, re-prioritized, and deleted (if the user has the correct authorization privileges).

The scroll bars allow full event information to be viewed. You can scroll down through all the alarms in the system. If you use the horizontal scroll bar at the base of the event list, you can see the complete set of information available for each alarm.

The Summary bar, located below the horizontal scroll bar, shows the total number of events by severity that match the filter for this view. Clicking on a severity causes a temporary additional filter to be applied to limit the view to the events of the selected severity. An All Events button is activated on the bar, enabling a return to the 'All events for this filter' view.

Top(n) Events

The Desktop includes 'Top N' functionality allowing for a controlled reduction in the information presented to the user, and aiding rapid fault diagnosis. The administrator can set a minimum and maximum range for operator selection, or can lock the definition, preventing user modification. Note that the Severity summary bar continues to indicate the total number of events matching the filter view.

Screenshot of The Netcool/OMNIbus Native Desktop

The Alerts Menu

The alerts menu provides a context-sensitive set of functions that may be further customized by the administrator.

Screenshot of the alerts menu

The default menu options are summarized below:

  • Acknowledge or De-acknowledge - If you acknowledge an event, the text changes to white. Since this is a multi-user system working in real-time, every other user will see this happen instantly.
  • Prioritize - Events can be re-prioritized to different states based on the ISO severity levels: Critical, Major, Minor, Warning, Indeterminate, and Clear.
  • Take Ownership - You can automatically assign an event to yourself.
  • User Assign - Events can be assigned to different operators.
  • Group Assign - Events can be assigned to different groups of operators.
  • Delete - Events can be deleted.
  • Resolve - Events can be automatically closed with predefined resolution details.

All these functions are automatically 'journalized' in the system, with the date, time, and operator who changed the function. This provides an audit trail of what happens to an event, automatically. In addition, each of the menu functions is user access controlled.

While it is feasible through the flexible configuration of the Probe, ObjectServer schema and Desktops to fully manage an Event as a Problem or Incident, the majority of installations will make use of a Help Desk style application for the generation of a Problem Ticket. Netcool bi-directional gateways enable the ObjectServer to seamlessly integrate with all major Help Desk and Service Center applications that leading companies use to resolve faults and ensure service availability.

The Netcool/OMNIbus application can automatically open a trouble ticket in the corporate help desk system and populate it with the enriched event messages, as well as obtaining ongoing status reports on each trouble ticket. "Point and Click" Desktop tools also allow the Operator to initiate the Ticket process where manual intervention is necessary or preferred.

The Netcool solution can provide inserts into the target system and also send subsequent updates, journal entries or closures. Netcool provides the ability to open, update, and notify the trouble ticket system to close the ticket. Similarly, Netcool can receive updates and notification to clear the event from the trouble ticket system. Severity, priority, assigned to, and others are typical data elements exchanged between Netcool and the trouble ticket system.

For More Information

For more information about this tool, click on the link for this tool at the top of this page.