Alarms

The Cisco device can be configured to send SNMP traps when important system events occur, such as when an interface starts or stops running, a temperature threshold is crossed, or an authentication failure occurs. When specific conditions are met, traps are translated into Cisco EMF alarms and raised against the appropriate object. Alarms are displayed in the Event Browser, and the corresponding alarm indicators appear in the Map Viewer. Alarms are cleared automatically (if the EM can clearly detect the resolution) or manually from the Event Browser.

In order to receive (SNMP) trap data from the device, the following configurations must be in place:

EM Alarms

The EM enables you to identify events, or alarms, that occur on the chassis. Within the Map Viewer application, alarms on individual objects are indicated by the colored status icon next to each managed object name in the left-hand pane, or by a colored outline on the chassis map. The status color table in this chapter lists each status color and its related severity.

Alarms propagate up the object hierarchy and are reflected at the highest level. For example, suppose a critical (red) alarm occurs on an interface. If the chassis map is not open and the interface entry is not visible in the hierarchy, the alarm could go unnoticed at that level. Propagation prevents this: the interface alarm propagates up the hierarchy to site level, so whatever level you are working at, you can see that an alarm has occurred and follow the path down to the object where the alarm exists.
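The propagation rule can be illustrated with a minimal Python sketch. This is not the Cisco EMF implementation: the class name, object names, and the numeric ranking (in particular the position of Informational relative to Normal) are assumptions made for illustration only.

```python
from dataclasses import dataclass, field
from typing import List

# Assumed ranking, highest number = most severe. The severities come from the
# status color table; the relative order of Informational and Normal is a guess.
SEVERITY_RANK = {
    "Critical": 5, "Major": 4, "Minor": 3,
    "Warning": 2, "Informational": 1, "Normal": 0,
}


@dataclass
class ManagedObject:
    """Hypothetical node in the object hierarchy (site, chassis, module, ...)."""
    name: str
    own_severity: str = "Normal"
    children: List["ManagedObject"] = field(default_factory=list)

    def add_child(self, child: "ManagedObject") -> "ManagedObject":
        self.children.append(child)
        return child

    def displayed_severity(self) -> str:
        """Most severe alarm on this object or on any object below it."""
        worst = self.own_severity
        for child in self.children:
            candidate = child.displayed_severity()
            if SEVERITY_RANK[candidate] > SEVERITY_RANK[worst]:
                worst = candidate
        return worst


site = ManagedObject("Site-1")
chassis = site.add_child(ManagedObject("Chassis-1"))
module = chassis.add_child(ManagedObject("Module-3"))
interface = module.add_child(ManagedObject("Interface-3/0"))

interface.own_severity = "Critical"   # critical alarm raised on the interface
print(site.displayed_severity())      # the site reflects the interface alarm
```

Following the path from the site down through the children whose displayed severity is Critical leads to the interface that raised the alarm.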

Viewing Alarms

Complete alarm data is available in the Event Browser application that is part of the Cisco EMF.

The Event Browser allows you to view all alarms on all objects. The Query Editor window appears automatically when you launch the Event Browser application. Use the Query Editor to set up a query (or filter) so that the Event Browser displays only the alarms that match the criteria you select.
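Conceptually, a query is a filter over the full set of alarms. The following Python sketch shows the idea only; the `Alarm` fields and the two criteria (severity and object name prefix) are hypothetical and do not reflect the actual criteria offered by the Query Editor.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Set


@dataclass(frozen=True)
class Alarm:
    """Hypothetical alarm record; field names are illustrative."""
    object_name: str
    severity: str
    description: str


def query_alarms(
    alarms: Iterable[Alarm],
    severities: Optional[Set[str]] = None,
    object_prefix: Optional[str] = None,
) -> List[Alarm]:
    """Return only the alarms that match every criterion that was supplied."""
    matches = []
    for alarm in alarms:
        if severities is not None and alarm.severity not in severities:
            continue
        if object_prefix is not None and not alarm.object_name.startswith(object_prefix):
            continue
        matches.append(alarm)
    return matches


all_alarms = [
    Alarm("Chassis-1", "Critical", "Environmental Monitor: Chassis Fan at Critical stage"),
    Alarm("Chassis-1", "Major", "Cold Start: Agent reinitializing; configuration may have changed."),
    Alarm("Chassis-2", "Critical", "Environmental Monitor: Initiating Chassis Shutdown"),
]

critical_on_chassis_1 = query_alarms(all_alarms, severities={"Critical"}, object_prefix="Chassis-1")
print(len(critical_on_chassis_1))
```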

Router Trap Support

When a fault occurs on a managed object in the network, the EM receives immediate notification, through a "trap" that is sent through the network. This trap manifests itself as an alarm through Cisco EMF functionality. The following areas support traps in the EM:

Chassis Alarms

The following table lists the traps that result in alarms raised against the chassis. For each trap, the Clears column lists the existing alarms that are cleared automatically when a new alarm of that type enters the system.

Table 11-2 Alarms Raised Against Chassis Objects

Trap	Alarm Description	Severity	Clears
Cold start	Cold Start: Agent reinitializing; configuration may have changed.	Major
Warm start	Warm Start: Agent reinitializing; configuration is unaltered.	Major
Authentication Failure	None	Major
EGP Neighbor loss	None	Minor
Shutdown Notification	Environmental Monitor: Initiating Chassis Shutdown	Critical
Voltage Normal	Environmental Monitor: Chassis Voltage Normal	Normal	Voltage Normal, Warning, Critical, Shutdown & Not Present
Voltage Warning	Environmental Monitor: Chassis Voltage at Warning stage	Warning	Voltage Normal, Warning, Critical, Shutdown & Not Present
Voltage Critical	Environmental Monitor: Chassis Voltage at Critical stage	Critical	Voltage Normal, Warning, Critical, Shutdown & Not Present
Voltage Shutdown	Environmental Monitor: Chassis Voltage at Shutdown stage	Critical	Voltage Normal, Warning, Critical, Shutdown & Not Present
Voltage Not Present	Environmental Monitor: Chassis Voltage Not Present	Informational	Voltage Normal, Warning, Critical, Shutdown & Not Present
Temperature OK	Textual description of Event	Normal	Temperature OK, Warning, Critical, Shutdown & Not Present
Temperature Warning	Textual description of Event, e.g. "Slot 20:Switch Fabric Card(6) OC-192 Hot Sensor"	Warning	Temperature OK, Warning, Critical, Shutdown & Not Present
Temperature Critical	Textual description of Event, e.g. "Slot 20:Switch Fabric Card(6) OC-192 Hot Sensor"	Critical	Temperature OK, Warning, Critical, Shutdown & Not Present
Temperature Shutdown	Textual description of Event	Critical	Temperature OK, Warning, Critical, Shutdown & Not Present
Temperature Not Present	Textual description of Event	Informational	Temperature OK, Warning, Critical, Shutdown & Not Present
Fan Normal	Environmental Monitor: Chassis Fan Normal	Normal	Fan Normal, Warning, Critical, Shutdown & Not Present
Fan Warning	Environmental Monitor: Chassis Fan at Warning stage	Warning	Fan Normal, Warning, Critical, Shutdown & Not Present
Fan Critical	Environmental Monitor: Chassis Fan at Critical stage	Critical	Fan Normal, Warning, Critical, Shutdown & Not Present
Fan Shutdown	Environmental Monitor: Chassis Fan at Shutdown stage	Critical	Fan Normal, Warning, Critical, Shutdown & Not Present
Fan Not Present	Environmental Monitor: Chassis Fan Not Present	Informational	Fan Normal, Warning, Critical, Shutdown & Not Present
Power Supply Normal	Environmental Monitor: Chassis Redundant Power Supply OK	Normal	Power Supply Normal, Warning, Critical, Shutdown & Not Present
Power Supply Warning	Environmental Monitor: Chassis Redundant Power Supply at Warning stage	Warning	Power Supply Normal, Warning, Critical, Shutdown & Not Present
Power Supply Critical	Environmental Monitor: Chassis Redundant Power Supply at Critical stage	Critical	Power Supply Normal, Warning, Critical, Shutdown & Not Present
Power Supply Shutdown	Environmental Monitor: Chassis Redundant Power Supply at Shutdown stage	Critical	Power Supply Normal, Warning, Critical, Shutdown & Not Present
Power Supply Not Present	Environmental Monitor: Chassis Redundant Power Supply Not Present	Informational	Power Supply Normal, Warning, Critical, Shutdown & Not Present

Temperature traps include the affected slot in the alarm description. This functionality will be extended to other environmental alarms in future releases. Similarly, power supply alarms identify the affected power supply, PS0 or PS1.

As can be seen from the table, Cisco EMF alarms may be cleared automatically when other alarms enter the system. The general pattern is that an incoming environmental or power supply alarm clears existing alarms of the same type, no matter the severity. All other alarms must be cleared manually.
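The clearing pattern can be sketched in Python as follows. This is a simplified model under stated assumptions: active alarms are tracked by trap name only, the alarm type is derived from the trap name prefix used in the table, and the function names are hypothetical.

```python
from typing import List, Optional

# Alarm types that clear one another, taken from the Clears column of the table.
SELF_CLEARING_TYPES = ("Voltage", "Temperature", "Fan", "Power Supply")


def alarm_type(trap_name: str) -> Optional[str]:
    """Return the self-clearing type a trap belongs to, or None if it has none."""
    for type_name in SELF_CLEARING_TYPES:
        if trap_name.startswith(type_name):
            return type_name
    return None


def apply_trap(active: List[str], trap_name: str) -> List[str]:
    """Return the active alarm list after a new trap arrives.

    An incoming environmental or power supply alarm clears existing alarms of
    the same type, whatever their severity. All other alarms stay active until
    they are cleared manually.
    """
    incoming_type = alarm_type(trap_name)
    if incoming_type is not None:
        active = [name for name in active if alarm_type(name) != incoming_type]
    return active + [trap_name]


active_alarms: List[str] = []
for trap in ["Cold start", "Temperature Warning", "Fan Critical", "Temperature OK"]:
    active_alarms = apply_trap(active_alarms, trap)

print(active_alarms)
```

In this run, the Temperature OK alarm clears the earlier Temperature Warning alarm, while the Cold start and Fan Critical alarms remain active.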

Interface Alarms

The following table provides information on traps that result in alarms raised against interface objects.

Syslog Traps

As with the other supported traps, Cisco IOS can be configured to send Syslog traps to a designated server. The eight Syslog severity levels are mapped into four Cisco EMF alarm severities. Syslog-specific data is inserted into the Message portion of the Cisco EMF alarm. In all cases, alarms are raised against the Chassis object. Syslog alarms are not cleared automatically.

Syslog alarms have a Description in the Event Browser application in the following format:

"Asserted [<clogHistMsgText>] by facility [<clogHistFacility>], Message name [<clogHistMsgName>]"

For example:

"Asserted [Critical/high priority process ATM Periodic may not dismiss.] by facility [SCHED], Message name [EDISMSCRIT]"

The syslog events associated with syslog traps are displayed in the SysLog Messages window. For further information, see the "System Log" section.
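The severity mapping and the Description format can be combined in a short Python sketch. The mapping values follow the Syslog severity table in this chapter; the function name, the dictionary layout, and the severity passed in the usage example are assumptions for illustration.

```python
# Syslog severity -> Cisco EMF severity, as listed in the Syslog severity table.
SYSLOG_TO_EMF = {
    "Emergency": "Critical",
    "Alert": "Critical",
    "Critical": "Critical",
    "Error": "Major",
    "Warning": "Minor",
    "Notification": "Minor",
    "Informational": "Informational",
    "Debug": "Informational",
}


def syslog_alarm(syslog_severity: str, msg_text: str, facility: str, msg_name: str) -> dict:
    """Build a hypothetical alarm record from the fields of a Syslog trap.

    msg_text, facility, and msg_name stand in for clogHistMsgText,
    clogHistFacility, and clogHistMsgName.
    """
    return {
        "object": "Chassis",  # Syslog alarms are always raised against the chassis
        "severity": SYSLOG_TO_EMF[syslog_severity],
        "description": (
            f"Asserted [{msg_text}] by facility [{facility}], "
            f"Message name [{msg_name}]"
        ),
    }


# The Syslog severity used here is assumed for the sake of the example.
alarm = syslog_alarm(
    "Critical",
    "Critical/high priority process ATM Periodic may not dismiss.",
    "SCHED",
    "EDISMSCRIT",
)
print(alarm["severity"])
print(alarm["description"])
```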

Configuration Management Traps

When a change is made to the configuration of a Cisco router, Cisco IOS can send a "configuration management event trap". This trap is translated into a Cisco EMF alarm with the following description:

"Config Change, Command Source: <ccmHistoryEventCommandSource>, Config Source: <ccmHistoryEventConfigSource>, Config Destination: <ccmHistoryEventConfigDestination>"

For example:

"Config Change, Command Source: commandLine, Config Source: running, Config Destination: commandSource"

Alarms are raised against the Chassis object with Informational Severity. There is no automatic clearing.
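If you need to pull the three fields back out of an alarm description (for example, when post-processing exported alarm text), a regular expression is one way to do it. The following Python sketch assumes the description follows the format shown above exactly; the function and group names are hypothetical.

```python
import re
from typing import Dict, Optional

# Pattern derived from the description format shown above. Each field is
# assumed to contain no commas.
CONFIG_CHANGE_PATTERN = re.compile(
    r"Config Change, Command Source: (?P<command_source>[^,]+), "
    r"Config Source: (?P<config_source>[^,]+), "
    r"Config Destination: (?P<config_destination>[^,]+)"
)


def parse_config_change(description: str) -> Optional[Dict[str, str]]:
    """Return the three fields of a config change alarm, or None if no match."""
    match = CONFIG_CHANGE_PATTERN.fullmatch(description.strip())
    return match.groupdict() if match else None


fields = parse_config_change(
    "Config Change, Command Source: commandLine, "
    "Config Source: running, Config Destination: commandSource"
)
print(fields)
```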

Heartbeat Polling

Heartbeat polling begins automatically when you commission a chassis. There are two types of heartbeat polling: Connectivity Management and Operational Status Polling.

Connectivity Management

The EM polls the management interface on the chassis every 60 seconds to determine network connectivity. If management connectivity is lost, the chassis enters the lost comms state, this state ripples down to all subchassis objects, and a Critical lost comms alarm is raised against the chassis. The chassis continues to poll. When it detects that connectivity has been re-established, it returns the chassis to the relevant state, and this state also ripples down to all subchassis objects. An alarm of Normal severity is then raised, which clears the Critical lost comms alarm.
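The connectivity behavior amounts to a small state machine, sketched below in Python. This is a simplified model, not the EM implementation: the class and state names are hypothetical, the "relevant state" is modeled as the state held before connectivity was lost, and all subchassis objects are assumed to share the chassis state.

```python
from typing import Dict, List, Tuple

POLL_INTERVAL_SECONDS = 60  # management interface is polled every 60 seconds

LOST_COMMS_ALARM = ("Critical", "lost comms")


class ChassisConnectivity:
    """Hypothetical model of connectivity management for one chassis."""

    def __init__(self, state: str = "normal", subchassis: Tuple[str, ...] = ()):
        self.state = state
        self._state_before_loss = state
        self.subchassis_states: Dict[str, str] = {name: state for name in subchassis}
        self.active_alarms: List[Tuple[str, str]] = []

    def _ripple(self, state: str) -> None:
        """Set the chassis state and ripple it down to all subchassis objects."""
        self.state = state
        for name in self.subchassis_states:
            self.subchassis_states[name] = state

    def on_poll(self, reachable: bool) -> None:
        """Process the result of one connectivity poll."""
        if not reachable and self.state != "lost comms":
            self._state_before_loss = self.state
            self._ripple("lost comms")
            self.active_alarms.append(LOST_COMMS_ALARM)
        elif reachable and self.state == "lost comms":
            self._ripple(self._state_before_loss)
            # The Normal alarm clears the Critical lost comms alarm.
            self.active_alarms = [a for a in self.active_alarms if a != LOST_COMMS_ALARM]
            self.active_alarms.append(("Normal", "comms restored"))


chassis = ChassisConnectivity(subchassis=("module-1", "interface-1/0"))
chassis.on_poll(reachable=False)
state_after_loss = (chassis.state, dict(chassis.subchassis_states), list(chassis.active_alarms))
chassis.on_poll(reachable=True)
print(state_after_loss[0], "->", chassis.state)
```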

Operational Status Polling

Operational status polling occurs at module and interface levels. Each module and interface object polls for its own operational status. Modules poll every 5 minutes and interfaces poll every 15 minutes. If a module detects that its operational status is down, it enters the Errored state and raises a Major alarm. The Errored state does not propagate down to PVCs, SPVCs, and sub-interfaces. If an interface goes down, you can see this in the Generic Interface Status window. While in the Errored state, the module or interface continues to poll to check whether the condition has been rectified. If it detects that the operational status has returned to normal, the object transitions into the Normal state and raises an alarm of Normal severity, which clears the previous Major alarm.
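The polling intervals and the Normal/Errored transition can be summarized in a short Python sketch. The function names are hypothetical, and the sketch applies the same transition to modules and interfaces, which is an assumption: the text above states the Major alarm explicitly only for modules.

```python
from typing import Optional, Tuple

# Polling intervals in seconds, from the text above.
POLL_INTERVALS = {"module": 5 * 60, "interface": 15 * 60}


def next_state(current_state: str, oper_status_up: bool) -> Tuple[str, Optional[str]]:
    """Return (new state, severity of the alarm raised, or None if no alarm)."""
    if current_state == "Normal" and not oper_status_up:
        return "Errored", "Major"
    if current_state == "Errored" and oper_status_up:
        # The Normal alarm clears the previous Major alarm.
        return "Normal", "Normal"
    return current_state, None


def polls_in(window_seconds: int, kind: str) -> int:
    """Number of complete polling intervals that fit in a time window."""
    return window_seconds // POLL_INTERVALS[kind]


print(next_state("Normal", oper_status_up=False))
print(next_state("Errored", oper_status_up=True))
print(polls_in(3600, "module"), polls_in(3600, "interface"))
```

One practical consequence of the intervals is detection latency: in this model, an interface that goes down may not be reported by polling for up to 15 minutes, and a module for up to 5 minutes.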

Disabling Heartbeat Polling

You can stop heartbeat polling on an individual interface by decommissioning the interface. You might want to do this if you have interfaces that are not yet connected or live. For example, when you commission a chassis, subchassis discovery is initiated automatically. Any pre-deployed interfaces that are not yet live are discovered and, because no connectivity is detected on them, put into the Errored state. An alarm is also raised on each such interface. To correct this situation, decommission the inactive interface and clear the alarm manually.

Performance Logging

Heartbeat polling is unaffected if an object is in the performance logging state.

Troubleshooting Alarms

This section describes troubleshooting techniques to help identify and resolve specific system alarms. If you are unable to resolve an alarm on your own, the Cisco Technical Assistance Center is available to help. For the Cisco Technical Assistance Center contact information, see the "Technical Assistance Center" section.

The Troubleshooting Alarms section is broken down into the following alarm categories:

The following figure provides the basic alarm detection flow and points you to the proper section.

Figure: Basic alarm detection flow. Labels, in order: Environmental; High Temperature; Chassis; Informational; Power Supply; Power Supply Mismatch; Module; Mismatch; Interface; Errored; Lost Comms; Link Down; DS3/E3 Down; T1/E1 Down; OC-3 / OC-12 / OC-48 Optical Down.

Color	Severity of Alarm
Red	Critical
Orange	Major
Yellow	Minor
Cyan	Warning
Green	No Alarms (Normal)
White	Informational

Syslog Severity	Cisco EMF Severity
Emergency	Critical
Alert	Critical
Critical	Critical
Error	Major
Warning	Minor
Notification	Minor
Informational	Informational
Debug	Informational
