Table Of Contents
Maintenance Troubleshooting
Introduction
Maintenance Events and Alarms
MAINTENANCE (1)
MAINTENANCE (2)
MAINTENANCE (3)
MAINTENANCE (4)
MAINTENANCE (5)
MAINTENANCE (6)
MAINTENANCE (7)
MAINTENANCE (8)
MAINTENANCE (9)
MAINTENANCE (10)
MAINTENANCE (11)
MAINTENANCE (12)
MAINTENANCE (13)
MAINTENANCE (14)
MAINTENANCE (15)
MAINTENANCE (16)
MAINTENANCE (17)
MAINTENANCE (18)
MAINTENANCE (19)
MAINTENANCE (20)
MAINTENANCE (21)
MAINTENANCE (22)
MAINTENANCE (23)
MAINTENANCE (24)
MAINTENANCE (25)
MAINTENANCE (26)
MAINTENANCE (27)
MAINTENANCE (28)
MAINTENANCE (29)
MAINTENANCE (30)
MAINTENANCE (32)
MAINTENANCE (33)
MAINTENANCE (34)
MAINTENANCE (35)
MAINTENANCE (36)
MAINTENANCE (37)
MAINTENANCE (38)
MAINTENANCE (39)
MAINTENANCE (40)
MAINTENANCE (41)
MAINTENANCE (42)
MAINTENANCE (43)
MAINTENANCE (44)
MAINTENANCE (45)
MAINTENANCE (46)
MAINTENANCE (47)
MAINTENANCE (48)
MAINTENANCE (49)
MAINTENANCE (50)
MAINTENANCE (51)
MAINTENANCE (52)
MAINTENANCE (53)
MAINTENANCE (54)
MAINTENANCE (55)
MAINTENANCE (56)
MAINTENANCE (57)
MAINTENANCE (58)
MAINTENANCE (61)
MAINTENANCE (62)
MAINTENANCE (63)
MAINTENANCE (64)
MAINTENANCE (65)
MAINTENANCE (66)
MAINTENANCE (67)
MAINTENANCE (68)
MAINTENANCE (69)
MAINTENANCE (70)
MAINTENANCE (71)
MAINTENANCE (72)
MAINTENANCE (73)
MAINTENANCE (74)
MAINTENANCE (75)
MAINTENANCE (77)
MAINTENANCE (78)
MAINTENANCE (79)
MAINTENANCE (80)
MAINTENANCE (81)
MAINTENANCE (82)
MAINTENANCE (83)
MAINTENANCE (84)
MAINTENANCE (85)
MAINTENANCE (86)
MAINTENANCE (87)
MAINTENANCE (88)
MAINTENANCE (89)
MAINTENANCE (90)
MAINTENANCE (91)
MAINTENANCE (92)
MAINTENANCE (93)
MAINTENANCE (94)
MAINTENANCE (95)
MAINTENANCE (96)
MAINTENANCE (97)
MAINTENANCE (98)
MAINTENANCE (99)
MAINTENANCE (100)
MAINTENANCE (101)
MAINTENANCE (102)
MAINTENANCE (103)
MAINTENANCE (104)
MAINTENANCE (105)
MAINTENANCE (106)
MAINTENANCE (107)
MAINTENANCE (108)
MAINTENANCE (109)
MAINTENANCE (110)
MAINTENANCE (111)
MAINTENANCE (118)
MAINTENANCE (119)
MAINTENANCE (120)
MAINTENANCE (122)
MAINTENANCE (123)
Monitoring Maintenance Events
Test Report - Maintenance (1)
Report Threshold Exceeded - Maintenance (2)
Local Side has Become Faulty - Maintenance (3)
Mate Side has Become Faulty - Maintenance (4)
Changeover Failure - Maintenance (5)
Changeover Timeout - Maintenance (6)
Mate Rejected Changeover - Maintenance (7)
Mate Changeover Timeout - Maintenance (8)
Local Initialization Failure - Maintenance (9)
Local Initialization Timeout - Maintenance (10)
Switchover Complete - Maintenance (11)
Initialization Successful - Maintenance (12)
Administrative State Change - Maintenance (13)
Call Agent Administrative State Change - Maintenance (14)
Feature Server Administrative State Change - Maintenance (15)
Process Manager: Starting Process - Maintenance (16)
Invalid Event Report Received - Maintenance (17)
Process Manager: Process has Died - Maintenance (18)
Process Manager: Process Exceeded Restart Rate - Maintenance (19)
Lost Connection to Mate - Maintenance (20)
Network Interface Down - Maintenance (21)
Mate is Alive - Maintenance (22)
Process Manager: Process Failed to Complete Initialization - Maintenance (23)
Process Manager: Restarting Process - Maintenance (24)
Process Manager: Changing State - Maintenance (25)
Process Manager: Going Faulty - Maintenance (26)
Process Manager: Changing Over to Active - Maintenance (27)
Process Manager: Changing Over to Standby - Maintenance (28)
Administrative State Change Failure - Maintenance (29)
Element Manager State Change - Maintenance (30)
Process Manager: Sending Go Active to Process - Maintenance (32)
Process Manager: Sending Go Standby to Process - Maintenance (33)
Process Manager: Sending End Process to Process - Maintenance (34)
Process Manager: All Processes Completed Initialization - Maintenance (35)
Process Manager: Sending All Processes Initialization Complete to Process - Maintenance (36)
Process Manager: Killing Process - Maintenance (37)
Process Manager: Clearing the Database - Maintenance (38)
Process Manager: Cleared the Database - Maintenance (39)
Process Manager: Binary Does not Exist for Process - Maintenance (40)
Administrative State Change Successful with Warning - Maintenance (41)
Number of Heartbeat Messages Received is Less Than 50% of Expected - Maintenance (42)
Process Manager: Process Failed to Come Up in Active Mode - Maintenance (43)
Process Manager: Process Failed to Come Up in Standby Mode - Maintenance (44)
Application Instance State Change Failure - Maintenance (45)
Network Interface Restored - Maintenance (46)
Thread Watchdog Counter Expired for a Thread - Maintenance (47)
Index Table Usage Exceeded Minor Usage Threshold Level - Maintenance (48)
Index Table Usage Exceeded Major Usage Threshold Level - Maintenance (49)
Index Table Usage Exceeded Critical Usage Threshold Level - Maintenance (50)
A Process Exceeds 70% of Central Processing Unit Usage - Maintenance (51)
Central Processing Unit Usage is Now Below the 50% Level - Maintenance (52)
The Central Processing Unit Usage is Over 90% Busy - Maintenance (53)
The Central Processing Unit has Returned to Normal Levels of Operation - Maintenance (54)
The Five Minute Load Average is Abnormally High - Maintenance (55)
The Load Average has Returned to Normal Levels - Maintenance (56)
Memory and Swap are Consumed at Critical Levels - Maintenance (57)
Memory and Swap are Consumed at Abnormal Levels - Maintenance (58)
No Heartbeat Messages Received Through the Interface - Maintenance (61)
Link Monitor: Interface Lost Communication - Maintenance (62)
Outgoing Heartbeat Period Exceeded Limit - Maintenance (63)
Average Outgoing Heartbeat Period Exceeds Major Alarm Limit - Maintenance (64)
Disk Partition Critically Consumed - Maintenance (65)
Disk Partition Significantly Consumed - Maintenance (66)
The Free Inter-Process Communication Pool Buffers Below Minor Threshold - Maintenance (67)
The Free Inter-Process Communication Pool Buffers Below Major Threshold - Maintenance (68)
The Free Inter-Process Communication Pool Buffers Below Critical Threshold - Maintenance (69)
The Free Inter-Process Communication Pool Buffer Count Below Minimum Required - Maintenance (70)
Local Domain Name System Server Response Too Slow - Maintenance (71)
External Domain Name System Server Response Too Slow - Maintenance (72)
External Domain Name System Server not Responsive - Maintenance (73)
Local Domain Name System Service not Responsive - Maintenance (74)
Mismatch of Internet Protocol Address Local Server and Domain Name System - Maintenance (75)
Mate Time Differs Beyond Tolerance - Maintenance (77)
Bulk Data Management System Admin State Change - Maintenance (78)
Resource Reset - Maintenance (79)
Resource Reset Warning - Maintenance (80)
Resource Reset Failure - Maintenance (81)
Average Outgoing Heartbeat Period Exceeds Critical Limit - Maintenance (82)
Swap Space Below Minor Threshold - Maintenance (83)
Swap Space Below Major Threshold - Maintenance (84)
Swap Space Below Critical Threshold - Maintenance (85)
System Health Report Collection Error - Maintenance (86)
Status Update Process Request Failed - Maintenance (87)
Status Update Process Database List Retrieval Error - Maintenance (88)
Status Update Process Database Update Error - Maintenance (89)
Disk Partition Moderately Consumed - Maintenance (90)
Internet Protocol Manager Configuration File Error - Maintenance (91)
Internet Protocol Manager Initialization Error - Maintenance (92)
Internet Protocol Manager Interface Failure - Maintenance (93)
Internet Protocol Manager Interface State Change - Maintenance (94)
Internet Protocol Manager Interface Created - Maintenance (95)
Internet Protocol Manager Interface Removed - Maintenance (96)
Inter-Process Communication Input Queue Entered Throttle State - Maintenance (97)
Inter-Process Communication Input Queue Depth at 25% of its Hi-Watermark - Maintenance (98)
Inter-Process Communication Input Queue Depth at 50% of its Hi-Watermark - Maintenance (99)
Inter-Process Communication Input Queue Depth at 75% of its Hi-Watermark - Maintenance (100)
Switchover in Progress - Maintenance (101)
Thread Watchdog Counter Close to Expiry for a Thread - Maintenance (102)
Central Processing Unit is Offline - Maintenance (103)
Aggregation Device Address Successfully Resolved - Maintenance (104)
Unprovisioned Aggregation Device Detected - Maintenance (105)
Aggregation Device Address Resolution Failure - Maintenance (106)
No Heartbeat Messages Received Through Interface From Router - Maintenance (107)
A Log File Cannot be Transferred - Maintenance (108)
Five Successive Log Files Cannot be Transferred - Maintenance (109)
Access to Log Archive Facility Configuration File Failed or File Corrupted - Maintenance (110)
Cannot Login to External Archive Server - Maintenance (111)
Domain Name Server Zone Database does not Match Between the Primary Domain Name Server and the Internal Secondary Authoritative Domain Name Server - Maintenance (118)
Periodic Shared Memory Database Backup Failure - Maintenance (119)
Periodic Shared Memory Database Backup Success - Maintenance (120)
Northbound Provisioning Message is Retransmitted - Maintenance (122)
Northbound Provisioning Message Dropped Due To Full Index Table - Maintenance (123)
Troubleshooting Maintenance Alarms
Local Side has Become Faulty - Maintenance (3)
Mate Side has Become Faulty - Maintenance (4)
Changeover Failure - Maintenance (5)
Changeover Timeout - Maintenance (6)
Mate Rejected Changeover - Maintenance (7)
Mate Changeover Timeout - Maintenance (8)
Local Initialization Failure - Maintenance (9)
Local Initialization Timeout - Maintenance (10)
Process Manager: Process has Died - Maintenance (18)
Process Manager: Process Exceeded Restart Rate - Maintenance (19)
Lost Connection to Mate - Maintenance (20)
Network Interface Down - Maintenance (21)
Process Manager: Process Failed to Complete Initialization - Maintenance (23)
Process Manager: Restarting Process - Maintenance (24)
Process Manager: Going Faulty - Maintenance (26)
Process Manager: Binary Does not Exist for Process - Maintenance (40)
Number of Heartbeat Messages Received is Less Than 50% of Expected - Maintenance (42)
Process Manager: Process Failed to Come Up in Active Mode - Maintenance (43)
Process Manager: Process Failed to Come Up in Standby Mode - Maintenance (44)
Application Instance State Change Failure - Maintenance (45)
Thread Watchdog Counter Expired for a Thread - Maintenance (47)
Index Table Usage Exceeded Minor Usage Threshold Level - Maintenance (48)
Index Table Usage Exceeded Major Usage Threshold Level - Maintenance (49)
Index Table Usage Exceeded Critical Usage Threshold Level - Maintenance (50)
A Process Exceeds 70% of Central Processing Unit Usage - Maintenance (51)
The Central Processing Unit Usage is Over 90% Busy - Maintenance (53)
The Five Minute Load Average is Abnormally High - Maintenance (55)
Memory and Swap are Consumed at Critical Levels - Maintenance (57)
No Heartbeat Messages Received Through the Interface - Maintenance (61)
Link Monitor: Interface Lost Communication - Maintenance (62)
Outgoing Heartbeat Period Exceeded Limit - Maintenance (63)
Average Outgoing Heartbeat Period Exceeds Major Alarm Limit - Maintenance (64)
Disk Partition Critically Consumed - Maintenance (65)
Disk Partition Significantly Consumed - Maintenance (66)
The Free Inter-Process Communication Pool Buffers Below Minor Threshold - Maintenance (67)
The Free Inter-Process Communication Pool Buffers Below Major Threshold - Maintenance (68)
The Free Inter-Process Communication Pool Buffers Below Critical Threshold - Maintenance (69)
The Free Inter-Process Communication Pool Buffer Count Below Minimum Required - Maintenance (70)
Local Domain Name System Server Response Too Slow - Maintenance (71)
External Domain Name System Server Response Too Slow - Maintenance (72)
External Domain Name System Server not Responsive - Maintenance (73)
Local Domain Name System Service not Responsive - Maintenance (74)
Mate Time Differs Beyond Tolerance - Maintenance (77)
Average Outgoing Heartbeat Period Exceeds Critical Limit - Maintenance (82)
Swap Space Below Minor Threshold - Maintenance (83)
Swap Space Below Major Threshold - Maintenance (84)
Swap Space Below Critical Threshold - Maintenance (85)
System Health Report Collection Error - Maintenance (86)
Status Update Process Request Failed - Maintenance (87)
Status Update Process Database List Retrieval Error - Maintenance (88)
Status Update Process Database Update Error - Maintenance (89)
Disk Partition Moderately Consumed - Maintenance (90)
Internet Protocol Manager Configuration File Error - Maintenance (91)
Internet Protocol Manager Initialization Error - Maintenance (92)
Internet Protocol Manager Interface Failure - Maintenance (93)
Inter-Process Communication Input Queue Entered Throttle State - Maintenance (97)
Inter-Process Communication Input Queue Depth at 25% of Its Hi-Watermark - Maintenance (98)
Inter-Process Communication Input Queue Depth at 50% of Its Hi-Watermark - Maintenance (99)
Inter-Process Communication Input Queue Depth at 75% of Its Hi-Watermark - Maintenance (100)
Switchover in Progress - Maintenance (101)
Thread Watchdog Counter Close to Expiry for a Thread - Maintenance (102)
Central Processing Unit is Offline - Maintenance (103)
No Heartbeat Messages Received Through Interface From Router - Maintenance (107)
Five Successive Log Files Cannot be Transferred - Maintenance (109)
Access to Log Archive Facility Configuration File Failed or File Corrupted - Maintenance (110)
Cannot Login to External Archive Server - Maintenance (111)
Domain Name Server Zone Database does not Match Between the Primary Domain Name Server and the Internal Secondary Authoritative Domain Name Server - Maintenance (118)
Periodic Shared Memory Database Backup Failure - Maintenance (119)
Maintenance Troubleshooting
Revised: December 2, 2008, OL-8000-30
Introduction
This chapter provides the information needed to monitor and troubleshoot Maintenance events and alarms. This chapter is divided into the following sections:
• Maintenance Events and Alarms - Provides a brief overview of each Maintenance event and alarm.
• Monitoring Maintenance Events - Provides the information needed to monitor and correct Maintenance events.
• Troubleshooting Maintenance Alarms - Provides the information needed to troubleshoot and correct Maintenance alarms.
Maintenance Events and Alarms
This section provides a brief overview of the Maintenance events and alarms for the Cisco BTS 10200 Softswitch in numerical order. Table 7-1 lists all maintenance events and alarms by severity.
Note
Click the maintenance message number in Table 7-1 to display information about the event.
MAINTENANCE (1)
For additional information, refer to the "Test Report - Maintenance (1)" section.
DESCRIPTION | Test Report
SEVERITY | Information (INFO)
THRESHOLD | 10000
THROTTLE | 0
MAINTENANCE (2)
For additional information, refer to the "Report Threshold Exceeded - Maintenance (2)" section.
DESCRIPTION | Report Threshold Exceeded
SEVERITY | INFO
THRESHOLD | 0
THROTTLE | 0
DATAWORDS | Report Type - TWO_BYTES, Report Number - TWO_BYTES, Threshold Level - TWO_BYTES
PRIMARY CAUSE | Issued when the threshold for a given report type and number is exceeded.
PRIMARY ACTION | No action is required since this is an information report. Investigate the root-cause event report whose threshold was exceeded to determine whether there is a service-affecting situation.
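The DATAWORDS rows in this chapter list each field as a name followed by a type, such as STRING [30] or TWO_BYTES. The following is a minimal Python sketch, not part of the product, that splits such a row into its fields. The function name parse_datawords and the tuple layout are assumptions made for illustration; the sketch reads only the notation used in these tables and does not decode actual event data.

```python
import re

# Matches one "Name - TYPE" pair as written in the DATAWORDS rows of this
# chapter, for example "Local State - STRING [30]" or "Report Type - TWO_BYTES".
# Separators between pairs (spaces or commas) are skipped by the search.
DATAWORD_PATTERN = re.compile(
    r"(?P<name>[A-Za-z][A-Za-z ]*?)\s+-\s+"
    r"(?P<type>STRING\s*\[(?P<length>\d+)\]|[A-Z]+_BYTES?)"
)


def parse_datawords(row):
    """Split a DATAWORDS row into (name, type, length) tuples.

    length is the bracketed size for STRING fields and None for the
    fixed-size types (ONE_BYTE, TWO_BYTES, FOUR_BYTES).
    """
    fields = []
    for match in DATAWORD_PATTERN.finditer(row):
        length = match.group("length")
        type_name = "STRING" if length else match.group("type")
        fields.append(
            (match.group("name").strip(), type_name, int(length) if length else None)
        )
    return fields


if __name__ == "__main__":
    row = "Report Type - TWO_BYTES Report Number - TWO_BYTES Threshold Level - TWO_BYTES"
    for field in parse_datawords(row):
        print(field)
```

A helper like this can be useful for building a lookup of the expected fields for each report number before reading event summaries.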
MAINTENANCE (3)
To troubleshoot and correct the cause of the alarm, refer to the "Local Side has Become Faulty - Maintenance (3)" section.
DESCRIPTION | Local Side has Become Faulty
SEVERITY | MAJOR
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Local State - STRING [30], Mate State - STRING [30], Reason - STRING [80], Probable Cause - STRING [80]
PRIMARY CAUSE | Can result from maintenance reports 5, 6, 9, 10, 19, or 20.
PRIMARY ACTION | Review information from the command line interface (CLI) log report. This is usually a software problem; restart the software using the Installation and Startup procedure.
SECONDARY CAUSE | The system was shut down manually using the platform stop command.
SECONDARY ACTION | Reboot the host machine, reinstall all applications, and restart all applications. If the faulty state occurs frequently, the operating system (OS) or hardware may be the problem.
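Because MAINTENANCE (3) can result from maintenance reports 5, 6, 9, 10, 19, or 20, it can help to check which of those reports were issued before the alarm. The following Python sketch illustrates the idea under stated assumptions: the input is a chronological list of MAINTENANCE report numbers that you have already extracted from your own logs, and the function name find_precursors is hypothetical.

```python
# Report numbers that the MAINTENANCE (3) entry lists as possible causes.
PRECURSORS_OF_MAINTENANCE_3 = (5, 6, 9, 10, 19, 20)


def find_precursors(report_numbers):
    """Return the listed precursor reports seen before the first MAINTENANCE (3).

    report_numbers is a chronological list of MAINTENANCE report numbers
    (a hypothetical input; how it is exported from the logs is site specific).
    An empty list is returned if MAINTENANCE (3) is not present.
    """
    if 3 not in report_numbers:
        return []
    earlier = report_numbers[: report_numbers.index(3)]
    return [number for number in earlier if number in PRECURSORS_OF_MAINTENANCE_3]


if __name__ == "__main__":
    # Example sequence: reports 20 and 6 were issued before MAINTENANCE (3).
    print(find_precursors([16, 20, 6, 3, 26]))
```

The reports returned by such a check point to the entries in this chapter whose corrective actions should be reviewed first.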
MAINTENANCE (4)
To troubleshoot and correct the cause of the alarm, refer to the "Mate Side has Become Faulty - Maintenance (4)" section.
DESCRIPTION | Mate Side has Become Faulty
SEVERITY | MAJOR
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Local State - STRING [30], Mate State - STRING [30], Reason - STRING [80], Probable Cause - STRING [80], Mate Ping - STRING [50]
PRIMARY CAUSE | Local side has detected the mate side going to faulty state.
PRIMARY ACTION | Display the event summary on the faulty mate side, using the report event-summary command (see the CLI Guide for command details).
SECONDARY ACTION | Review information in the event summary. This is usually a software problem.
TERNARY ACTION | After confirming the active side is processing traffic, restart software on the mate side. Log in to the mate platform as root user. Enter the platform stop command and then the platform start command.
SUBSEQUENT ACTION | If the software restart does not resolve the problem (that is, if the platform immediately goes faulty again or does not start), contact Cisco Technical Assistance Center (TAC). It may be necessary to reinstall software. If the problem occurs frequently, the OS or hardware may be the problem. Reboot the host machine, then reinstall and restart all applications. Rebooting brings down the other applications running on this machine. Contact Cisco TAC for assistance.
Refer to the "Obtaining Technical Assistance" section on page liv for detailed instructions on contacting Cisco TAC and opening a service request.
MAINTENANCE (5)
To troubleshoot and correct the cause of the alarm, refer to the "Changeover Failure - Maintenance (5)" section.
DESCRIPTION | Changeover Failure
SEVERITY | MAJOR
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Local State - STRING [30], Mate State - STRING [30]
PRIMARY CAUSE | Issued when changing from an active processor to a standby and the changeover fails.
PRIMARY ACTION | Review information from CLI log report.
SECONDARY CAUSE | This alarm is usually caused by a software problem on the specific platform identified in the alarm report.
SECONDARY ACTION | On the platform identified in this alarm report, restart the platform.
TERNARY ACTION | If platform restart is not successful, reinstall the application for this platform, and then restart the platform again.
SUBSEQUENT ACTION | If necessary, reboot the host machine this platform is located on. Then reinstall and restart all applications on this machine. If the faulty state occurs frequently, the OS or hardware may be the problem. Contact Cisco TAC for assistance. It may also be helpful to gather the event and alarm reports that were issued before and after this alarm report.
Refer to the "Obtaining Technical Assistance" section on page liv for detailed instructions on contacting Cisco TAC and opening a service request.
MAINTENANCE (6)
To troubleshoot and correct the cause of the alarm, refer to the "Changeover Timeout - Maintenance (6)" section.
DESCRIPTION | Changeover Timeout
SEVERITY | MAJOR
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Local State - STRING [30], Mate State - STRING [30]
PRIMARY CAUSE | The system failed to change over within the time period. Soon after this event is issued, one platform will go to faulty state.
PRIMARY ACTION | Review information from CLI log report.
SECONDARY CAUSE | This alarm is usually caused by a software problem on the specific platform identified in the alarm report.
SECONDARY ACTION | On the platform identified in this alarm report, restart the platform.
TERNARY ACTION | If platform restart is not successful, reinstall the application for this platform, and then restart the platform again.
SUBSEQUENT ACTION | If necessary, reboot the host machine this platform is located on. Then reinstall and restart all applications on this machine. If the faulty state occurs frequently, the OS or hardware may be the problem. Contact Cisco TAC for assistance. It may also be helpful to gather the event and alarm reports that were issued before and after this alarm report.
Refer to the "Obtaining Technical Assistance" section on page liv for detailed instructions on contacting Cisco TAC and opening a service request.
MAINTENANCE (7)
To troubleshoot and correct the cause of the alarm, refer to the "Mate Rejected Changeover - Maintenance (7)" section.
DESCRIPTION | Mate Rejected Changeover
SEVERITY | MAJOR
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Local State - STRING [30], Mate State - STRING [30]
PRIMARY CAUSE | Mate is not yet in stable state.
PRIMARY ACTION | Enter the status command to get information on the two systems in the pair (primary and secondary Element Management System (EMS), Call Agent (CA), or Feature Server (FS)).
SECONDARY CAUSE | Mate detects itself faulty during changeover and then rejects changeover. Note: This attempted changeover could be caused by a forced (operator) switch, or by the secondary instance rejecting changeover as the primary is being brought up.
SECONDARY ACTION | If mate is faulty (not running), then perform the corrective action steps listed for the MAINTENANCE (4) event.
TERNARY ACTION | If both systems (local and mate) are still running, diagnose whether both instances are operating in stable state (one in active and the other in standby). If both are in a stable state, wait 10 minutes and try the "control" command again.
SUBSEQUENT ACTION | If the standby side is not in stable state, bring down the standby side and restart software using the "platform stop" and "platform start" commands. If software restart does not resolve the problem, or if the problem occurs frequently, contact Cisco TAC. It may be necessary to reinstall software. Additional OS or hardware problems may also need to be resolved.
Refer to the "Obtaining Technical Assistance" section on page liv for detailed instructions on contacting Cisco TAC and opening a service request.
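The corrective actions for MAINTENANCE (7) form a short decision sequence. The following Python sketch restates that sequence as a function, purely as an aid to reading the entry. The function name and its three boolean inputs are assumptions for illustration; the operator determines their values from the status command output, and the sketch does not query the system. The final branch covers a case that the entry does not describe.

```python
def next_step_after_rejected_changeover(mate_running, active_stable, standby_stable):
    """Suggest the next corrective step for MAINTENANCE (7).

    All three arguments are booleans that the operator determines by hand
    (for example from the status command output); they are hypothetical
    inputs, not values read from the system.
    """
    if not mate_running:
        # SECONDARY ACTION: a faulty mate is handled as MAINTENANCE (4).
        return "Perform the corrective action steps listed for MAINTENANCE (4)."
    if active_stable and standby_stable:
        # TERNARY ACTION: both sides are stable, so retry later.
        return 'Wait 10 minutes and try the "control" command again.'
    if not standby_stable:
        # SUBSEQUENT ACTION: restart the standby side.
        return (
            'Bring down the standby side and restart it with "platform stop" '
            'and "platform start"; contact Cisco TAC if this does not help.'
        )
    # Assumption: an unstable active side is not covered by this entry.
    return "Not covered by this entry; contact Cisco TAC."


if __name__ == "__main__":
    print(next_step_after_rejected_changeover(True, True, False))
```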
MAINTENANCE (8)
To troubleshoot and correct the cause of the alarm, refer to the "Mate Changeover Timeout - Maintenance (8)" section.
DESCRIPTION | Mate Changeover Timeout
SEVERITY | MAJOR
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Local State - STRING [30], Mate State - STRING [30]
PRIMARY CAUSE | Faulty mate.
PRIMARY ACTION | Review information from CLI log report concerning faulty mate.
SECONDARY ACTION | This alarm is usually caused by a software problem on the specific mate platform identified in the alarm report.
TERNARY ACTION | On the mate platform identified in this alarm report, restart the platform.
SUBSEQUENT ACTION | If mate platform restart is not successful, reinstall the application for this mate platform, and then restart the mate platform again. If necessary, reboot the host machine this mate platform is located on. Then reinstall and restart all applications on that machine.
Refer to the "Obtaining Technical Assistance" section on page liv for detailed instructions on contacting Cisco TAC and opening a service request.
MAINTENANCE (9)
To troubleshoot and correct the cause of the alarm, refer to the "Local Initialization Failure - Maintenance (9)" section.
DESCRIPTION | Local Initialization Failure
SEVERITY | MAJOR
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Local State - STRING [30], Mate State - STRING [30]
PRIMARY CAUSE | Local initialization has failed.
PRIMARY ACTION | When this event report is issued, the system has failed and the re-initialization process has failed.
SECONDARY ACTION | Check that the binary files are present for the unit (Call Agent, Feature Server, Element Manager).
TERNARY ACTION | If the files are not present, then re-install the files from initial or backup media. Then restart the failed device.
MAINTENANCE (10)
To troubleshoot and correct the cause of the alarm, refer to the "Local Initialization Timeout - Maintenance (10)" section.
DESCRIPTION | Local Initialization Timeout
SEVERITY | MAJOR
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Local State - STRING [30], Mate State - STRING [30]
PRIMARY CAUSE | Local initialization has timed out.
PRIMARY ACTION | Check that the binary files are present for the unit (Call Agent, Feature Server, or Element Manager).
SECONDARY CAUSE | When the event report is issued, the system has failed and the re-initialization process has failed.
SECONDARY ACTION | If the files are not present, then re-install the files from initial or backup media. Then restart the failed device.
MAINTENANCE (11)
For additional information, refer to the "Switchover Complete - Maintenance (11)" section.
DESCRIPTION | Switchover Complete
SEVERITY | INFO
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Local State - STRING [30], Mate State - STRING [30]
PRIMARY CAUSE | Acknowledges that the changeover successfully completed.
PRIMARY ACTION | This is an informational event report; no further action is required.
MAINTENANCE (12)
For additional information, refer to the "Initialization Successful - Maintenance (12)" section.
DESCRIPTION | Initialization Successful
SEVERITY | INFO
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Local State - STRING [30], Mate State - STRING [30]
PRIMARY CAUSE | Indicates that a local initialization has completed successfully.
PRIMARY ACTION | This is an informational event report; no further action is required.
MAINTENANCE (13)
For additional information, refer to the "Administrative State Change - Maintenance (13)" section.
DESCRIPTION | Administrative State Change
SEVERITY | INFO
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Facility Type - STRING [40], Facility ID - STRING [40], Initial Admin State - STRING [20], Target Admin State - STRING [20], Current Admin State - STRING [20]
PRIMARY CAUSE | The administrative state of a managed resource has changed.
PRIMARY ACTION | No action is required, since this informational event report is given after manually changing the administrative state of a managed resource.
MAINTENANCE (14)
For additional information, refer to the "Call Agent Administrative State Change - Maintenance (14)" section.
DESCRIPTION | Call Agent Administrative State Change
SEVERITY | INFO
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Call Agent ID - STRING [40], Current Local State - STRING [40], Current Mate State - STRING [20]
PRIMARY CAUSE | Indicates that the call agent has changed operational state as a result of a manual switchover (control command in CLI).
PRIMARY ACTION | No action is required.
MAINTENANCE (15)
For additional information, refer to the "Feature Server Administrative State Change - Maintenance (15)" section.
DESCRIPTION | Feature Server Administrative State Change
SEVERITY | INFO
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Feature Server ID - STRING [40], Feature Server Type - STRING [40], Current Local State - STRING [20], Current Mate State - STRING [20]
PRIMARY CAUSE | Indicates that the feature server has changed operational state as a result of a manual switchover (control command in CLI).
PRIMARY ACTION | No action is required.
MAINTENANCE (16)
For additional information, refer to the "Process Manager: Starting Process - Maintenance (16)" section.
DESCRIPTION | Process Manager: Starting Process
SEVERITY | INFO
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Process Name - STRING [40], Restart Type - STRING [40], Restart Mode - STRING [32], Process Group - ONE_BYTE
PRIMARY CAUSE | A process is being started as the system is being brought up.
PRIMARY ACTION | No action is required.
MAINTENANCE (17)
For additional information, refer to the "Invalid Event Report Received - Maintenance (17)" section.
DESCRIPTION | Invalid Event Report Received
SEVERITY | INFO
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Report Type - TWO_BYTES, Report Number - TWO_BYTES, Validation Failure - STRING [30]
PRIMARY CAUSE | Indicates that a process has sent an event report that cannot be found in the database.
PRIMARY ACTION | If a short burst of these event reports is issued during system initialization, prior to the database initialization, then these event reports are informational and can be ignored.
SECONDARY ACTION | Otherwise, contact Cisco TAC for more information.
Refer to the "Obtaining Technical Assistance" section on page liv for detailed instructions on contacting Cisco TAC and opening a service request.
MAINTENANCE (18)
To troubleshoot and correct the cause of the alarm, refer to the "Process Manager: Process has Died - Maintenance (18)" section.
DESCRIPTION | Process Manager: Process has Died
SEVERITY | MINOR
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Process Name - STRING [40], Process Group - FOUR_BYTES
PRIMARY CAUSE | Software problem.
PRIMARY ACTION | If the problem persists, contact Cisco TAC.
Refer to the "Obtaining Technical Assistance" section on page liv for detailed instructions on contacting Cisco TAC and opening a service request.
MAINTENANCE (19)
To troubleshoot and correct the cause of the alarm, refer to the "Process Manager: Process Exceeded Restart Rate - Maintenance (19)" section.
DESCRIPTION | Process Manager: Process Exceeded Restart Rate
SEVERITY | MAJOR
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Process Name - STRING [40], Restart Rate - FOUR_BYTES, Process Group - ONE_BYTE
PRIMARY CAUSE | This alarm is usually caused by a software problem on the specific platform identified in the alarm report. Soon after this event is issued, one platform will go to faulty state.
PRIMARY ACTION | Review information from CLI log report.
SECONDARY ACTION | On the platform identified in this alarm report, restart the platform.
TERNARY ACTION | If platform restart is not successful, reinstall the application for this platform, and then restart the platform again.
SUBSEQUENT ACTION | If necessary, reboot the host machine this platform is located on. Then reinstall and restart all applications on this machine.
MAINTENANCE (20)
To troubleshoot and correct the cause of the alarm, refer to the "Lost Connection to Mate - Maintenance (20)" section.
DESCRIPTION | Lost Connection to Mate
SEVERITY | MAJOR
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Mate Ping - STRING [50]
PRIMARY CAUSE | Network interface hardware problem.
PRIMARY ACTION | Check whether the network interface is down. If so, restore the network interface and restart the software.
SECONDARY CAUSE | Router problem.
SECONDARY ACTION | If there is a router problem, repair the router and reinstall.
TERNARY CAUSE | Soon after this event is issued, one platform may go to faulty state.
MAINTENANCE (21)
To troubleshoot and correct the cause of the alarm, refer to the "Network Interface Down - Maintenance (21)" section.
DESCRIPTION | Network Interface Down
SEVERITY | MAJOR
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | IP Address - STRING [50]
PRIMARY CAUSE | Network interface hardware problem.
PRIMARY ACTION | The system subsequently goes faulty.
SECONDARY CAUSE | Soon after this event is issued, one platform may go to faulty state.
SECONDARY ACTION | Check whether the network interface is down. If so, restore the network interface and restart the software.
MAINTENANCE (22)
For additional information, refer to the "Mate is Alive - Maintenance (22)" section.
DESCRIPTION | Mate is Alive
SEVERITY | INFO
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Local State - STRING [30], Mate State - STRING [30]
MAINTENANCE (23)
To troubleshoot and correct the cause of the alarm, refer to the "Process Manager: Process Failed to Complete Initialization - Maintenance (23)" section.
DESCRIPTION | Process Manager: Process Failed to Complete Initialization
SEVERITY | MAJOR
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Process Name - STRING [40], Process Group - ONE_BYTE
PRIMARY CAUSE | The specified process failed to complete initialization during the restoral process.
PRIMARY ACTION | Verify that the specified process's binary image is installed. If not, install it and restart the platform.
MAINTENANCE (24)
To troubleshoot and correct the cause of the alarm, refer to the "Process Manager: Restarting Process - Maintenance (24)" section.
DESCRIPTION | Process Manager: Restarting Process
SEVERITY | MINOR
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Process Name - STRING [40], Restart Type - STRING [40], Restart Mode - STRING [32], Process Group - ONE_BYTE
PRIMARY CAUSE | Software problem; a process has exited abnormally and had to be restarted.
PRIMARY ACTION | If the problem persists, contact Cisco TAC.
Refer to the "Obtaining Technical Assistance" section on page liv for detailed instructions on contacting Cisco TAC and opening a service request.
MAINTENANCE (25)
For additional information, refer to the "Process Manager: Changing State - Maintenance (25)" section.
DESCRIPTION | Process Manager: Changing State
SEVERITY | INFO
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Platform State - STRING [40]
MAINTENANCE (26)
To troubleshoot and correct the cause of the alarm, refer to the "Process Manager: Going Faulty - Maintenance (26)" section.
DESCRIPTION | Process Manager: Going Faulty
SEVERITY | MAJOR
THRESHOLD | 100
THROTTLE | 0
DATAWORDS | Reason - STRING [40]
PRIMARY CAUSE | The system has been brought down, or the system has detected a fault.
PRIMARY ACTION | If this is not due to the operator intentionally bringing down the system, then the platform has detected a fault and has shut down. This is typically followed by MAINTENANCE (3). Use the corrective action procedures provided for MAINTENANCE (3).
MAINTENANCE (27)
For additional information, refer to the "Process Manager: Changing Over to Active - Maintenance (27)" section.
DESCRIPTION | Process Manager: Changing Over to Active
SEVERITY | INFO
THRESHOLD | 100
THROTTLE | 0
MAINTENANCE (28)
For additional information, refer to the "Process Manager: Changing Over to Standby - Maintenance (28)" section.
DESCRIPTION | Process Manager: Changing Over to Standby
SEVERITY | INFO
THRESHOLD | 100
THROTTLE | 0
MAINTENANCE (29)
To monitor and correct the cause of the event, refer to the "Administrative State Change Failure - Maintenance (29)" section.