|
Your chassis went through extensive testing and burn-in before leaving the factory; however, if your system appears to have problems starting up, use the information in this chapter to help isolate the problem. This chapter also includes the environmental reporting features of the CSC-ENVM card and discusses how to use them with your chassis.
This chapter includes the following:
Typically, any problems encountered at installation startup will be related either to the chassis or to the network. Use the information in this chapter to help isolate problems and to rule out the chassis as the source of the problem. For additional troubleshooting assistance, refer to the optional Troubleshooting Internetworking Systems publication.
Before you call a customer service representative, have the following information ready:
The key to problem solving in a system such as this is to try to isolate the problem to a specific subsystem. By comparing what the system is doing to what it should be doing, the task of isolating a problem is greatly simplified.
Consider the following subsystems:
The following troubleshooting sections discuss these subsystems in more detail.
Check the following items to help isolate the problem:
Check the following items to help isolate the problem:
Time Saver In general, if a particular port will not initialize, and the LED indicator on the interface card is lit, the problem may be the applique or internal cabling; however, to save time during the troubleshooting process, check the more accessible interface card before checking the applique. |
The CSC-ENVM card (ENVM) not only provides nonvolatile and Multibus memory, but it also provides the environmental monitoring required for the chassis. This card checks input and exhaust air temperatures and backplane-voltage levels. If the ambient temperature exceeds a given range and threshold, or if intake airflow is restricted, or if any of the voltages drift away from the factory settings, an error is detected, and the ENVM shuts down all functions in the chassis. (The blower will continue to run.) Power is returned by turning off the main breaker switch, waiting 20 to 30 seconds, and then turning on the main breaker switch. (This is called toggling the breaker switch.)
The system runs a routine that polls the ENVM for voltage and temperature readings. The system obtains these statistics from the ENVM at a rate of once a minute and stores them in a buffer. You can display the contents of this buffer with the show environment command. The resulting display contains the current intake and exhaust air temperatures, shutdown thresholds, and voltage measurements of DC lines. Following is sample output of the show environment command:
Router# show environment
Environmental controller firmware version 2.2
Serial number is 00225259, calibrated on 5-10-93, by technician mpo
Internal temperature measured 27.0(C), shuts down at 43.0(C)
Air flow appears good.
+ 5 volt line measured at 5.061(V)
+12 volt line measured at 12.070(V)
-12 volt line measured at -12.034(V)
-5 volt line measured at -4.915(V)
Router#
If a measurement exceeds desired margins, but has not exceeded fatal margins, a warning message is printed to the system console. The system software queries the ENVM for measurements once a minute, but warnings for a given test point are printed to the screen at most once every four hours. If a measurement exceeds the desired margins at any time within a four-hour period, an automatic warning message appears on the console screen. As noted, the user may query the ENVM using the show environment command at any time to determine if a measurement is at the warning level. Following is an example of the message displayed on the console screen when a measurement has exceeded an acceptable margin:
Router#
ENVIRONMENTAL WARNING: Airflow appears marginal.
Following is an example of a message displayed on the console screen when an ambient temperature measurement has exceeded an acceptable margin, and the temperature reading is given:
Router#
ENVIRONMENTAL WARNING: Internal temperature measured 41.3(C)
Following is an example of a message displayed on the console screen when a voltage measurement has exceeded an acceptable margin, and the voltage reading is given:
Router#
ENVIRONMENTAL WARNING: +5 volt testpoint measured 5.310(V)
The system displays the following type of message if voltage or temperature exceeds maximum or fatal thresholds:
Router#
SHUTDOWN: air flow problem
If a test point reaches the fatal threshold, the ENVM performs as follows:
Consider the following example-system conditions:
Step 1 Determine if the error message indicates an air temperature or airflow problem. If it is air temperature, go to Step 2. If it is airflow, go to Step 3.
Step 2 For suspected air temperature problems, answer the following questions:
Is the ambient room temperature less than 40°C at all times?
Is the temperature of the air going into the air intake vent less than 40°C?
Is there sufficient airflow through the chassis?
Is there sufficient airflow within the rack or wiring closet?
Is the front access panel of the chassis in place?
Is the air intake vent of the chassis isolated from the exhaust air of all other chassis?
The appropriate response for the questions in Step 3 should be yes. If not, correct the problem or contact a customer service representative for additional assistance.
Step 3 For suspected airflow problems, answer the following questions:
Is there insufficient air going through the chassis?
Is there any blockage in or around the air intake vent?
Is there any blockage in or around the air exhaust vent?
Are there multiple units installed in this rack configuration? If the answer is yes, will turning off some or all of the other chassis (if possible) allow the chassis to continue running?
Is there an air pressure differential between the air intake and exhaust vents?
Is the exhaust air of the chassis recirculating to its own air intake port?
The appropriate response for the questions in Step 3 should be no. If not, and if a problem cannot be resolved using these guidelines, contact a customer service representative for additional assistance.
To help determine why the ENVM told the system to shut down, use the show environment last command. (Refer to the section "Displaying the Reason for the Last Environmental Shutdown.")
If an ENVM-induced shutdown occurs, the ENVM logs the last measured value from each of the six test points to internal nonvolatile random access memory (NVRAM). Only the last set of these six measurements is stored at any one time. These measurements can be printed to the console using the show environment last command.
Following is sample output of the show environment last command:
Router# show environment last
Environmental controller firmware version 2.2
Serial number is 00225259, calibrated on 5-10-93, by technician mpo
Internal temperature measured 27.0(C), shuts down at 43.0(C)
Air flow appears good.
+ 5 volt line measured at 5.061(V)
+12 volt line measured at 12.070(V)
-12 volt line measured at -12.034(V)
-5 volt line measured at -4.915(V)
LAST Environmental Shutdown Measurements:
Internal temperature was 24.0(C)
Air flow sensor was good
+5 volt line was 4.990(V)
+12 volt line was 9.900(V)*
-12 volt line was -11719(V)
-5 volt line was -4.926(V)
Router#
This section describes how to test for problems with system memory using the bootstrap program. This program can help you isolate or rule out hardware problems encountered when installing your chassis. The diagnostic tests for the chassis help initialize the processor hardware and boot the main operating system software.
Caution Before proceeding, read through this procedure and the "Processor Card Configuration Register" section in Chapter 5. If you have any doubts about your ability to perform any part, contact a customer service representative for information on how to proceed. |
If you remove the jumpers from the boot file number field (bits 3, 2, 1, and 0) of the configuration register, you can start the system in standalone bootstrap mode. The bootstrap mode prompt is
an angle bracket (>). Once in bootstrap mode, enter ? to display a list of available commands and options.
Following is a sample menu display from a system with a CSC/4 processor card (slightly different with the CSC/3):
> ?
$ state Toggle cache state (? for help)
B [filename] Bootload filename and start it
C [address] Continue [optional address]
D /S M L V Deposit value V of size S into location L with modifier M
E /S M L Examine location L with size S with modifier M
G [address] Start up execution
H Offer help with commands
I Initialize
K Stack trace
L filename Bootload filename, but do not start it
O Show configuration register option settings
P Set break point
S Single step next instruction
T function Test device (? for help)
Deposit and Examine sizes may be B (byte), L (long) or S (short).
Modifiers may be R (register) or S (byte swap).
Following are some of the most useful bootstrap commands:
M Memory test
P Probe IO/memory space
By default, the memory test examines onboard memory on the processor and memory cards. You can test NVRAM on the CSC-ENVM by supplying starting and ending addresses as described in Table 4-1. The test is destructive to data in NVRAM. The CSC-MC or Flash memory cannot be tested in this way.
Test | CSC/4 and CSC/3 |
---|---|
CSC-ENVM 64K | Start: 2000000 |
System memory | End: 200FFFF |
CSC-ENVM 64K | Start: 20B0000 |
Nonvolatile memory | End: 20BFFFF |
Use the Probe I/O memory space test to look at the mapped I/O space in the main memory and test for interface card responses. The test starts at address 2000000. You can specify probe increments in order to look for specific regions. The test uses the probe increments to search for registers that can be read from and written to. See the examples following.
Follow these steps to run the bootstrap diagnostics:
Step 1 Turn OFF power to the chassis and attach appropriate ESD protection.
Step 2 Remove the front access cover following the appropriate steps in the section "Opening the Chassis" in Chapter 5.
Caution Before proceeding, read the section "Processor Card Configuration Register" in Chapter 5 to familiarize yourself with the configuration register jumpers on the processor card. |
Step 3 Remove the jumpers from the boot file number field (bits 3, 2, 1, and 0) of the processor configuration register. (Note jumper positions before removal.)
Step 4 Turn ON power to the chassis. Wait for the system to display the banner message and prompt you with an angle bracket (>).
To test memory, enter the following at the > prompt and then press the Return key:
t m
To use the default addresses and select the default tests, press the Return key after each prompt displayed, as in the following sample of the Memory/Bus Diagnostic on the CSC/4 processor card.
Caution If you want to preserve your NVRAM configuration files, do not use the following test between the nonvolatile memory addresses listed in Table 4-1 because the test will overwrite the configuration files in NVRAM. |
Memory/Bus diagnostic
Starting Address [1000]?
Ending Address [400000]?
Hex argument for variable tests [FFFF]?
Select Tests [all]?
Number of passes to run [2]?
Message Level (0=silence, 1=summary, 2=normal)[2]?
Testing addresses between 0x1000 and 0x400000
Begin pass 0, test 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 End pass
Begin pass 1, test 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 End pass
No errors during 2 passes
To test I/O space, enter the following at the prompt (>) and press the Return key:
t p
The memory probe begins at address 2000000. You can specify probe increments. Table 4-2 lists the times required to run the memory tests.
Memory Probe Test Times | |||
---|---|---|---|
Processor Card | Memory Test Times | Probe Increment = 1 | Probe Increment1 = 10 |
CSC/4 | ~11 minutes2 | ~20 seconds | ~2 seconds |
CSC/3 | ~8 minutes | ~30 seconds | ~1 seconds |
The following example shows a test done with probe increments of 1:
IO/memory space probe
Starting address [2000000]?
Ending address [210FFFF]?
Probe increment (in shorts) [1]?
Probing from 0x2000000 to 0x210FFFF with interval 0x1
Region 0x210C000 to 0x210C07E exists (0K)
Done
>
The following example shows a test done with probe increments of 10:
IO/memory space probe
Starting address [2000000]?
Ending address [210FFFF]?
Probe increment (in shorts) [1]? 10
Probing from 0x2000000 to 0x210FFFF with interval 0x10
Region 0x210C000 to 0x210C060 exists (0K)
Done
>
|