Contents Safety Notices . . xi Electrical Safety . . xi Laser Safety Information . xiii Data Integrity and Verification . . xv About This Book . . xvii ISO 9000 . . xvii Online Publications . . xvii Related Publications . .
Page 4
Tape Cartridge Data Efficiency . 27 Using the 5.0GB 8-mm Tape Drive . . 28 Status Lights . 28 Status Light States . 28 Loading the 8-mm Tape Cartridge . 29 Unloading the 8-mm Tape Cartridge . . 30 Cleaning the Tape Path on the 5.0GB 8-mm Tape Drive . .
Page 5
Menu Inactivity . . 60 General User Menus . . 61 Privileged User Menus . . 63 Main Menu . . 63 Service Processor Setup Menu . . 65 Passwords . . 66 System Power Control Menu . . 69 System Information Menu .
Page 6
Chapter 5. Using the Online and Standalone Diagnostics . . 107 Standalone and Online Diagnostics Operating Considerations . . 107 Selecting a Console Display . . 107 Identifying the Terminal Type to the Diagnostics . 108 Undefined Terminal Types. . 108 Running Standalone Diagnostics .
Page 7
Display Microcode Level . . 154 Display or Change Bootlist . 154 Display or Change Diagnostic Run Time Options . . 154 Display Previous Diagnostic Results . . 155 Display Resource Attributes . . 155 Display Service Hints . . 155 Display Software Product Data .
Page 8
Chapter 8. Hardware Problem Determination . 177 Problem Determination Using the Standalone or Online Diagnostics . . 177 Step 1. Considerations before Running This Procedure . . 177 Step 2 . . 177 Step 3 . . 178 Step 4 . .
Page 9
. 220 Examples For Using the Generic Sample Modem Configuration Files . . 221 Customizing the Modem Configuration Files . . 221 IBM 7852-400 DIP Switch Settings . . 222 Terminal Emulators . . 222 Recovery Procedures . . 222 Seamless Transfer of a Modem Session .
Safety Notices A danger notice indicates the presence of a hazard that has the potential of causing death or serious personal injury. Danger notices appear on the following pages: v xi v xii A caution notice indicates the presence of a hazard that has the potential of causing moderate or minor personal injury.
Page 12
CAUTION: This product is equipped with a three–wire power cable and plug for the user’s safety. Use this power cable with a properly grounded electrical outlet to avoid electrical shock. DANGER To prevent electrical shock hazard, disconnect the power cable CAUTION: This unit has more than one power supply cord.
Laser Safety Information The optical drive in this system unit is a laser product. The optical drive has a label that identifies its classification. The label, located on the drive, is shown below. CLASS 1 LASER PRODUCT LASER KLASSE 1 LUOKAN 1 LASERLAITE APPAREIL A LASER DE CLASSE 1 IEC 825:1984 CENELEC EN 60 825:1991...
Web site at: http://www.rs6000.ibm.com/resource/hardware_docs/ Related Publications The following publications are available: v The Enterprise Server S80 Installation Guide, SA38-0582, contains information on how to set up and cable the system, install and remove options, and verify system operation.
Trademarks The following terms are trademarks of International Business Machines Corporation in the United States, other countries, or both: v AIX v IBM v RS/6000 v SP Other company, product, and service names may be trademarks or service marks of others.
Chapter 1. System Introduction The Enterprise Server S80 and p Series 680 Model S85 systems are 64-bit, symmetric multiprocessing (SMP) enterprise servers. The systems provide the power, capacity and expandability to support evolution of your business into the 64-bit computing environment while still supporting the existing 32-bit applications.
System Rack The System Rack supports a minimum of one processor card and a maximum of four processor cards. Each processor card has six 64-bit processors. A maximum of 24 processors may be installed in the system, which shares a common system memory. The system memory is controlled through a multi-port controller, which supports up to 16 memory slots.
I/O Rack Each I/O rack holds up to two I/O Drawers. Each I/O Drawer provides up to 14 PCI adapters per drawer. The primary I/O Drawer (Drawer 0) reserves slots for support of system media, service processor and hot-pluggable DASD bays resident in the drawer. For information about PCI adapter slot placement, see PCI Adapter Placement Reference , (SA38-0538).
I/O Drawers Your system has a 10 EIA Unit I/O Drawer as described below. 10 EIA Unit I/O Drawer The 10 EIA Unit I/O Drawer features hot-pluggable disk drive banks shown in the following figure. Disk drive banks allow system users to insert and remove disk drives without performing a power down of the system which increases the availability of the server.
Input/Output Connectors 10 EIA Unit I/O Drawer The 10 EIA Unit I/O Drawer Input/Output connectors are shown in the following figure. 1. PCI adapter slots (1 - 7) 2. PCI adapter slot 8, service processor card and JTAG cable in primary I/O drawer (drawer 0) 3.
Page 25
13. SPCN2 connector 14. SPCN1 connector 15. Serial port S2 16. Serial port S1 17. Operator panel (OP) connector 18. RIO 0 and RIO 1 connectors 19. Power cord connector for left power supply 20. Left power supply, power good LED 21.
Chapter 2. Using the System System power control and use of options such as the keyboard, mouse, and drives supported by the server are discussed in this section. Operator Panel and Power Control The following diagram shows the locations of the operator panel display and the operator panel.
Powering On the System Perform the following steps to power on your system rack and connected I/O Drawers: 1. Open the operator panel cover on the system rack. 2. Press the Power (white) pushbutton on the operator panel. 3. The Power On light on the operator panel starts blinking as the system is powered on.
Page 29
6. Press the Power pushbutton (white) on the operator panel. The operator panel display shows 08 O ? (the international power-off symbol) with the ? blinking. 7. Press the Power pushbutton (white) on the operator panel again. The system powers off, and the Power On light on the system rack operator panel display goes off and remains off.
POST Indicators During the power-on sequence, a series of POST indicators appear on the display. The following indicators are displayed on the screen. v Memory v Keyboard v Network v SCSI v Speaker If the POST completes successfully, the text displays OK. The following list describes each of the displayed POST indicators: Memory Memory test.
POST Keys Numerical 1 Key The numerical 1 key, when pressed during POST, invokes the System Management Services (SMS) interface. Numerical 5 Key The numerical 5 key, when pressed during POST, invokes the default boot list mode, located in firmware. This mode attempts to boot from the first device of each type found in the list.
Console Strategy The firmware invokes a console selection sequence at system boot time if any of the following are true: v A console has not yet been selected v A previous console selection sequence timed out v A change in the system configuration affecting the console (keyboard installed/removed, mouse installed/removed, graphics adapter installed/removed or moved to another PCI slot).
I/O Drawer Indicator Panels This section describes the indicator panel located on the left front of the 10 EIA Unit I/O Drawer. 10 EIA Unit I/O Drawer Indicator Panel Power On LED Drawer Indicator Panel Display 10 EIA Unit I/O Drawer and Power Supply LED Status State of LED Operator Panel LED Right Power Supply...
Page 34
10 EIA Unit I/O Drawer Indicator Panel Status Drawer State Drawer Indicator Panel After Connecting AC Power Blinking location U?.? Standby Blinking Blinking location U?.? System power On solid On solid location Ux.y# Receive Firmware Command On solid On solid location Ux.y# Thereafter On solid After Power is Removed Because of a Shutdown...
Using the Keyboards There are several keyboards available for the system unit. The keyboards have various keys that enter data and control the cursor location. The keyboards can be engraved for the languages of different countries. The functions of each keyboard depend on the software used. The character sets for the keyboards are contained and explained in the documentation for your operating system.
Page 36
On all of the keyboards, you can adjust the tilt position for typing comfort. To tilt the keyboard, pull out on the keyboard legs. The legs snap into position. To decrease the tilt of the keyboard, rotate the keyboard legs until they snap into the bottom of the keyboard case.
Using the Three-Button Mouse The mouse is a hand-operated locating device. A three-button mouse is available for use with the system unit. Consult your application documentation for the exact use of the three-button mouse. You can use the mouse to perform such functions as positioning a cursor, selecting items from a menu, or moving around in your document much easier and faster than if you used only the keyboard.
The mouse has a cable that plugs into the mouse connector at the rear of the primary I/O drawer. Handling the Mouse Correctly For best operation, handle the mouse with care. Incorrect handling can damage the mouse. Do not: v Operate the mouse on cloth, unfinished wood, newspaper, or carpet. v Drop or hit the mouse.
Cleaning the Mouse Use the following procedure to clean the mouse: 1. Remove the retaining ring by turning it counterclockwise. Retaining Ring Ball Cavity 2. Remove the ball. 3. Inspect the ball for contaminants. Wipe it clean with a dry, lint-free cloth. 4.
Using the 3.5-Inch Diskette Drive Diskette Compatibility The system unit has a 1.44MB diskette drive installed. The 1.44MB diskette drive can format, read, and write diskettes compatible with the following diskette drives: v 1.0MB diskettes with 720KB formatted data capacity. v 2.0MB diskettes with 1.44MB formatted data capacity (HD).
v To allow writing onto a diskette, slide the write-protect tab to cover the protect slot. (Slot Closed) Write-Protect Tab Loading and Unloading the 3.5-Inch Diskette To load a diskette into the drive, insert the diskette in the diskette drive with the labeled metal shutter first.
General Information for 5.0GB 8-mm Tape Drive If your system has a 5.0GB 8-mm tape drive, review the following for recommendations on care and cleaning of your tape drive. Recommendations v Use only Data Grade 8-mm tape cartridges. These cartridges are identified by either a Data, D, or D8 marking on the data cartridge.
Tape Cartridge Compatibility The 8-mm tape drive is compatible with existing 8-mm tape subsystems that comply to the American National Standard (ANSI) X3B5/89-136, Helical-scan Digital Computer Tape Cartridge, 8-mm for Information Exchange. Data compression effectively increases the cartridge capacity and data transfer rate for the 5.0GB tape cartridges. Note: 160 meter cartridges are not processed and are automatically ejected by the drive.
Environmental Considerations for 8-mm Data Cartridges Information in this section describes operating and storage conditions including temperature, relative humidity, and maximum wet-bulb data. Attention: The manufacturer has specified a set of temperature and humidity ranges in which the 8-mm data cartridge can operate with ease. Only regular cleaning procedures are required when operating the cartridge within this range.
Tape Cartridge Data Efficiency The 8-mm tape cartridge efficiency is defined as the amount of data that can be stored on the cartridge. The following variables affect the amount of data that can be stored on a tape cartridge: v Size of the data file v Number of file marks per file v File mark size used v Compatibility mode (2.3GB or 5.0GB)
Using the 5.0GB 8-mm Tape Drive The optional 5.0GB 8-mm tape drive is a half-height device. Status Lights The 5.0GB 8-mm tape drive has two green status lights and one amber status light. The on and off combinations of the status lights indicate the conditions of the 8-mm tape drive.
Note: If a fault or an error condition occurs, first try to recover by pressing the unload button. If this does not correct the fault, switch off the power to the 8-mm tape drive, and then switch on the power to the drive. If the condition continues, call your service representative.
Unloading the 8-mm Tape Cartridge Before performing the unload operation, make sure the power to the 8-mm tape drive is To unload and eject the tape cartridge, press the unload button. The 8-mm tape drive rewinds the tape and then ejects the tape cartridge from the tape drive. After you press the unload button, the following occurs: 1.
Page 49
error and ejects the cleaning cartridge. The disturbance status light (amber) remains on if it was on prior to the cleaning operation being attempted. Some video cleaning cartridges are extremely abrasive. An 8-mm tape drive may be permanently damaged after only a few cleaning operations using an abrasive-type cleaning cartridge.
General Information for 4.0GB 4-mm Tape Drive The 4.0GB 4-mm tape drive is a streaming tape drive that is used to: v Save and restore system data files. v Archive important records. v Distribute operating system software upgrades. The 4-mm tape drive can be attached to any system using a single-ended interface meeting the Small Computer System Interface ll (SCSI-ll) Standard ANSI X3.131-199X Rev.
Types of 4-mm Tape Cartridges The 4-mm tape drive is shipped with three 4-mm cartridges to help start your tape operations immediately. 4-mm Data Cartridge Use this non-labeled cartridge for saving or restoring your programs or data. Additional data cartridges can be ordered. 4-mm Diagnostic Cartridge Use this specially labeled cartridge to perform diagnostics on the drive.
Setting the Write-Protect Tab on 4-mm Tape Cartridges The window on the tape cartridge controls write-protection. When the write-protect tab of a tape cartridge is set (window open), information can be read from the tape, but cannot be written to it. When the write-protect tab of a tape cartridge is not set (window closed), information can be both written to and read from the tape.
Operating in Harsh Environments The 4-mm tape drive is ideally suited to streaming operations, as opposed to tape movement operations involving multiple stop/starts and random searches. When the tape is used for frequent stop and start operations, streaming movement is beneficial and should be used whenever possible.
Using the 4.0GB 4-mm Tape Drive The optional 4.0GB 4-mm tape drive is a half-hight device. Status Lights The 4.0GB 4-mm tape drive has two green status lights and one amber status light. The on and off combinations of the status lights indicate the conditions of the 4-mm tape drive.
Page 55
Status Lights on the 4.0GB 4-mm Tape Drive Status Ready Read-Write Disturbance (green) (green) (amber) LED test On 2 On 2 On 2 seconds at seconds at seconds at power on power on power on The power-on self-test (POST) is running or the Flashing Diagnostic Cartridge is running.
Loading the 4-mm Tape Cartridge Before loading the tape cartridge, make sure the power is on and the write-protect switch on the tape cartridge is properly set. Refer to “Setting the Write-Protect Tab on 4-mm Tape Cartridges” on page 34. The tape drive loads the tape from the cartridge and prepares it for reading and writing.
Unloading the 4-mm Tape Cartridge Before performing the unload operation, make sure the power to the 4-mm tape drive is To unload and eject the tape cartridge, press the unload button. The 4-mm tape drive rewinds the tape and then ejects the tape cartridge from the tape drive. After you press the unload button, the following occurs: 1.
Cleaning the Tape Path on the 4.0GB 4-mm Tape Drive The 4-mm tape path should be cleaned either approximately every 30 hours of tape motion or once a month, whichever occurs first. The 4-mm tape drive monitors the recording quality of the tape cartridge and indicates that the tape path requires cleaning when the disturbance status light (amber) is on.
Page 59
After the 4-mm cleaning cartridge has been fully inserted into the 4-mm tape drive, the following cleaning operations are performed automatically: 1. The cleaning tape is loaded into the tape path. 2. The drive is cleaned by moving the cleaning tape forward for approximately 30 seconds.
Using the CD-ROM Drive CAUTION: A Class 3 laser is contained in the device. Do not attempt to operate the device while it is disassembled. Do not attempt to open the covers of the device, as it is not serviceable and is to be replaced as a unit. The CD-ROM is located in bay A of the I/O Drawer.
Cleaning the CD-ROM Drive This CD-ROM drive has an internal head-cleaning mechanism, and therefore does not require an external cleaning device. The internal cleaning mechanism cleans the head every time the tray is operated. Always handle discs carefully by the edges to avoid leaving fingerprints or scratching the disc.
Using the Hot Swap Disk Drives The following describes how to install or remove hot-swap disk drives. v For information on installing a SCSI disk drive, go to page 48. v For information on removing a SCSI disk drive, go to page 47. Relationship of AIX Prompts and Physical Drive Location A SCSI adapter and a SCSI drive address can be displayed on a user display.
Handling Guidelines The hot-swap disk drive is a sensitive device. Handle the hot-swap carrier and disk drive with care. v Do not drop the disk drive or subject the drive to excessive shock. v Do not expose the disk drive to temperatures lower than -40° F (-40° C) or higher than 158 °...
Disk Drive Status Light States The following table explains the meaning of the green and amber status lights and spin down button. Spin Down Button Green Amber SCSI Disk Drive Status Lights Light or Button Status Definition Amber Drive spinning Drive not spinning Green Power On...
Removing and Inserting Disk Drives This section provides instructions to help you remove or insert SCSI hot-swap disk drives into hot-swap bays. Note: You do not have to power down the system to remove a hot-swappable assembly from the 10 EIA Unit I/O Drawer. Before you perform these procedures, take the appropriate actions to back up the data for the drive you are removing, and that the drive has been removed from the configuration.
Inserting a Disk Drive into the Hot-Swap Bays This procedure describes how to insert disk drives into a hot-swap bay while the I/O Drawer is powered on. Attention: Caution should be used when handling all hard drives. Drives are more likely to be damaged during installation and service.
Page 67
3. Each hot-swap disk drive must have a hot-swap disk drive carrier attached. Attention: Handle the hot-swap disk drive with care, as it is very fragile and can be easily damaged if exposed to shock, electrostatic discharge, or rough handling. 4.
Page 68
c. Align the side of the carrier with the raised guides on the left of the bay. d. Slide the carrier into the bay gently to keep from damaging the hot-swap disk drive. Slide the carrier to the rear of the bay until it is seated into the connector. S80 and S85 User’s Guide...
Page 69
e. Move the carrier latch to the locked position. If the carrier does not lock, check that the drive carrier is fully seated in the backplane. When the drive carrier is fully seated and power is turned on, the green LED on the drive carrier illuminates.
Removing Disk Drives from the Hot-Swap Bays This procedure describes how to remove hot-swap disk drives from a hot-swap bay while the I/O Drawer is powered on. If you remove a hot-swap disk drive when the I/O Drawer power is off, steps 4 and 5 do not apply.
Page 71
6. Unlock the drive by turning the carrier latch 90 degrees clockwise to the open position. SCSI Disk Drives 7. Disconnect the drive from the connector on the backplane by grasping the handle on the drive carrier and carefully pulling the drive out of the server. Attention: Do not open the drive;...
This web site gives ergonomic guidelines to help you understand the ergonomic considerations that you should know when working at a computer workstation. The address is: http://www.us.pc.ibm.com/healthycomputing S80 and S85 User’s Guide...
Using the Service Processor and Electronic Electronic Service Agent Features The Service Processor and Electronic Service Agent features protect users against unnecessary system downtime by keeping support personnel (both internal and external) aware of any unexpected changes in the system environment. In combination, the two features provide a flexible solution to automated system maintenance.
Ethernet. Once alerted to the system failure, the Control Workstation initiates actions to prepare and send the service request. A modem is required for enabling automated problem reporting to the IBM service center. Configuration files for several types of modems are included as part of the Electronic Service Agent package.
Chapter 3. Service Processor Menus The service processor menus make it possible for you to configure service processor options, as well as enable and disable functions. Service processor menus are available using an ASCII terminal while the system is powering on or when the service processor has detected a server problem (such as a surveillance failure).
Page 76
Service Processor Functions Service AIX Service SMS (ASCII or Processor Aids (ASCII or graphics Menus (ASCII graphics terminals) terminals) terminals) Read VPD image from last system boot Read System POST Errors Read Service Processor Error Logs View Progress Indicators from last Boot Power-off System Read NVRAM Start Talk Mode...
Service Processor Menus The service processor menus are divided into two groups: v General user menus - the user must know the general access password. v Privileged user menus - the user must know the privileged access password. The following section describes these two groups of menus, how to access them, and the functions associated with each option.
The service aid, Save or Restore Hardware Management Policies, can be used to save your settings after initial setup or whenever the settings must be changed for system operation purposes. It is strongly recommended that this service aid is used for backing up service processor settings to protect the usefulness of the service processor, and the availability of the server.
General User Menus The menu options presented to the general user are a subset of the options available to the privileged user. The user must know the general access password, if set, to access these menus. GENERAL USER MENU 1. Continue System Boot 2.
Page 80
v Exit from Menus Allows the user to continue with the system boot already in progress using the current ASCII terminal as the active console. Make this selection if you do not wish to monitor the IPL progress messages on your terminal. Exiting from the menus causes the modems to disconnect, but the boot process continues.
Privileged User Menus The following menus are available to privileged users only. The user must know the privileged access password, if set, to access these menus. Main Menu At the top of the Main Menu is a listing containing: v Your service processor’s current firmware version v The firmware copyright notice v The system name given to your server during setup You need the firmware version for reference when you either update or repair the...
Page 82
Note: The information under the Service Processor Firmware heading in the Main Menu illustration that follows is example information only. Service Processor Firmware VERSION: 19990707 Copyright 1999 IBM Corporation SYSTEM NAME MAIN MENU 1. Service Processor Setup Menu 2. System Power Control Menu 3.
Service Processor Setup Menu The service processor setup menu shown below is accessed from the Main menu: SERVICE PROCESSOR SETUP MENU 1. Change Privileged Access Password 2. Change General Access Password 3. Enable/Disable Console Mirroring: Currently Enabled 4. Start Talk Mode 5.
Passwords Passwords can be any combination of up to 8 alphanumeric characters. You can enter longer passwords, but the entries are truncated to include only the first 8 characters. The privileged-access password can be set from the service processor menus or from System Management Services (SMS) utilities (see “Chapter 4.
Page 85
v Enable/Disable Console Mirroring When console mirroring is enabled, the service processor sends information to both serial ports. This capability can be enabled by local or remote users. This provides local users the capability to monitor remote sessions. Console mirroring can be enabled for the current session only.
Page 86
v OS Surveillance Setup Menu This menu can be used to set up operating system (OS) surveillance. OS Surveillance Setup Menu 1. Surveillance: Currently Enabled 2. Surveillance Time Interval: 2 minutes 3. Surveillance Delay: 2 minutes 98. Return to Previous Menu 1>...
System Power Control Menu This menu can be used to set power control options. Other menus that control boot options are available here: SYSTEM POWER CONTROL MENU 1. Enable/Disable Unattended Start Mode: Currently Enabled 2. Reboot/Restart Policy Setup Menu 3. Continue System Boot 4.
System Information Menu Access to system configuration information, error logs, system resources, and processor configuration is a available from this menu: SYSTEM INFORMATION MENU 1. Read VPD Image from Last System Boot 2. Read Progress Indicators from Last System Boot 3.
Page 89
v Read Service Processor Configuration Displays current service processor configuration. v Processor Configuration/Deconfiguration Menu This menu allows the user to change the system processor configuration. If it is necessary to take one of the processors offline, this menu allows you to deconfigure a processor, and then reconfigure the processor at a later time.
Language Selection Menu The service processor menus and messages are available in different languages. This menu is used to select a language if desired: LANGUAGE SELECTION MENU 1. English 2. Francais 3. Deutsch 4. Italiano 5. Espanol 6. Svenska 98. Return to Previous Menu 99.
Call-In/Call-Out Setup Menu The following menu is used to configure a modem for the service processor for use to support the system: CALL-IN/CALL-OUT SETUP MENU 1. Modem Configuration Menu 2. Serial Port Selection Menu 3. Serial Port Speed Setup Menu 4.
Modem Configuration Menu The first two lines of the Modem Configuration Menu are status lines showing the current selections. Selections are made in the two sections labeled Modem Ports and Modem Configuration File Name. Select the serial port that you want to activate and then select the modem configuration file for the modem on the port.
Serial Port Selection Menu You can enable and/or disable the call-in and call-out functions of each serial port in any combination from this menu: Serial Port Selection Menu 1. Serial Port 1 Call-Out: Currently Disabled 2. Serial Port 2 Call-Out: Currently Disabled 3.
Telephone Number Setup Menu This menu can be used to set or change the telephone numbers for reporting a system failure. The service processor allows setting or changing telephone numbers for: Telephone Number Setup Menu 1. Service Center Telephone Number: Currently Unassigned 2.
Call-Out Policy Setup Menu Callout settings can be set using the following menu: CALL-OUT POLICY SETUP MENU 1. Call-Out policy (First/All): Currently First 2. Remote timeout, (in seconds): Currently 120 3. Remote latency, (in seconds): Currently 2 4. Number of retries: Currently 2 98.
Customer Account Setup Menu This menu allows users to enter information that is specific to their account: Customer Account Setup Menu 1. Customer Account Number: Currently Unassigned 2. Customer RETAIN Login userid: Currently Unassigned 3. Customer RETAIN login password: Currently Unassigned 98.
Reboot/Restart Policy Setup Menu The following menu controls Reboot/Restart Policy: Reboot/Restart Policy Setup Menu 1. Number of reboot attempts: Currently 1 2. Use OS-Defined restart policy? Currently Yes 3. Enable supplemental restart policy? Currently No 4. Call-Out before restart: Currently Disabled 98.
Boot Mode Menu The Boot Mode menu allows a selection of boot options to be selected. The following figure shows the Boot Mode Menu: Boot Mode Menu 1. Boot to SMS Menu: Currently Disabled 2. Service Mode Boot from Saved List: Currently Disabled 3.
Service Processor Procedures in Service Mode When the system is in service mode, the following service-processor parameters are suspended: v Unattended start mode v Reboot/restart policy v Call-out v Surveillance. When service mode is exited, the service processor parameters revert to the customer settings.
Service Processor Functions This section discusses some of the service processor features in greater detail. The service processor supports the following functions: Built-in Functions Initialization and Test Service Processor BIST System Chip Initialization Error Data Collection BIST/POST errors and status Isolation to Field Replaceable Unit (FRU) on failure Configuration...
System Power-On Methods v Power-on Switch - see “Powering On the System” on page 10. v Remote Power-on via Ring-Indicate Signal The server automatically powers on when it detects a ″ring indicate″ signal from a modem attached to one of the integrated serial ports. A remote user can call the server to activate ring detection by the modem.
v Fast/Slow Boot (IPL) Capabilities Using the operator-panel functions, you can select the IPL type, mode, and speed of your boot capabilities. For more information, refer to operator panel functions 01 and 02 in “Operator Panel Function Descriptions” on page 206. Attention: Selecting fast IPL results in several diagnostic tests being skipped.
Service Processor Reboot/Restart Recovery Reboot describes bringing the system hardware back up. For example, from a system reset or power on. The boot process ends when control passes to the operating system process. Restart describes activating the operating system after the system hardware is reinitialized.
Service Processor Reboot/Restart Policy Controls The operating system’s automatic restart policy (see operating system documentation) indicates the OS response to a system crash. The service processor can be instructed to refer to that policy, or not, by the Use OS-Defined Restart Policy menu item. If the operating system has no automatic restart policy, or if it is disabled, then the service processor restart policy can be controlled from the service processor menus by using the Enable Supplemental Restart Policy selection.
Page 105
The following provides a more thorough understanding of the relations among the OS and service processor restart controls: OS Automatic Service processor to Service Processor System response reboot/restart after use OS-Defined Enable supplemental crash setting restart policy? restart policy? None None Restarts None...
Configure/Deconfigure Processors The system features Repeat-Gard, an enhanced error detection and prevention function. This function allows the system to deconfigure processors from the system configuration if a processor has an unrecoverable error, or repeating recoverable errors. This prevents the system from using a defective processor. If a processor is deconfigured, it remains off-line for subsequent reboots until the faulty processor hardware is replaced.
Service Processor System Monitoring - Surveillance Surveillance is a function in which the service processor monitors the system, and the system monitors the service processor. This monitoring is accomplished by periodic samplings called heartbeats. Surveillance is available during two phases: v System firmware bringup (automatic) v Operating system runtime (optional).
Page 108
Surveillance does not take effect until the next time the operating system is started after setting the parameters. If desired, surveillance mode can be initiated immediately from service aids. In addition to the three options above, a fourth option is available to allow you to select immediate surveillance;...
To Call-Out before restart, set Call-out before restart to ENABLED from the Reboot/Restart Policy Setup menu. Note: Some modems, such as IBM 7857-017, are not designed for the paging function. Although they can be used for paging, they will return an error message when they do not get the expected response from another modem.
Console Mirroring Console mirroring allows a user on a local ASCII terminal to monitor the service processor activities of a remote user. Console mirroring ends when the service processor releases control of the serial ports to the system firmware. System Configuration The following describes the configuration for console mirroring: v Service Processor v Modem connected to one serial port and enabled for incoming calls...
Service Processor Error Logs The service processor error logs contain errors detected by the service processor. Error Log 19970626223337 0. Error detected.. 4B00F010 B455440004B00710700001370000000000000000007420A400010000000 0000000000000 Press "C" to clear error log, any other key to continue. > Note: The time stamp in this error log is Coordinated Universal Time (CUT), which is also referred to as Greenwich Mean Time (GMT).
LCD Progress Indicator Log The following is an example of the LCD progress indicator log. The progress indicators (checkpoints) are displayed in two sections. Above the dashed line are the progress indicators from the boot that produced the current sessions. Below the dashed line are progress indicators from the boot preceding the one that produced the current sessions.
Chapter 4. System Management Services Use the System Management Services menus to view information about your system and to perform such tasks as setting a password. Other tasks, such as accessing and changing adapter and device configurations are performed using system management services.
Password Utilities Entering this selection permits access to the options shown on the following screen: Password Utilities 1 Set Privileged Access-Password 2 Remove Privileged-Access Password 3 Unattended Start Mode <ON> .------. |X=Exit| ′------' ===> Set Privileged-Access Password: The privileged-access password protects against the unauthorized starting of the system programs.
Display Error Log A screen similar to the following displays when you select this option. You can view or clear your system error log. .--------------------------------------------------------. Error Log Date Time ErrorCode Location | Entry 1. 99/06/30 02:15:35 25a80011 U1.1-P1.IQ2 | Entry 2. 99/07/01 03:18:36 20300026 U1.1-P1.IQ2 ′--------------------------------------------------------' .-----------------.
Remote Initial Program Load Setup This option allows you to enable and set up the remote startup capability of your system. You must first specify the network parameters as shown in the following screen: Network Parameters 1. IP Parameters 2. Adapter Parameters 3.
Adapter Parameters Select the Adapter Parameters option from the Network Parameters menu to view an adapter’s hardware address, as well as configure network adapters that require setup. A screen similar to the following displays when you make this selection: Adapter Parameters Device Slot Hardware Address...
Page 118
Ping Select Ping from the Network Parameters menu to test a connection to a remote system unit. After selecting the Ping option, you must choose an adapter to communicate with the remote system. Device Slot Hardware Address 1. Gigabit Ethernet 5: U0.1-P1-I5/E1 0004ac7c8628 2.
SCSI Utilities SCSI Utilities allow you to set delay times for the SCSI hard disk spin-up and to set SCSI IDs for SCSI controllers installed in the system. SCSI Utilities 1. Harddrive Spin Up Delay 2. Change SCSI Id .--------. | X=Exit | ′--------' ===>...
MultiBoot Multiboot 1 Select Software 2 Software Default 3 Select Install Device 4 Select Boot Devices 5 OK Prompt 6 Multiboot Startup <OFF> .------. |X=Exit| ′------' ===> Select Software If this option is supported by the operating system, the names of the operating system installed are shown.
Select Boot Devices Select this option to view and change the custom boot list, which is the sequence of devices read at startup time. The boot list can contain five devices. Select Boot Devices 1 Display Current Settings 2 Restore Default Settings 3 Configure 1st Boot Device 4 Configure 2nd Boot Device 5 Configure 3rd Boot Device...
Page 122
Restore Default Settings Restores the boot list to the default sequence of: v Primary diskette drive v CD-ROM drive v Tape id=5 (slot=1) v Hard disk drive id=6 (Slot=1) v Network device. Selecting any of the Configure Boot Device options displays a screen similar to the following.
Select Language Select this option to change the language used by the text-based System Management Services screens and the service processor menus. Select Language 1. English 2. Francais 3. Deutsch 4. Italiano 5. Espanol 6. Svenska ===> .------. |X=Exit| ′------' Note: To properly display languages other than English, your ASCII terminal must support the ISO-8859 character set.
Chapter 5. Using the Online and Standalone Diagnostics The diagnostics consist of Standalone Diagnostics and Online Diagnostics. Standalone Diagnostics are packaged on removable media and must be booted before they can be run. If booted, they have no access to the AIX Error Log or the AIX Configuration Data. Online Diagnostics, when installed, reside with AIX in the file system.
Identifying the Terminal Type to the Diagnostics Note: This is not the same as selecting a console display. When you run diagnostics, you must identify what type of terminal you are using. If the terminal type is not known when the FUNCTION SELECTION menu is displayed, the diagnostics do not allow you to continue until a terminal is selected from the DEFINE TERMINAL option menu.
Running Online Diagnostics Consider the following when you run the Online Diagnostics from a server or a disk: v The diagnostics cannot be loaded and run from a disk until the AIX operating system has been installed and configured. After the AIX operating system has been installed, all three modes of diagnostics are available.
General Attributes Always Required The following general attributes are the default settings for the diagnostics. Be sure your terminal is set to these attributes. Note: These attributes should be set before the diagnostics are loaded. Refer to the following table. General Setup 3151 3151...
Page 129
General Setup 3151 3151 3161/ 3164 Description Attributes /11/31/41 /51/61 Settings Settings Settings Line wrap The cursor moves to the first character position of the next line in the page after it reaches the last character position of the current line in the page. Forcing insert Field Field...
Additional Communication Attributes The following communication attributes are for the 3151, 3161, and 3164 terminals. Communication 3151 3151 3161/3164 Description Setup Attributes /11/31/41 /51/61 Settings Settings Settings Operating mode Echo Echo Echo Data entered from the keyboard on the terminal is sent to the system unit for translation and then sent back to the display screen.
Additional Keyboard Attributes The following keyboard attributes are for the keyboard attached to the 3151, 3161, and 3164 terminals. Keyboard Setup 3151/11/ 3151 3161 /3164 Description Attributes 31/41 /51/61 Settings Settings Settings Enter Return Return Return The Enter key functions as the Return key.
Additional Printer Attributes The following printer attributes are for a printer attached to the 3151, 3161, and 3164 terminals. Printer Setup 3151/11/ 3151 3161 /3164 Description Attributes 31/41 /51/61 Settings Settings Settings Line speed 9600 9600 9600 Uses 19200 or 9600 bps (bits per second) line speed to communicate with the system unit.
Online Diagnostics Modes of Operation The Online Diagnostics can be run in three modes: v Service Mode v Concurrent Mode v Maintenance Mode Service Mode Service mode provides the most complete checkout of the system resources. This mode also requires that no other programs be running on the system. All system resources, except the SCSI adapter and the disk drives used for paging, can be tested.
Concurrent Mode Concurrent mode provides a way to run Online Diagnostics on some of the system resources while the system is running normal activity. Because the system is running in normal operation, some of the resources cannot be tested in concurrent mode. The following resources cannot be tested in concurrent mode: v SCSI adapters connected to paging devices v Disk drive used for paging...
Maintenance Mode Maintenance mode runs the Online Diagnostics using the customer’s version of the AIX operating system. This mode requires that all activity on the AIX operating system be stopped so the Online Diagnostics have most of the resources available to check. All of the system resources except the SCSI adapters, memory, processor, and the disk drive used for paging can be checked.
Standalone Diagnostic Operation Standalone Diagnostics provide a method to test the system when the Online Diagnostics are not installed and a method of testing the disk drives that cannot be tested by the Online Diagnostics. No Error Log Analysis is done by the Standalone Diagnostics. The CD-ROM drive and the SCSI controller that controls it cannot be tested by the Standalone Diagnostics.
Logical and Physical Locations The system uses physical location codes in conjunction with AIX location codes to provide mapping of the failing field replaceable units. The location codes are produced by the system unit’s firmware and AIX. Physical Location Codes Physical location codes provide a mapping of logical functions in a platform (or expansion sites for logical functions, such as connectors or ports) to their specific locations within the physical structure of the platform.
AIX Location Codes The basic formats of the AIX location codes are: v For non-SCSI devices/drives – AB-CD-EF-GH v For SCSI devices/drives – AB-CD-EF-G,H For planars, cards, and non-SCSI devices the location code is defined as: AB-CD-EF-GH | | | | | | | Device/FRU/Port ID | | Connector ID | devfunc Number, Adapter Number or Physical Location...
Page 139
For integrated adapter, EF-GH is the same as the definition for a pluggable adapter. For example, the location code for a diskette drive is 01-D1-00-00. A second diskette drive is 01-D1-00-01. For SCSI the location code is defined as: AB-CD-EF-G,H | | | | | | | | | Logical Unit address of the SCSI Device | | | Control Unit Address of the SCSI Device...
AIX and Physical Location Code Reference Tables The following tables contain location codes that are used to identify functional units in the system. Each table below shows the locations for a physical part the system. System Rack FRU Name Slot Location Code AIX Location Physical Logical...
System Rack Locations FRU Name Slot Location Code AIX Location Physical Logical Name Code Connection Connection Processor Card 0 U1.1-P1-C1 System Type 1, Front Backplane Processor Card 0 U1.1-P1-C1 00-00 On Processor All Cache Card 0 Processor Card 0 U1.1-P1-C1 00-00 On Processor Processor 0...
Page 142
System Rack Locations FRU Name Slot Location Code AIX Location Physical Logical Name Code Connection Connection R1 Memory Card U1.1-P1-M12 00-00 System Quad D, Rear Backplane R1 Memory Card U1.1-P1-M13 00-00 System Quad D, Rear Backplane R1 Memory Card U1.1-P1-M14 00-00 System Quad D, Rear...
Page 143
System Rack Locations FRU Name Slot Location Code AIX Location Physical Logical Name Code Connection Connection Regulator Card U1.1-P2-V1 Power Front Backplane Regulator Card U1.1-P2-V2 Power Front Backplane Regulator Card U1.1-P2-V3 Power Front Backplane Regulator Card U1.1-P2-V4 Power Front Backplane Regulator Card U1.1-P2-V5 Power...
Page 144
System Rack Locations FRU Name Slot Location Code AIX Location Physical Logical Name Code Connection Connection Bulk Power U1.2-V5 Power Supply 5 Front Harness, P05 AC Power Rear U1.2-V7 Power and 5 v standby Harness, P1, AC Box P7, P8, P9 S80 and S85 User’s Guide...
Page 146
I/O Drawer 0 Locations FRU Name Slot Location Code AIX Location Physical Logical Name Code Connection Connection JTAG U0.1-P1-X1 Not used by On Service Processor VDISK U0.1-P1-X1 Not used by On Service Processor NVRAM U0.1-P1-X1 Not used by On Service Processor PCI slot 8, U0.1-P1-X1...
Page 147
I/O Drawer 0 Locations FRU Name Slot Location Code AIX Location Physical Logical Name Code Connection Connection PCI Slot 10 U0.1-P1-I10 40-60 or I/O Planar J16, PCI Controller 40-60 to 64 Bit slot 40-67 or 4C-XX PCI Controller 2 U0.1-P1 00-f860-0000 PCI Slot 11 U0.1-P1-I11...
Page 148
I/O Drawer 0 Locations FRU Name Slot Location Code AIX Location Physical Logical Name Code Connection Connection DASD Fan U0.1-F1 Fan Monitoring Assembly U0.1-F2 Card J5 Monitoring (two-fan assembly Card on left power supply when viewed from rear) (10 EIA Unit I/O Drawer) DASD Fan U0.1-F3...
Page 150
I/O Drawer 1 Locations FRU Name Slot Location Code AIX Location Physical Logical Name Code Connection Connection PCI Slot 10 U0.2-P1-I10 80-60 or I/O Planar J16, PCI Controller 80-60 to 64 Bit slot 80-67 or 8C-XX PCI Controller 6 U0.2-P1 00-f8e0-0000 PCI Slot 11 U0.2-P1-I11...
Page 151
I/O Drawer 1 Locations FRU Name Slot Location Code AIX Location Physical Logical Name Code Connection Connection DASD Fan U0.2-F1 Fan Monitoring Assembly U0.2-F2 Card J5 Monitoring (two-fan assembly Card on left power supply when viewed from rear) (10 EIA Unit I/O Drawer) DASD Fan U0.2-F3...
Page 153
I/O Drawer 2 Locations FRU Name Slot Location Code AIX Location Physical Logical Name Code Connection Connection PCI Slot 10 U0.3-P1-I10 C0-60 or I/O Planar J16, PCI Controller C0-60 to 64 Bit slot C0-67 or CC-XX PCI Controller 10 U0.3-P1 00-f960-0000 PCI Slot 11 U0.3-P1-I11...
Page 154
I/O Drawer 2 Locations FRU Name Slot Location Code AIX Location Physical Logical Name Code Connection Connection DASD Fan U0.3-F1 Fan Monitoring Assembly U0.3-F2 Card J5 Monitoring (two-fan assembly Card on left power supply when viewed from rear) (10 EIA Unit I/O Drawer) DASD Fan U0.3-F3...
Page 156
I/O Drawer 3 Locations FRU Name Slot Location Code AIX Location Physical Logical Name Code Connection Connection PCI Slot 10 U0.4-P1-I10 G0-60 or I/O Planar J16, PCI Controller G0-60 to 64 Bit G0-67 or GC-XX PCI Controller 14 U0.4-P1 00-f9E0-0000 PCI Slot 11 U0.4-P1-I11 F0-58 or...
Page 157
I/O Drawer 3 Locations FRU Name Slot Location Code AIX Location Physical Logical Name Code Connection Connection DASD Fan U0.4-F1 Fan Monitoring Assembly U0.4-F2 Card J5 Monitoring (two-fan assembly Card on left power supply when viewed from rear) (10 EIA Unit I/O Drawer) DASD Fan U0.4-F3...
Chapter 6. Introduction to Tasks and Service Aids The AIX Diagnostic Package contains programs that are called Tasks. Tasks can be thought of as performing a specific function on a resource; for example, running diagnostics, or performing a service aid on a resource. This chapter describes the Tasks available in AIX Diagnostics Version 4.2 and later.
Tasks The following tasks are described in this chapter: v Add Resource to Resource List v AIX Shell Prompt v Analyze Adapter Internal Log v Backup and Restore Media v Certify Media v Change Hardware Vital Product Data v Configure Dials and LPFKeys v Configure Reboot Policy v Configure Remote Maintenance Policy v Configure Ring Indicate Power On Policy...
Page 161
v Process Supplemental Media v Run Diagnostics v Run Error Log Analysis v Run Exercisers v Save or Restore Hardware Management Policies v SCSI Bus Analyzer v SCSI Device Identification and Removal v SCSD Tape Drive Service Aid v Spare Sector Availability v SSA Service Aid v Update Disk Based Diagnostics v Update System or Service Processor Flash...
Add Resource to Resource List Use this task to add resources back to the resource list. Note: Only resources that were previously detected by the diagnostics and deleted from the Diagnostic Test List are listed. If no resources are available to be added, then none are listed.
Certify Media This task allows the selection of diskette or hardfiles to be certified. Hardfiles can be connected either to a SCSI adapter (non-RAID) or a PCI SCSI RAID adapter. The usage and criteria for a hardfile connected to a non-RAID SCSI adapter are different from those for a hardfile connected to a PCI SCSI RAID adapter.
Configure Dials and LPFKeys This service aid provides a tool for configuring and removing dials/LPFKs to the asynchronous serial ports. Since version 4.1.3 a TTY must be defined on the async port before the Dials and LPFKs can be configured on the port. Before version 4.2 the Dials and LPFKs could only be configured on the standard serial ports.
Page 165
The ’Supplemental Restart Policy’, if enabled, is used when the O/S Defined Restart Policy is undefined, or is set to False. When surveillance detects operating system inactivity during restart, an enabled ’Supplemental Restart Policy’ causes a system reset and the reboot process begins. v Call-Out Before Restart (on/off) When enabled, Call-Out Before Restart allows the system to call out (on a serial port that is enabled for call out) when an operating system restart is initiated.
Configure Remote Maintenance Policy Note: Runs on CHRP systems units only. The Remote Maintenance Policy includes modem configurations and phone numbers to use for remote maintenance support. Use this service aid to display and change the following settings for the Remote Maintenance Policy.
Page 167
This is the number for a telephone near the system, or answered by someone responsible for the system. This is the telephone number left on the pager for callback. v Customer System Phone Number This is the number to which your system’s modem is connected. The service or administration center representatives need this number to make direct contact with your system for problem investigation.
Configure Ring Indicate Power On Policy This service aid allows the user to power on a system by telephone from a remote location. If the system is powered off, and Ring Indicate Power On is enabled, the system powers on at a predetermined number of rings. If the system is already on, no action is taken.
Create Customized Configuration Diskette This selection invokes the Diagnostic Package Utility Service Aid which allows the user to perform the following action: v Create a Standalone Diagnostic Package Configuration Diskette The Standalone Diagnostic Package Configuration Diskette allows the following to be changed from the console: –...
Disk Maintenance v Disk to Disk Copy v Display/Alter Sector Disk to Disk Copy Notes: 1. This service aid cannot be used to update to a different size drive. The service aid only supports copying from a SCSI drive to another SCSI drive of similar size. 2.
Display Configuration and Resource List This service aid displays the item header only for all installed resources. Use this service aid when there is no need of seeing the VPD. (No VPD is displayed.) Display Firmware Device Node Information This task displays the firmware device node information that appears on CHRP platforms.
Display Microcode Level The following usage statements describe the syntax of the command to display the current level of micorcode on an adapter or device. Usage: diag -c -d <device_name> -T "disp_mcode" Display or Change Bootlist This service aid allows the bootlist to be displayed, altered, or erased. The system attempts to perform an IPL from the first device in the list.
This option allows the user to turn on or off the progress indicators when running the Diagnostic Applications. The progress indicators are a popup box at the bottom of the screen indicating the test being run (the default is on). v Save Changes to the Database This option allows the user to save any changes made to the run-time options.
Display Software Product Data This task invokes SMIT to display information about the installed software and provides the following functions: v List Installed Software v List Applied but Not Committed Software Updates v Show Software Installation History v Show Fix (APAR) Installation Status v List Fileset Requisites v List Fileset Dependents v List Files Included in a Fileset...
Flags: List the sensors and their values in a text format. List the sensors and their values in a numerical format. For each sensor, the following numerical values are displayed as: <token> <index> <status> <measured value> <location code> -t token Specifies the sensor token to query.
Download Microcode This service aid provides a way to copy microcode to an adapter or device. The service aid presents a list of adapters and devices that use microcode. After the adapter or device is selected, the service aid provides menus to guide you in checking the current level and downloading the needed microcode.
microcode source. /etc/microcode is default. Default source device is disk. Download Microcode to Other Devices The following usage statements describe the syntax of the command: Usage: diag -c -d <device_name> -T "download [-s diskette] [-l previous] [-F]" Flag Description Force flag. Required to download microcode if the current level is unavailable on the source.
Physical Disk Microcode Download This selection is used to update the microcode on any of the disk drives in the array. You can run this task from the AIX command line. The following usage statement describes the syntax of the fastpath command: Usage: diag -T "download"...
Page 179
1. Use the ″Erase Disk″ selection to overwrite the data on the drive. 2. Do a format without certify. 3. Run a second pass of the erase disk selection. For a newly installed drive, you can insure that all blocks on the drive are overwritten with your pattern if you use the following procedure: 1.
Hardfile Attached to PCI SCSI RAID Adapter This function formats the physical disks attached to a PCI SCSI RAID adapter. This task may be run directly from the AIX command line. The following usage statement describes the syntax of the command: Usage: diag -c -d <RAID adapterName>...
Generic Microcode Download This service aid provides a means of executing a ″generic″ script from a diskette. The intended purpose for this ″generic″ script is to load microcode to a supported resource. This script is responsible for executing whatever program is required in order to download the microcode onto the adapter or device.
Periodic Diagnostics This selection provides a tool for configuring periodic diagnostics and automatic error log analysis. A hardware resource can be chosen to be tested once a day, at a user specified time. If the resource cannot be tested because it is busy, error log analysis is performed.
Run Diagnostics The Run Diagnostics task invokes the Resource Selection List menu. When the commit key is pressed, Diagnostics are run on all selected resources. The procedures for running the diagnostics depends on the state of the diagnostics run time options. See “Display or Change Diagnostic Run Time Options” on page 154. Run Error Log Analysis The Run Error Log Analysis task invokes the Resource Selection List menu.
Acronyms The following list describes acronyms used in the exerciser reports. To toggle between two states (example Activate/Halt Devices), use the number of the device. Acronym Description Continue on error (use number to select). Device has run the specified number of cycles and is not running. The exerciser has been terminated by a signal.
Save or Restore Hardware Management Policies Use this service aid to save or restore the settings from Ring Indicate Power On Policy, Surveillance Policy, Remote Maintenance Policy and Reboot Policy. v Save Hardware Management Policies This selection writes all of the settings for the hardware management policies to the file: /etc/lpp/diagnostics/data/hmpolicies v Restore Hardware Management Policies...
When the adapter is selected the SCSI Bus Address Selection menu is displayed. This menu allows the user to enter which address to transmit the SCSI Inquiry Command. Once the address is selected the SCSI Bus Test Run menu is displayed. This menus allows the user to transmit the SCSI Inquiry Command by pressing the Enter key.
SCSD Tape Drive Service Aid This service aid provides a means to obtain the status or maintenance information from a SCSD tape drive. Only some models of SCSI tape drive are supported. The service aid provides the following options: v Display time since a tape drive was last cleaned. The time since the drive was last cleaned is displayed onto the screen.
Copy trace table. Verbose mode. If the -c flag is present, the information displays on the screen. If the -n flag is present, the information about tape head cleaning is printed. Spare Sector Availability This selection checks the number of spare sectors available on the optical disk. The spare sectors are used to reassign when defective sectors are encountered during normal usage or during a format and certify operation.
Update System or Service Processor Flash This selection updates the system or service processor flash for CHRP system units. Further update and recovery instructions may be provided with the update. It is necessary to know the fully qualified path and file name of the flash update image file that was provided.
This selection allows the microcode on the 7135 controller to be updated when required. v Physical Disk Microcode Download This selection is used to update the microcode on any of the disk drives in the array. v Update EEPROM This selection is used to update the contents of the EEPROM on a selected controller.
Chapter 7. Using the System Verification Procedure The system verification procedure is used to check the system for correct operation. If you are analyzing a hardware problem, see Chapter 8. Hardware Problem Determination. Step 1. Considerations before Running ThisProcedure Notes: 1.
Step 2. Loading the Diagnostics 1. Stop all application programs running on the operating system. 2. Stop the operating system. 3. Turn the power off. 4. If you are loading the Standalone Diagnostics and running them from an ASCII terminal, do the following: v The attributes for the terminal must be set to match the defaults of the diagnostics.
Step 3. Running System Verification 1. On the Diagnostic Operating Instructions Screen, press the Enter key. 2. If the terminal type has not been defined, you must use the Initialize Terminal option on the Function Selection menu to initialize the operating system environment before you can continue with the diagnostics.
Chapter 8. Hardware Problem Determination This chapter provides information on using Standalone or Online Diagnostics to help you solve hardware problems. In addition, it provides a list of error codes and recommended actions you can take to resolve the problem before contacting a service representative.
Step 3 Determine if the operating system is accepting commands. Is the operating system accepting commands? The system must be turned off in order to run diagnostics. Verify with the system administration and users that the system may be turned off. If so, then turn off the system unit and go to Step 6.
Step 5 This step invokes the Online Diagnostics in concurrent mode. 1. Log on as root or as superuser (su). 2. Enter the diag command. 3. Wait until the Diagnostic Operating Instructions are displayed, or wait for three minutes. Are the Diagnostic Operating Instructions displayed without any obvious console display problems? Do the following to shut down your system: 1.
Step 7 Find your symptom in the following table, and follow the instructions given in the Action column. Symptom Action Display problem Go to Step 8. All other symptoms Go to “Problem Determination When Unable to Load Diagnostics” on page 186. Step 8 The following steps analyze a console display problem.
Page 199
Step 10 There is a problem with the keyboard. Find the type of keyboard you are using in the following table, and follow the instructions given in the Action column. Keyboard Type Action 101–key keyboard. Identify by the type of Enter Record error code M0KBD001 and report the key used.
Page 200
Step 11 1. If the terminal type has not been defined, use the Initialize Terminal option on the Function Selection menu to initialize the operating system environment before you can continue with the diagnostics. This is a separate and different operation from selecting the console display.
Page 201
Step 12 Did the Diagnostic Selection Menu display? If Problem Determination was selected from the Diagnostic Mode Selection menu, and if a recent error has been logged in the error log, the diagnostics automatically begin testing the resource. Follow the displayed instructions. v If the No Trouble Found screen is displayed, press Enter.
Page 202
Step 14 The diagnostics produced an SRN for this problem. 1. Record the SRN and other numbers read out. 2. Report the SRN to the service organization. 3. STOP. You have completed these procedures. Step 15 When you are loading the Standalone Diagnostics, the attributes for the terminal must be set to match the defaults of the diagnostic programs.
Page 203
Step 16 This step loads the Standalone Diagnostics. If you are unable to load the diagnostics, go to Step 7. 1. Turn the power on. 2. Insert the diagnostic CD-ROM into the CD-ROM drive in the primary I/O Drawer. 3. When the keyboard indicator appears, press the 5 key on the keyboard to indicate that diagnostics are to be loaded.
Problem Determination When Unable to Load Diagnostics Use this procedure to obtain an error code. The service organization uses the error code to determine which field replaceable units (FRUs) are needed to restore the system to correct operation. Step 1. Considerations before Running This Procedure v The diagnostics can use a display connected to a display adapter with the keyboard and mouse connected to the primary I/O Drawer or an ASCII terminal attached to a serial port.
Step 4 This step attempts to load the Standalone diagnostics. 1. Turn the power off. 2. Turn the power on. 3. Insert the diagnostic CD-ROM into the CD-ROM drive. 4. If the Keyboard indicator appears, press the 5 key on the keyboard to indicate that diagnostics are to be loaded.
Step 5 Starting at the top of the following table, find your symptom and follow the instructions given in the Action column. Symptom Action The operator panel power LED does not come Check the power cable to the outlet. Check the on, or comes on and does not stay on.
Page 207
Symptom Action The system stopped and an indicator is If the POST indicator represents: displayed on the system console and an v A keyboard, record error code M0KBD000 eight-digit error code is not displayed. and report the problem to the service organization.
Step 6 The diagnostics loaded correctly. Go to “Problem Determination Using the Standalone or Online Diagnostics” on page 177. Step 7 There is a problem with the keyboard. Find the type of keyboard you are using in the following table, and follow the instructions given in the Action column.
Error Codes The following table lists error codes that you may encounter, and the actions to perform to correct the reported error. Table 1. Firmware Error Codes. Error Code Description Action 20A80000 Insufficient Verify the IP address. information to boot. 20A80001 Client IP address Change IP address.
Page 210
Table 1. Firmware Error Codes. (continued) Error Code Description Action 20EE000C Pointer to the Values normally found in non-volatile storage that point operating system to the location of an Operating System were not found. found in This can happen for two reasons, either your installed non-volatile Operating System doesn’t support storing the values or storage.
Page 211
Table 1. Firmware Error Codes. (continued) Error Code Description Action 25A80100 NVRAM data Turn off, turn on system unit and retry the operation validation check before replacing any system component. failed. Refer to Action under error code 25A80001. 25A80210 Setenv/$Setenv Refer to Action under error code 25A80001.
Page 212
Table 1. Firmware Error Codes. (continued) Error Code Description Action 28030001 RTC not updating Errors reported against the Real Time Clock can be - RTC initialization caused by low Battery voltage and (more rarely) power required outages that occur during normal system usage. These errors are warnings that the Real Time Clock data content needs to be re-established and do not require any FRU replacement unless the error is persistent.
Page 213
Table 1. Firmware Error Codes. (continued) Error Code Description Action 4B00F111 Call to System Check for: Administrator 1. Remote system problem failed 2. Telephone line problem 3. Remote host problem 4B00F112 Call to Pager Check for: failed 1. Remote system problem 2.
Environmental Design The environmental efforts that have gone into the design of this system signify IBM’s commitment to improve the quality of its products and processes. Some of these accomplishments include the elimination of the use of Class I ozone-depleting chemicals in the manufacturing process and reductions in manufacturing wastes.
Appendix B. Notices This information was developed for products and services offered in the U.S.A. The manufacturer may not offer the products, services, or features discussed in this document in other countries. Consult the manufacturer’s representative for information on the products and services currently available in your area. Any reference to the manufacturer’s product, program, or service is not intended to state or imply that only that product, program, or service may be used.
Appendix C. Supplies This appendix contains a list of supplies and the part numbers needed to order them. Part Number Description 21F8763 4mm Cleaning Cartridge 21F8758 4mm DDS|||| Data Tape Cartridge (5-pack) 21F8762 4mm DDS|||| Diagnostic Cartridge 8191160 4mm DDS2 Data Tape Cartridge (5-pack) 8191146 4mm DDS2 Diagnostic Cartridge 21F8732...
Appendix D. Operator Panel Functions The operator panel is used to control the system power and other functions. This appendix describes operator panel function codes. System Rack Operator Panel The following diagram shows the locations of the operator panel display and the operator panel pushbuttons.
Function Code Table The operator panel functions and descriptions are listed in the following table. For details about the function, see “Operator Panel Function Descriptions” on page 206. Notes: 1. The x can be any number 0 through 9, any letter A through F, or a blank. 2.
Page 223
Table 2. Operator Panel Function Codes (continued) Function Code Function Selected 24 xxxx xxxx Reserved 25 xxxx xxxx The service representative switch 1 is set; this function is the first step necessary to set the service function range from 50 to 70.
Operator Panel Function Descriptions The following describes operator panel functions in more detail. Values for IPL Types and Speeds See Table 3 and Table 4 for tables of valid IPL types and speeds used in operator panel functions 01 and 02. Table 3.
Function 01 - Display Selected IPL Type, Mode and Speed This function allows you to display the IPL type and IPL speed. Pressing the Enter button in Function 01 toggles between: v Display IPL type (A or B) v Display IPL speed (F, S, or V = F or V = S) Table 5.
Function 03 - Start IPL Function 03 is enabled only when the system is powered on. This function starts an IPL using the selected IPL mode when the Enter button is pressed. Note: The system may power off and then back on as part of the IPL. Function 04 - Lamp Test This function ensures that no indicators are burned out and that characters displayed at the operator panel are valid.
Function 08 - Fast Power Off Use this function when the system is suspended and a power down cannot be performed. Initially, an attention SRC (11-x A1xx 8008) displays indicating that Function 08 has been selected. Note: If the server has Unattended Start Mode (USM) enabled, selecting this function causes the server to power off and then power on again.
System Info Specific.(YL)...P2 SP_CARD_: Part Number....PART_NUM EC Level....EC_LEVEL FRU Number....FRU_NUM_ Manufacture ID....IBM Serial Number....SERIAL_# Version.....0000RS6K ROM Level (alterable)..19990630 (B) 19990620 (A) <= SvP FW levels System Info Specific.(YL)...P2 The ROM level lines list the level numbers of the system and service processor (SvP) firmware installed.
Service Processor Firmware Version: 19990630 Copyright 1998, IBM Corporation The numbers in the second line show the booted firmware level. If this level is less than the update level available for your server, you should consider installing the update.
Downloading Firmware Updates Prior to downloading the firmware, you are asked to read and accept the terms of the Machine Code License Agreement. Once you accept the terms, you are assigned a password. Write down this password as it is required later to unpack the files you download.
Appendix F. Service Processor Setup and Test For your convenience, a sample Service Processor setup procedure is provided below. Your setup may include more or less of the available features, so you may wish to adjust this checklist for your own application. Service Processor Setup Checklist 1.
Testing the Service Processor Setup These tests include communicating with the server’s operating system. Be sure the necessary serial port(s) is configured. If you need assistance, refer to “Serial Port Configuration” on page 217. Use the following procedure to verify that your Service Processor setup is working. The server should still be powered off as a result of the setup checklist procedure on page “Service Processor Setup Checklist”...
Call-Out During the Service Processor setup, you entered your phone number for the Pager (on page 76) and Customer Voice (on page 76) phone numbers. These entries are used for this test. 1. Power on the server. 2. Bring up the Service Processor Menus, see Chapter 3. Service Processor Menus. Attention: To bring up the Service Processor Menus while the server is booting up, watch the operator panel display for checkpoint E04F.
Appendix G. Modem Configurations With hundreds of modems to choose from, and various modem programming standards, configuring a modem for use with the SP can be challenging. The SP is designed to place little demand on an attached modem, thereby increasing the setup and connection success rates.
If yes, use modem configuration file modem_m0_sp, and go to step 7 on page 221 below. Note: The IBM 7852-400 modem has dip switches on the right side of the unit. See “IBM 7852-400 DIP Switch Settings” on page 222 for the correct switch settings.
Setup Z0 Setup F Setup F0 Setup F1 (Rare) AT&T DataPort 2001 ** Bocamodem 1440E Hayes Smart Modem IBM 5841 IBM 5843 IBM 7851 IBM 7852-10 IBM 7855 USRobotics 36.6K Sportster Zoom V.32 Note: ** Ring interrupt only on first ring.
IBM 7852-400 DIP Switch Settings If you are using a 7852-400 modem to enable service processor communications, for proper operation, the DIP switches must be set according to the following table. Switch Position Function Force DTR Flow Control &E4 Down...
If your modem communicates properly with remote users, it is probably in control. It is recommended that you occasionally change some of the functional settings and then change them back, just for the to ensure that the modem is communicating, and to ensure it has been initialized recently.
If yes, the modem is set up correctly. If no, try another &Dn setting for your server’s modem. See your modem documentation for this information. The &Dn command appears in three places each in three of the sample modem configuration files, (see the note on page 224). Note: Only the following sample modem configuration files contain the &Dn command (in three places each): v modem_f.cfg...
# Licensed Materials - Property of IBM # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # If the modem has configuration switches, they should be set to the # factory default settings.
Page 244
send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done retry:...
# Licensed Materials - Property of IBM # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # If the modem has configuration switches, they should be set to the # factory default settings.
Page 247
connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command.
# Licensed Materials - Property of IBM # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # If the modem has configuration switches, they should be set to the # factory default settings.
Page 250
expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2\r" # Detect carrier and DTR. expect "0\r" timeout 2 # Confirm command successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r"...
# Licensed Materials - Property of IBM # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # If the modem has configuration switches, they should be set to the # factory default settings.
Page 253
send "ATS0=0\r" # Set AutoAnswer OFF expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup.
Page 254
page: send "ATDT%N,,,,%R;\r" # %N = pager call center number # Add enough commas to wait for # time to enter paging number. # %R = paging number # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r"...
# Licensed Materials - Property of IBM # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # If the modem has configuration switches, they should be set to the # factory default settings.
Page 256
expect "0\r" timeout 2 # Confirm command successful. send "AT&C1&D2&R1\r" # Detect carrier and DTR, # Ignore RTS. expect "0\r" timeout 2 # Confirm command successful. done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response.
Page 257
# Add enough commas to wait for # time to enter paging number. # %R = paging number # Confirm successful command. expect "0\r" timeout 60 delay 2 # Wait before hanging up. send "ATH0T\r" # Hang up. expect "0\r" timeout 2 # Confirm successful command.
# Licensed Materials - Property of IBM # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # The modem has configuration switches. They should be set to the # factory default settings, except switches 11 and 12.
Page 259
# Expect a connection response. expect "33\r" or "31\r" or "28\r" or "26\r" or "24\r" or "21\r" or "19\r" or "13\r" or "12\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode.
Page 260
ignore "0\r" or "OK\r\n" timeout 2 # Ignore modem response. send "AT&E12&E14\r" # Disable pacing # Disable data compression expect "0\r" or "OK\r\n" timeout 2 # Confirm successful command. send "AT&SF1&S0S9=1\r" # DSR independent of CD. # Force DSR on. # CD respond time=100ms expect "0\r"...
Sample File modem_m1.cfg # COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP modem_m1 # FUNCTIONS: Modem configuration file specifically for IBM 7857-017 modem with # Auto-Reliable feature. This feature must be turned off for Catcher calls. # This example uses the AT&F reset command to choose the factory defaults.
Appendix H. High Availability Cabling HA - 7017 Cluster Servers consist of a minimum of two systems in a cluster with two 7133 SSA Disk Storage Systems. Each system in the cluster requires the AIX operating system software and HACMP high-availability cluster software. The base systems use and share the external SSA disks in the 7133 SSA Disk Storage Systems.
Page 266
Hardware Failure Failure Behavior If Extra Work Recovery Action and Description HACMP Is Not Required To Behavior If HACMP Is Configured Provide Recovery Configured Action (in addition to normal HACMP configuration) Node Outage or AIX Node unavailable None HACMP fallover. Application(s) crash unavailable for brief time during fallover.
Base HA - 7017 Cluster Server System Cabling This section provides cabling information for the base HA - 7017 Cluster Server. The two 17S/80s should be installed and operational before cabling the HA - 7017 Cluster Server. There are four areas of cabling that you need to consider to ensure the redundancy required for no single points of failure: v Cabling for server system consoles and cluster administration workstations v Heartbeat connections between HA - 7017 Cluster Servers...
HA Cluster Server with ASCII System Console A single ASCII terminal connected to both servers in the cluster allows a system administrator or the service representative to work on one server or the other without disrupting the operation of the cluster. Index Description Cable adapter DB9f--DB25M (9 pin to 25 pin) (PN 40H6328)
HA Cluster Server With Graphical System Console If graphical system console is used for system administration or service representative tasks, each cluster server must have its own console. Appendix H. High Availability Systems...
HA Cluster Server Graphical Cluster Administration Workstation In addition to the system consoles described in the previous sections, a LAN attached cluster administration workstation is required. Two Local area networks are required to eliminate a single point of failure. S80 and S85 User’s Guide...
Base HA - 7017 Cluster Server Heartbeat Connections The primary heartbeat connections between two HA - 7017 Cluster Servers are made through a serial connection and the LAN connections. These connections are shown in the illustration. The serial connection is made using an 8-Port Asynchronous PCI Adapter or an optional 128-Port Asynchronous PCI Adapter.
SSA Cabling Connections This section shows how to connect the cables from the HA - 7017 Cluster Server and the 7133 Serial Disk Systems. SSA From Cluster Servers to Double Looped 7133 Note: The figure above shows cabling for a fully populated 7133 (16 disk drives installed).
Base HA Cluster Server AC Power Connections When installing an HA cluster server, care must be taken to ensure that power is also connected in a redundant manner. The figure below shows how power can be connected to ensure that your cluster has separate power connections. Index Description Power Cable, PDU to 7133 with three power supplies...
Index Numerics console mirroring enable/disable 67 3.5-inch diskette drive quick disconnect 92 using 22 system configuration 92 4.0GB 4-mm tape drive customer administration center 76 general information 32 status lights 36 using 36 deconfigure, processor 88 4-mm tape cartridge determining versions 211 loading 38 diagnostic considerations unloading 39...
Page 276
fibre channel RAID service aids 159 (continued) I/O drawer locations (continued) flash SK-NET FDDI firmware 160 drawer 3 137 format media 160 indicator panel, 10 EIA Unit I/O Drawer 15 generic microcode download 163 inserting a SCSI disk drive 48 log repair action 163 machine check error log 153 keyboards...
Page 277
modem power (continued) configuration file selection 220 powering on the system 10 configurations 219 power LED 15 seamless transfer 223 power-on methods 83 modem_f.cfg, sample file 231 preface xvii modem_f0.cfg, sample file 234 primary console 224 modem_f1.cfg, sample file 237 privileged access password, changing 66 modem_m0.cfg, sample file 240 privileged user menus 63...
Page 278
service processor checklist 215 system introduction 1 service processor configuration 71 system management services service processor feature 55 text-based 95 service processor installation system phone number 76 checklist 215 system POST errors service processor menus read 61, 70 accessing locally 59 system power-on methods 83 accessing remotely 59 system processor configuration menu 71...
Reader’s Comments — We’d Like to Hear From You Title: Enterprise Server S80 p Series 680 Model S85 User’s Guide Order Number: SA38-0557-01 Overall how satisfied are you with the information in this book? Very Satisfied Neutral Dissatisfied Very Satisfied...
Page 280
Cut or Fold Along Line Fold and Tape Please do not Staple Fold and Tape NO POSTAGE NECESSARY IF MAILED IN THE UNITED STATES BUSINESS REPLY MAIL POSTAGE WILL BE PAID BY ADDRESSEE Information Development Department H6DS-9561 11400 Burnet Road Austin, TX 78758-3493 Fold and Tape Please do not Staple...
Page 282
Printed in the United States of America on recycled paper containing 10% recovered post-consumer fiber. November 2000 SA38-0557-01...