Sun Microsystems, Inc.
www.sun.com
Submit comments about this document at: http://www.sun.com/hwdocs/feedback
Sun Fire™ X4140, X4240, and X4440 Servers Di
Sun Fire X4140, X4240, and X4440 Servers Diagnostics Guide • August 2008
Sun Welcomes Your Comments
Sun is interested in improving its documentation a
CHAPTER 1
Initial Inspection of the Server
This chapter includes the following topics:
“Service Troubleshooting Flowchart” on page 1
“Gathering Servic
Gathering Service Information
The first step in determining the cause of a pr
System Inspection
Controls that have been improperly set and cables that are loose or improperly connected a
Internally Inspecting the Server
To perform a visual inspection of the intern
FIGURE 1-2 X4440 Server Front Panel
2. Remove the server cover.
For instructions on removing the server cove
10. If the problem with the server is not evident, you can obtain additional
CHAPTER 2
Using SunVTS Diagnostic Software
This chapter contains information about the SunVTS™ diagnostic software tool.
Running SunVTS Diagnostic Tests
T
QLogic Host Bus Adapter Test (qlctest)
RAM Test (ramtest)
Serial Port Te
Using the Bootable Diagnostics CD
To use the diagnostics CD to perform diagnostics:
1. With the server power
Copyright © 2008 Sun Microsystems, Inc., 4150 Network Circle, Santa Clara, California 95054, U.S.A. All rights reserved.
Unpublished - rig
Solaris system message log is a log of all the general Solaris events log
CHAPTER 3
Troubleshooting DIMM Problems
This chapter describes how to detect and correct problems with the server’s Dual Inline Memory Modules (DIMMs).
DIMM Replacement Policy
Replace a DIMM when one of the following events take
3. BIOS reports this event in the service processor’s system event log (SEL) as shown in the sample IPMItool
The lines in the display start with event numbers (in hex), followed by a d
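As a rough illustration of reading such a listing, the sketch below splits one SEL line into its hexadecimal event number and the remaining fields. It assumes a pipe-delimited layout of the kind that `ipmitool sel list` typically prints. The function name `parse_sel_line` and the sample line are hypothetical and are not taken from this guide.

```python
def parse_sel_line(line):
    """Split one SEL listing line into its event number and other fields.

    Assumption: fields are separated by '|' and the first field is the
    event number written in hexadecimal.
    """
    fields = [field.strip() for field in line.split("|")]
    if len(fields) < 2 or not fields[0]:
        raise ValueError(f"not a SEL listing line: {line!r}")
    # int(..., 16) converts the hex event number to a plain integer.
    return {"event_number": int(fields[0], 16), "fields": fields[1:]}


if __name__ == "__main__":
    # Made-up sample line for illustration only, not real server output.
    sample = " 1a | 08/01/2008 | 12:00:00 | Memory | Uncorrectable ECC"
    print(parse_sel_line(sample))
```

Converting the event number to an integer makes it easy to sort entries or spot gaps in the log; the other fields are left as strings because their meaning depends on the event type.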
to view ECC errors
Linux: The HERD utility can be used to manage DIMM errors in Linux. See the x64 Servers Ut
Note – The DIMM Fault and Motherboard Fault LEDs operate on stored power fo
FIGURE 3-1 DIMMs and LEDs on Motherboard
FIGURE 3-2 DIMMs and LEDs on Mezzanine Board
Isolating and Correcting DIMM E
3. Press the PRESS TO SEE FAULT button, and inspect the DIMM fault LEDs. See FIGURE 3-1 and FIGURE 3-2.
A flas
Contents
Preface
1. Initial Inspection of the Server
Service Troubleshooting Flowchart
Gathering Service Information
System Inspection
Troubl
11. Power on the server and run the diagnostics test again.
12. Review the l
APPENDIX A
Event Logs and POST Codes
This appendix contains information about the BIOS event log, the BMC system event log, the power-on self-test (POST
Main Advanced PCIPnP Boot Security Chipset Exit
**********
b. From the Advanced Settings screen, select Event Log Configuration.
The Advanced Menu Event Logging Details scr
c. From the IPMI 2.0 Configuration screen, select View BMC System Event Log
Power-On Self-Test (POST)
The system BIOS provides a rudimentary power-on self-test. The basic devices required f
Redirecting Console Output
Use the following instructions to access the serv
10. Set the color depth for the redirection console at either 6 or 8 bits.
11. Click the Start Redirection butto
Changing POST Options
These instructions are optional, but you can use them
3. Select Boot Settings Configuration.
The Boot Settings Configuration screen is displayed.
4. On the Boot Settin
Uncorrectable DIMM Errors
Correctable DIMM Errors
BIOS DIMM Error Messa
Boot Num-Lock – This option is On by default (keyboard Num-Lock is turned
POST Codes
TABLE A-1 contains descriptions of each of the POST codes, listed in the same order in which they are
de00 Preparing CPU for booting to OS by copying all of the context of the B
POST Code Checkpoints
The POST code checkpoints are the largest set of checkpoints during the BIOS pre-boot proc
0E Testing and initialization of different Input Devices. Also, update the
60 Initializes NUM-LOCK status and programs the KBD typematic rate.
75 Initialize Int-13 and prepare for IPL det
B1 Save system context for ACPI.
00 Prepares CPU for booting to OS by copyin
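When watching POST output, a small lookup can turn a checkpoint code into its description. The sketch below is illustrative only: it contains just the three checkpoint entries whose descriptions appear in full in this section, and the names `POST_CHECKPOINTS` and `describe_checkpoint` are hypothetical.

```python
# Checkpoint descriptions copied from the entries visible in this section.
# The table is deliberately incomplete; extend it from the full POST code
# checkpoint table before relying on it.
POST_CHECKPOINTS = {
    0x0E: "Testing and initialization of different Input Devices.",
    0x60: "Initializes NUM-LOCK status and programs the KBD typematic rate.",
    0xB1: "Save system context for ACPI.",
}


def describe_checkpoint(code):
    """Return the description for a two-digit hex checkpoint such as 'B1'."""
    value = int(code, 16)
    return POST_CHECKPOINTS.get(value, f"unknown checkpoint {value:02X}")


if __name__ == "__main__":
    for code in ("0E", "60", "b1", "FF"):
        print(code, "->", describe_checkpoint(code))
```

Keying the table by integer rather than by string means that `b1` and `B1` resolve to the same entry.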
APPENDIX B
Status Indicator LEDs
This appendix contains information about the locations and behavior of the LEDs on the server. It describes the externa
Front Panel LEDs
FIGURE B-1 Front Panel LEDs (X4140 shown)
Back Panel LEDs
FIG
Hard Drive LEDs
FIGURE B-3 Hard Drive LEDs
Internal Status Indicator LEDs
The server has internal status indicators on
Handling of Uncorrectable Errors
Handling of Correctable Errors
Handling of Parity Errors (PERR)
Handling of System Errors (SERR)
Ha
Note – The mezzanine board, when present, obscures part of the motherboard,
FIGURE B-5 DIMMs and LEDs on Mezzanine Board
APPENDIX C
Using the ILOM Service Processor GUI to View System Information
This appendix contains information about using the Integrated Lights Out Mana
Making a Serial Connection to the SP
To make a serial connection to the SP:
1
Viewing ILOM SP Event Logs
Events are notifications that occur in response
FIGURE C-1 System Event Logs Page
3. Select the category of event that you w
After you have selected a category of event, the Event Log table is updat
ILOM web GUI operation; for example, from the Maintenance tab, selecting
2. From the System Information tab, select Components.
The Replaceable Com
Viewing Sensors
This section describes how to view the server temperature, v
FIGURE C-3 Sensor Readings Page
3. Click the Refresh button to update the
FIGURE C-4 Sensor Details Page
5. If the problem with the server is not evid
APPENDIX D
Error Handling
This appendix contains information about how the servers process and log errors. See the following sections:
“Handling of Unc
Note – If the error is on low 1MB, the BIOS freezes after rebooting. Theref
FIGURE D-1 DMI Log Screen, Uncorrectable Error
Handling of Correctable Errors
This section lists facts and considerations a
FIGURE D-2 DMI Log Screen, Correctable Error
If during any stage of memory testing the BIOS finds itself incapable of read
EXAMPLE D-1 DMI Log Screen, Correctable Error, Memory Decreased
Handling of Parity Errors (PERR)
This section lists facts and considerations about how the server handles parity errors (PER
Preface
The Sun Fire X4140, X4240, and X4440 Servers Diagnostics Guide contains information and procedures for using available tools to diagnose prob
FIGURE D-3 DMI Log Screen, PCI Parity Error
The BIOS displays the followin
Note – The Linux system reboots, but does not inform the BIOS of this incident.
Handling of System Errors (SERR)
This sectio
FIGURE D-5 shows an example DMI log screen from the BIOS Setup Page with
Handling Mismatching Processors
This section lists facts and considerations about how the server handles mismatching process
Hardware Error Handling Summary
TABLE D-1 summarizes the most common hardwar
Single-bit DRAM ECC error
With ECC enabled in the BIOS Setup, the CPU detects and corrects a single-bit error on the DIMM interfac
PCI SERR, PERR
System or parity error on a PCI bus.
Sync floods on HyperTranspo
Multiple fan failure
Fan failure is detected by reading tach signals.
The Front Fan Fault, Service Action Required, and individu
Related Documentation
The document set for the Sun Fire X4140, X4240, and
Typographic Conventions
Third-Party Web Sites
Sun™ is not responsible for the availability of third-party web sites mentioned in this document. S