Sun Microsystems Sun Fire X4240 User Manual

Sun Microsystems, Inc.
www.sun.com
Submit comments about this document at: http://www.sun.com/hwdocs/feedback
Sun Fire™ X4140, X4240, and X4440
Servers Diagnostics Guide
Part No. 820-3067-11
August 2008, Revision A

Summary of Contents

Page 1 - Servers Diagnostics Guide

Sun Microsystems, Inc.
www.sun.com
Submit comments about this document at: http://www.sun.com/hwdocs/feedback
Sun Fire™ X4140, X4240, and X4440 Servers Di

Page 2

x Sun Fire X4140, X4240, and X4440 Servers Diagnostics Guide • August 2008
Sun Welcomes Your Comments
Sun is interested in improving its documentation a

Page 3 - Contents

CHAPTER 1
Initial Inspection of the Server
This chapter includes the following topics:
■ “Service Troubleshooting Flowchart” on page 1
■ “Gathering Servic

Page 4

Gathering Service Information
The first step in determining the cause of a pr

Page 5 - Contents v

System Inspection
Controls that have been improperly set and cables that are loose or improperly connected a

Page 6

Internally Inspecting the Server
To perform a visual inspection of the intern

Page 7 - Before You Read This Document

FIGURE 1-2 X4440 Server Front Panel
2. Remove the server cover.
For instructions on removing the server cove

Page 8 - Related Documentation

10. If the problem with the server is not evident, you can obtain additional

Page 9 - Web Sites

CHAPTER 2
Using SunVTS Diagnostic Software
This chapter contains information about the SunVTS™ diagnostic software tool.
Running SunVTS Diagnostic Tests
T

Page 10 - Sun Welcomes Your Comments

■ QLogic Host Bus Adapter Test (qlctest)
■ RAM Test (ramtest)
■ Serial Port Te

Page 11 - Troubleshooting Flowchart

Using the Bootable Diagnostics CD
To use the diagnostics CD to perform diagnostics:
1. With the server power

Page 12 - Gathering Service Information

Copyright © 2008 Sun Microsystems, Inc., 4150 Network Circle, Santa Clara, California 95054, U.S.A. All rights reserved.
Unpublished - rig

Page 13 - System Inspection

■ Solaris system message log is a log of all the general Solaris events log

Page 14 - X4140 Server Front Panel

CHAPTER 3
Troubleshooting DIMM Problems
This chapter describes how to detect and correct problems with the server’s Dual Inline Memory Modules (DIMMs).

Page 15 - X4440 Server Front Panel

DIMM Replacement Policy
Replace a DIMM when one of the following events take

Page 16

3. BIOS reports this event in the service processor’s system event log (SEL) as shown in the sample IPMItool

Page 17

The lines in the display start with event numbers (in hex), followed by a d
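The hex event number at the start of each SEL line can be converted to an integer with a few lines of Python. This is only a sketch: it assumes the pipe-separated layout commonly printed by IPMItool's `sel list` command, the helper name `parse_sel_line` is hypothetical, and the sample line uses placeholder fields rather than output copied from the guide.

```python
def parse_sel_line(line):
    """Split one pipe-separated SEL line into (event number, remaining fields)."""
    fields = [field.strip() for field in line.split("|")]
    event_id = int(fields[0], 16)  # the event number is printed in hex
    return event_id, fields[1:]


# Hypothetical sample line with placeholder fields (assumption, not real output)
sample = " 1a | 08/26/2008 | 11:36:09 | sensor field | event text"
event_id, rest = parse_sel_line(sample)
print(event_id, rest)
```

Converting the event number to an integer makes it straightforward to sort or compare entries when a log is filtered outside the service processor.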

Page 18 - Diagnostics CD

to view ECC errors
■ Linux:
The HERD utility can be used to manage DIMM errors in Linux. See the x64 Servers Ut

Page 19

Note – The DIMM Fault and Motherboard Fault LEDs operate on stored power fo

Page 20

FIGURE 3-1 DIMMs and LEDs on Motherboard

Page 21 - Troubleshooting DIMM Problems

FIGURE 3-2 DIMMs and LEDs on Mezzanine Board
Isolating and Correcting DIMM E

Page 22 - DIMM Replacement Policy

3. Press the PRESS TO SEE FAULT button, and inspect the DIMM fault LEDs. See FIGURE 3-1 and FIGURE 3-2.
A flas

Page 23

Contents
Preface vii
1. Initial Inspection of the Server 1
Service Troubleshooting Flowchart 1
Gathering Service Information 2
System Inspection 3
Troubl

Page 24 - Correctable DIMM Errors

11. Power on the server and run the diagnostics test again.
12. Review the l

Page 25 - DIMM Fault LEDs

APPENDIX A
Event Logs and POST Codes
This appendix contains information about the BIOS event log, the BMC system event log, the power-on self-test (POST

Page 26

Main Advanced PCIPnP Boot Security Chipset Exit
**********

Page 27 - DIMMs and LEDs on Motherboard

b. From the Advanced Settings screen, select Event Log Configuration.
The Advanced Menu Event Logging Details scr

Page 28

c. From the IPMI 2.0 Configuration screen, select View BMC System Event Log

Page 29 - FIGURE 3-1 and FIGURE 3-2

Power-On Self-Test (POST)
The system BIOS provides a rudimentary power-on self-test. The basic devices required f

Page 30 - 12. Review the log file

Redirecting Console Output
Use the following instructions to access the serv

Page 31 - Event Logs and POST Codes

10. Set the color depth for the redirection console at either 6 or 8 bits.
11. Click the Start Redirection butto

Page 32

Changing POST Options
These instructions are optional, but you can use them

Page 33

3. Select Boot Settings Configuration.
The Boot Settings Configuration screen is displayed.
4. On the Boot Settin

Page 34

Uncorrectable DIMM Errors 12
Correctable DIMM Errors 14
BIOS DIMM Error Messa

Page 35 - Power-On Self-Test (POST)

■ Boot Num-Lock – This option is On by default (keyboard Num-Lock is turned

Page 36 - Redirecting Console Output

POST Codes
TABLE A-1 contains descriptions of each of the POST codes, listed in the same order in which they are

Page 37 - ■ Password: changeme

de00 Preparing CPU for booting to OS by copying all of the context of the B

Page 38 - Changing POST Options

POST Code Checkpoints
The POST code checkpoints are the largest set of checkpoints during the BIOS pre-boot proc

Page 39

0E Testing and initialization of different Input Devices. Also, update the

Page 40

60 Initializes NUM-LOCK status and programs the KBD typematic rate.
75 Initialize Int-13 and prepare for IPL det

Page 41 - POST Codes

B1 Save system context for ACPI.
00 Prepares CPU for booting to OS by copyin
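A checkpoint table of this kind maps naturally onto a lookup keyed by the hex code. The sketch below is illustrative only: the helper name `describe_checkpoint` is hypothetical, and the dictionary holds just the few entries visible in the excerpts on this page, cut at the point where each excerpt is still complete.

```python
# Descriptions abbreviated from the excerpts visible on this page;
# the full table in the guide contains many more entries.
POST_CHECKPOINTS = {
    0x0E: "Testing and initialization of different Input Devices",
    0x60: "Initializes NUM-LOCK status and programs the KBD typematic rate",
    0x75: "Initialize Int-13",
    0xB1: "Save system context for ACPI",
    0x00: "Prepares CPU for booting to OS",
}


def describe_checkpoint(code):
    """Return the description for a two-digit hex checkpoint string such as 'B1'."""
    value = int(code, 16)  # accepts upper- or lower-case hex digits
    return POST_CHECKPOINTS.get(value, f"unknown checkpoint {value:02X}")


print(describe_checkpoint("b1"))
print(describe_checkpoint("FF"))
```

Keying the table by integer rather than by string means that "b1" and "B1" resolve to the same entry, which helps when codes are copied from a redirected console.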

Page 42 - POST Codes (Continued)

APPENDIX B
Status Indicator LEDs
This appendix contains information about the locations and behavior of the LEDs on the server. It describes the externa

Page 43 - POST Code Checkpoints

Front Panel LEDs
FIGURE B-1 Front Panel LEDs (X4140 shown)
Back Panel LEDs
FIG

Page 44 - Post Code Description

Hard Drive LEDs
FIGURE B-3 Hard Drive LEDs
Internal Status Indicator LEDs
The server has internal status indicators on

Page 45

Handling of Uncorrectable Errors 53
Handling of Correctable Errors 56
Handling of Parity Errors (PERR) 59
Handling of System Errors (SERR) 61
Ha

Page 46

Note – The mezzanine board, when present, obscures part of the motherboard,

Page 47 - Status Indicator LEDs

FIGURE B-5 DIMMs and LEDs on Mezzanine Board

Page 48 - Back Panel LEDs


Page 49 - Hard Drive LEDs

APPENDIX C
Using the ILOM Service Processor GUI to View System Information
This appendix contains information about using the Integrated Lights Out Mana

Page 50

Making a Serial Connection to the SP
To make a serial connection to the SP:
1

Page 51

Viewing ILOM SP Event Logs
Events are notifications that occur in response

Page 52

FIGURE C-1 System Event Logs Page
3. Select the category of event that you w

Page 53 - APPENDIX

After you have selected a category of event, the Event Log table is updat

Page 54 - SUNSP0003BA84D777 login:

■ ILOM web GUI operation; for example, from the Maintenance tab, selecting

Page 55 - Viewing ILOM SP Event Logs

2. From the System Information tab, select Components.
The Replaceable Com

Page 56 - System Event Logs Page


Page 57 - Event Log Fields

Viewing Sensors
This section describes how to view the server temperature, v

Page 58 - Information

FIGURE C-3 Sensor Readings Page
3. Click the Refresh button to update the

Page 59 - FIGURE C-2

FIGURE C-4 Sensor Details Page
5. If the problem with the server is not evid

Page 60 - Viewing Sensors

APPENDIX D
Error Handling
This appendix contains information about how the servers process and log errors.
See the following sections:
■ “Handling of Unc

Page 61 - Sensor Readings Page

Note – If the error is on low 1MB, the BIOS freezes after rebooting. Theref

Page 62 - Sensor Details Page

FIGURE D-1 DMI Log Screen, Uncorrectable Error

Page 63 - Error Handling

Handling of Correctable Errors
This section lists facts and considerations a

Page 64

FIGURE D-2 DMI Log Screen, Correctable Error
■ If during any stage of memory testing the BIOS finds itself incapable of read

Page 65 - Appendix D Error Handling 55

EXAMPLE D-1 DMI Log Screen, Correctable Error, Memory Decreased

Page 66

Handling of Parity Errors (PERR)
This section lists facts and considerations about how the server handles parity errors (PER

Page 67 - EXAMPLE D-1

Preface
The Sun Fire X4140, X4240, and X4440 Servers Diagnostics Guide contains information and procedures for using available tools to diagnose prob

Page 68

FIGURE D-3 DMI Log Screen, PCI Parity Error
■ The BIOS displays the followin

Page 69 - Appendix D Error Handling 59

Note – The Linux system reboots, but does not inform the BIOS of this incident.
Handling of System Errors (SERR)
This sectio

Page 70 - ■ NMI EVENT!!

■ FIGURE D-5 shows an example DMI log screen from the BIOS Setup Page with

Page 71

Handling Mismatching Processors
This section lists facts and considerations about how the server handles mismatching process

Page 72 - DMI Log Screen with Error

Hardware Error Handling Summary
TABLE D-1 summarizes the most common hardwar

Page 73 - Appendix D Error Handling 63

Single-bit DRAM ECC error
With ECC enabled in the BIOS Setup, the CPU detects and corrects a single-bit error on the DIMM interfac

Page 74 - SEL) Fatal?

PCI SERR, PERR
System or parity error on a PCI bus.
Sync floods on HyperTranspo

Page 75 - Appendix D Error Handling 65

Multiple fan failure
Fan failure is detected by reading tach signals.
The Front Fan Fault, Service Action Required, and individu

Page 76


Page 77 - Appendix D Error Handling 67

Index
B
BIOS
changing POST options, 28
event logs, 21
POST code checkpoints, 33
POST codes, 31
POST overview, 25
redirecting console output for POST, 26
Boot

Page 78

Related Documentation
The document set for the Sun Fire X4140, X4240, and

Page 79

external, 3
internal, 4
Integrated Lights-Out Manager Service Processor, See I

Page 80

Typographic Conventions
Third-Party Web Sites
Sun™ is not responsible for the availability of third-party web sites mentioned in this document. S
