Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition | |||
|
|
Solution Type Problem Resolution Sure Solution 1013102.1 : PSH: Diagnosis Issue for T5120 and T5220 (SUN4V-8000-E2)
PreviouslyPublishedAs 217952 Symptoms A PSH fault message appears on the console with the MSG-ID SUN4V-8000-E2 (uncorrectable memory error). The recommended action is to replace the faulty DIMM(s). In most cases, this diagnosis is correct. However, there is a known issue (CR 6592272) that will result in this diagnosis when there is no fault. Relief/Workaround Determine the error that caused the message by using the EVENT-ID from the fault message with the fmdump command. The EVENT-ID is highlighted in the example below: SUNW-MSG-ID: SUN4V-8000-E2, TYPE: Fault, VER: 1, SEVERITY: Critical EVENT-TIME: Mon Oct 22 14:02:36 EDT 2007 PLATFORM: SUNW,SPARC-Enterprise-T5220, CSN: -, HOSTNAME: s12y-galaxy4 SOURCE: cpumem-diagnosis, REV: 1.6 EVENT-ID: b5eeacc6-0467-e0eb-ba4e-e368249b69d4 DESC: The number of errors associated with this memory module has exceeded acceptable levels. Refer to http://sun.com/msg/SUN4V-8000-E2 for more information. AUTO-RESPONSE: Pages of memory associated with this memory module are being removed from service as errors are reported. IMPACT: Total system memory capacity will be reduced as pages are retired. REC-ACTION: Schedule a repair procedure to replace the affected memory module. Use fmdump -v -u <EVENT_ID> to identify the module. Cut and paste the EVENT-ID into the following command: # fmdump -eV -u b5eeacc6-0467-e0eb-ba4e-e368249b69d4 | grep dram-esr If the dram-esr is "0x1000000000008221", then the issue has been encountered. For any other value, the diagnosis is correct and the memory DIMM(s) diagnosed as faulty should be replaced. If the issue has been encountered, the PSH diagnosis message should be ignored and the fault should be cleared. Clear the fault by using the EVENT-ID from the fault message in the following command: # fmadm repair b5eeacc6-0467-e0eb-ba4e-e368249b69d4
fmadm: recorded repair to
b5eeacc6-0467-e0eb-ba4e-e368249b69d4
Verify that the fault has also been cleared from the service processor by switching to the service processor console and using the ALOM showfaults command: sc> showfaults Last POST Run: Tue Oct 2 16:18:22 2007 Post Status: Passed all devices No failures found in System If the fault is still displayed on the service processor, clear the fault using the ALOM clearfault command with the EVENT-ID from the fault message: sc> clearfault b5eeacc6-0467-e0eb-ba4e-e368249b69d4 Resolution On the Sun SPARC Enterprise[TM] T5120 and T5220, a patch for this issue is available. The patch is SunOS 5.10: kernel patch 127127-11 or later. Product Sun SPARC Enterprise T5120 Server Sun SPARC Enterprise T5220 Server Internal Comments Place Sun Internal-Use Only content here. This content will be published to internal SunSolve only. PSH, SUN4V-8000-E2, T5120, T5220 Previously Published As 91180 Change History Date: 2011-05-04 User name: Dencho Kojucharov Action: Currency check Comments: audited by Entry-Level SPARC Content Lead Date: 2009-11-18 User Name: Anthony Rulli Action: Updated Comment: currency check, audited by Anthony Rulli, Entry Level SPARC Content team Attachments This solution has no attachment |
||||||||||||
|