Sun Microsystems, Inc.  Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition

Asset ID: 1-72-1013102.1
Update Date: 2011-05-04
Keywords:

Solution Type  Problem Resolution Sure

Solution  1013102.1 :   PSH: Diagnosis Issue for T5120 and T5220 (SUN4V-8000-E2)  


Related Items
  • Sun SPARC Enterprise T5220 Server
  • Sun SPARC Enterprise T5120 Server
Related Categories
  • GCS>Sun Microsystems>Servers>CMT Servers

Previously Published As
217952


Symptoms
A PSH fault message appears on the console with the MSG-ID SUN4V-8000-E2 (uncorrectable memory error). The recommended action is to replace the faulty DIMM(s).

In most cases, this diagnosis is correct. However, a known issue (CR 6592272) can produce this diagnosis when no fault is present.



Relief/Workaround

Determine the error that caused the message by passing the EVENT-ID from the fault message to the fmdump command. The EVENT-ID appears on the line beginning with "EVENT-ID:" in the example below:

SUNW-MSG-ID: SUN4V-8000-E2, TYPE: Fault, VER: 1, SEVERITY: Critical
EVENT-TIME: Mon Oct 22 14:02:36 EDT 2007
PLATFORM: SUNW,SPARC-Enterprise-T5220, CSN: -, HOSTNAME: s12y-galaxy4
SOURCE: cpumem-diagnosis, REV: 1.6
EVENT-ID: b5eeacc6-0467-e0eb-ba4e-e368249b69d4
DESC: The number of errors associated with this memory module has exceeded
acceptable levels. Refer to http://sun.com/msg/SUN4V-8000-E2 for more information.
AUTO-RESPONSE: Pages of memory associated with this memory module are being removed
from service as errors are reported.
IMPACT: Total system memory capacity will be reduced as pages are retired.
REC-ACTION: Schedule a repair procedure to replace the affected memory module. Use
fmdump -v -u <EVENT_ID> to identify the module.
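
If the fault message has been saved to a file or captured from the console log, the EVENT-ID can be pulled out without copying it by hand. The following is a minimal sketch only; the function name extract_event_id is hypothetical and not part of Solaris, and it assumes the message has the "EVENT-ID: <id>" layout shown above.

```shell
# Hypothetical helper (not a Solaris command): print the first EVENT-ID
# found in a PSH fault message supplied on standard input.
extract_event_id() {
    # Match lines whose first field is exactly "EVENT-ID:" and print the
    # second field (the UUID), then stop after the first match.
    awk '$1 == "EVENT-ID:" { print $2; exit }'
}

# Example usage, assuming the message was saved to a file named fault.txt
# (the file name is an assumption for illustration):
#   extract_event_id < fault.txt
```

For the example message above, this prints b5eeacc6-0467-e0eb-ba4e-e368249b69d4.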

Cut and paste the EVENT-ID into the following command:

# fmdump -eV -u b5eeacc6-0467-e0eb-ba4e-e368249b69d4 | grep dram-esr

dram-esr = 0x1000000000008221

If the dram-esr value is "0x1000000000008221", the known issue (CR 6592272) has been encountered. For any other value, the diagnosis is correct and the DIMM(s) diagnosed as faulty should be replaced.
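
The comparison above can be scripted. The sketch below is illustrative only: the function name classify_dram_esr and its three result strings are hypothetical, and it adds one assumption not stated in this document, namely that if several dram-esr lines are reported, the known issue is assumed only when every one of them carries the value above.

```shell
# Hypothetical helper (not a Solaris command): classify the dram-esr
# values found in `fmdump -eV -u <EVENT-ID>` output read from stdin.
KNOWN_ESR="0x1000000000008221"   # value this document associates with CR 6592272

classify_dram_esr() {
    # Collect the last field of every dram-esr line, de-duplicated.
    # Backticks are used so the function also runs under older Bourne shells.
    values=`awk '/dram-esr/ { print $NF }' | sort -u`
    if [ -z "$values" ]; then
        echo "no-data"         # no dram-esr line found; no conclusion possible
    elif [ "$values" = "$KNOWN_ESR" ]; then
        echo "known-issue"     # only the known value was seen
    else
        echo "replace-dimm"    # some other value present; treat diagnosis as correct
    fi
}

# Example usage on the affected system, with the EVENT-ID from the fault message:
#   fmdump -eV -u b5eeacc6-0467-e0eb-ba4e-e368249b69d4 | classify_dram_esr
```

A result of "no-data" means the output contained no dram-esr line at all, in which case the fmdump output should be inspected by hand.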

If the issue has been encountered, the PSH diagnosis message should be ignored and the fault should be cleared.  Clear the fault by using the EVENT-ID from the fault message in the following command:

# fmadm repair b5eeacc6-0467-e0eb-ba4e-e368249b69d4

fmadm: recorded repair to b5eeacc6-0467-e0eb-ba4e-e368249b69d4


Verify that the fault has also been cleared from the service processor by switching to the service processor console and using the ALOM showfaults command:

sc> showfaults
Last POST Run: Tue Oct  2 16:18:22 2007
Post Status: Passed all devices
No failures found in System

If the fault is still displayed on the service processor, clear the fault using the ALOM clearfault command with the EVENT-ID from the fault message:

sc> clearfault b5eeacc6-0467-e0eb-ba4e-e368249b69d4



Resolution
A patch for this issue is available for the Sun SPARC Enterprise[TM] T5120 and T5220: SunOS 5.10 kernel patch 127127-11 or later.
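
To check whether a system already has this patch, the installed patch list from showrev -p can be filtered for patch 127127 at revision 11 or higher. This is a rough sketch under stated assumptions: the function name has_min_patch_rev is hypothetical, the input is assumed to use the "Patch: <id>-<rev> ..." line layout, and a later kernel patch that supersedes 127127 under a different patch ID would not be recognized by this simple check.

```shell
# Hypothetical helper (not a Solaris command): read `showrev -p` output on
# stdin and report whether patch 127127 at revision 11 or later is listed.
has_min_patch_rev() {
    awk '
        # Lines of interest look like: "Patch: 127127-11 Obsoletes: ..."
        $1 == "Patch:" && $2 ~ /^127127-/ {
            split($2, parts, "-")            # parts[1] = base ID, parts[2] = revision
            if (parts[2] + 0 >= 11) found = 1
        }
        END {
            if (found) print "patched"
            else       print "not-found"
        }'
}

# Example usage on the affected system:
#   showrev -p | has_min_patch_rev
```

A result of "not-found" does not prove the fix is absent, for the reason given above; it only means patch ID 127127 at the required revision was not listed.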


Product
Sun SPARC Enterprise T5120 Server
Sun SPARC Enterprise T5220 Server

Internal Comments

Bug ID 6592272


PSH, SUN4V-8000-E2, T5120, T5220
Previously Published As
91180

Change History
Date: 2011-05-04
User Name: Dencho Kojucharov
Action: Currency check
Comment: audited by Entry-Level SPARC Content Lead
Date: 2009-11-18
User Name: Anthony Rulli
Action: Updated
Comment: currency check, audited by Anthony Rulli, Entry-Level SPARC Content team

Attachments
This solution has no attachment
  Copyright © 2011 Sun Microsystems, Inc.  All rights reserved.