Sun Microsystems, Inc.  Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-73-1017336.1
Update Date:2010-08-19
Keywords:

Solution Type  FAB (standard) Sure

Solution  1017336.1 :   Best practice on installing V445 CPU cage and addressing possible DOA system.  


Related Items
  • Sun Fire V445 Server
  •  
Related Categories
  • GCS>Sun Microsystems>Sun FAB>Standard>Reactive
  •  

PreviouslyPublishedAs
228389


Product
Sun Fire V445 Server

System may show ALOM errors (see details below).

Affected Part Numbers:

541-1040-01   Sun Fire V445 Motherboard Tray Assembly


Impact

At power up (including CPU cage re-installation) the system may show ALOM error messages.


Contributing Factors

Sun Fire V445 systems, with date codes of 0722 to 0749, may have some minor CPU cage bending at CPU slot 0 and 1.

Symptoms

A poorly aligned cpu card cage can generate SC Alerts that look the same as a real FRU failure.  An affected system could show one or more of the following SC messages at power on and will then power off.

Examples of SC Alerts:

a. SC Alert: no seeprom at 0xb8 Failed to read ES segment from FRU seeprom
b. SC Alert: FAULT_SENSOR @ C3.P0.FF_POK has FAILED.
c. SC Alert: Access to TEMP_SENSOR @ C1.P0.T_CORE has failed.
d. SC Alert: Access to TEMP_SENSOR @ C1.T_AMB has failed.
e. SC Alert: FAULT_SENSOR @ FIOB.I_USB2 has FAILED

Note: If after checking cpu card cage alignment the error persists, try replacing the indited FRU as described in the Resolution section below.


Root Cause

Root cause was determined to be due to thin sheet metal supplied from China. As a result there was some metal which was bending during the manufacturing process resulting in CPU cages that were causing CPU re-seat issues. This issue was corrected when the sheet metal supplier began using material suppled from Mexico which is slightly thicker than that used from China. As Engineering began to investigate this problem, it was determined that some of these parts were already in the field.

There was no stop ship purge and there will be no ECO, as there is no change to the parts. This condition was addressed by requiring the supplier to produce parts that were not bent.


Resolution

This is a best practices alert and should be done during all CPU installations.

1. When installing the CPU cage, install all the screws which secure the CPU cage to the mother board, but do not tighten the screws.  Keep them loose.

2. Locate the securing screw at CPU slot 1 and apply light pressure to the sheet metal with a finger near the screw, and toward the rear of the chassis.

3. Tighten the screw while maintaining finger pressure. Reference pictures via the below URLs:

http://sdpsweb.central/FIN_FCO/FAB/103194/SPE/DSCN1466.jpg

http://sdpsweb.central/FIN_FCO/FAB/103194/SPE/DSCN1467.jpg

CAUTION! As with all computer equipment, components may be damaged by electrostatic discharge (ESD). Please take proper ESD precautions when handling all FRUs/CRUs.

4. Then tighten the other 5 remaining screws.  Check by sliding the CPU in Slot 3 and then in Slot 0.  They should feel similar in each slot having a slight friction feel.

5. Install the CPU and check error messages are clear.

6. If the error messages do not clear, replace the CPU at the indicated location.

    As an example:  FAULT_SENSOR @ C2.P0.FF_POK has FAILED

    You would replace CPU 2 if the error message does not clear.



For information about FAB documents, its release processes, implementation strategies and billing information, go to the following URL:

For Sun Authorized Service Providers go to:

In addition to the above you may email:



Modification History
Changes made since initial publication.

10-JAN-2008
  • Added ESD Caution statement in Resolution section.
10-Aug-2009
  • corrected slot numbering error in step 4 of Resolution section.

Previously Published As
103196
Internal Contributor/submitter
Karen.Vergakes@Sun.COM

Internal Eng Responsible Engineer
Bruce.Alford@Sun.COM

Internal Services Knowledge Engineer
Joe.Davis@Sun.COM

Internal Eng Business Unit Group
SSG WGS (Workgroup Systems)

Internal Kasp FAB Legacy ID
103196

Internal Sun Alert & FAB Admin Info
Critical Category:
Significant Change Date: 2008-01-09
Avoidance: Service Procedure
Responsible Manager: Ron.Boudreau@Sun.COM
Original Admin Info: WF - completed draft and sent to Ext Rvw. - Joe 1/7/08
WF - Ext Rvw complete, sending to Publish. - Joe 1/9/08
WF - republished with ESD Caution added. - Joe 1/10/08


Attachments
This solution has no attachment
  Copyright © 2011 Sun Microsystems, Inc.  All rights reserved.
 Feedback