Sun Microsystems, Inc.  Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition

Asset ID: 1-73-1019336.1
Update Date:2011-02-22
Keywords:

Solution Type  FAB (standard) Sure

Solution  1019336.1 :   Active replacement of XSCF Unit on Sun SPARC Enterprise M8000/M9000 can lead to unit fault.  


Related Items
  • Sun SPARC Enterprise M9000-32 Server
  • Sun SPARC Enterprise M9000-64 Server
  • Sun SPARC Enterprise M8000 Server
Related Categories
  • GCS>Sun Microsystems>Sun FAB>Standard>Reactive

PreviouslyPublishedAs
238626


Bug Id
<SUNBUG: 6683763>, <SUNBUG: 6667692>

Date of Preliminary Release
10-Jun-2008

Date of Resolved Release
16-Sep-2008

Product
Sun SPARC Enterprise M9000 Server
Sun SPARC Enterprise M8000 Server

Active replacement of XSCF can lead to unit fault (see details below).

Affected Parts:

371-2228   eXtended System Control Facility Unit B, XSCF_B, RoHS:Y
371-2229   eXtended System Control Facility Unit C, XSCF_C, RoHS:Y

Impact

An active replacement of an XSCF Unit (XSCFU) using "replacefru" can leave the new XSCF Unit unusable.  This is caused by a timeout of the database sync between the XSCF Units.  Once the XSCFU is in this state it cannot be repaired in the field and must itself be replaced.

Contributing Factors

This issue can occur on either of the Affected Parts listed above if the replacefru command is used to perform an active replacement of an XSCF Unit while the XCP firmware is at release 1071 or earlier.
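
The exposure condition above comes down to a single version comparison. The following is a minimal sketch in Python, not part of any Sun tool; the function name and the constant are hypothetical, and it assumes the XCP release has already been read from the system as a plain four-digit string such as "1071".

```python
# Hypothetical helper: the name and constant are illustrative only.
# Per this FAB, XCP 1072 is the first release that contains the fix.
FIRST_FIXED_XCP = 1072


def active_replacement_at_risk(xcp_version: str) -> bool:
    """Return True if the XCP release is 1071 or earlier.

    Assumes xcp_version is a plain numeric release string, e.g. "1071".
    """
    return int(xcp_version.strip()) < FIRST_FIXED_XCP


if __name__ == "__main__":
    for release in ("1071", "1072"):
        print(release, active_replacement_at_risk(release))
```

A True result means that replacefru should not be used for an active XSCFU replacement on that platform until the firmware has been updated.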

Symptoms

When this issue occurs, showstatus reports one of the XSCF Units as faulted, for example:

  XSCFU_B#0  Status:Faulted

or

  XSCFU_B#1  Status:Faulted
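
When showstatus output has been captured to a file or log, the faulted units can be picked out with a short script. This is a sketch only: it assumes each unit appears on one line in the "NAME  Status:VALUE" form shown in the examples above, and the "Normal" status in the sample text is a hypothetical value added for contrast, not taken from this FAB.

```python
import re
from typing import List

# Matches lines such as "  XSCFU_B#0  Status:Faulted".
# The line layout is assumed from the examples in this document.
_STATUS_LINE = re.compile(r"(XSCFU_[BC]#\d+)\s+Status:\s*(\w+)")


def faulted_xscf_units(showstatus_text: str) -> List[str]:
    """Return the names of XSCF Units reported with Status:Faulted."""
    return [
        match.group(1)
        for match in _STATUS_LINE.finditer(showstatus_text)
        if match.group(2) == "Faulted"
    ]


if __name__ == "__main__":
    # The second line uses a hypothetical status purely for illustration.
    sample = "  XSCFU_B#0  Status:Faulted\n  XSCFU_B#1  Status:Normal\n"
    print(faulted_xscf_units(sample))
```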

Root Cause

An active replacement of the XSCFU board with the replacefru command can take too long, causing a timeout during the database sync between the two XSCF Units.

For details refer to BugIDs 6683763 and 6667692.

Corrective Action

Workaround:

XCP releases below 1072:

Customers may elect to use cold swap to immediately replace a failed XSCF Unit.  This requires a full platform outage with all domains down.  The breakers must also be switched off to eliminate power to the XSCF Units.

Customers may instead elect to defer the replacement of a failed XSCF Unit until a firmware fix is available, and then use active swap.  Active swap does not impact domains.  However, deferring the replacement of the failed XSCF Unit runs the platform in a non-redundant XSCF configuration in the meantime.

The replacement procedures are documented in the Sun SPARC Enterprise M8000/M9000 Servers Service Manual. The cold replacement procedure is documented in section 11.3.

You should also check the firmware of the replacement XSCF per section 8.1.10 of the Sun SPARC Enterprise M4000/M5000/M8000/M9000 Servers XSCF User's Guide.
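
The choice between the workaround options can be summarised as a small decision helper. This is an illustrative sketch of the guidance in this FAB, not a Sun-provided tool; the function name, parameter names, and option wording are all hypothetical.

```python
from typing import List


def replacement_options(xcp_version: int, full_outage_acceptable: bool) -> List[str]:
    """List the XSCFU replacement options described in this FAB.

    xcp_version: installed XCP release as an integer, e.g. 1071.
    full_outage_acceptable: True if all domains can be taken down and
        the breakers switched off for a cold swap.
    """
    if xcp_version >= 1072:
        # The fix is present, so active swap can be used.
        return ["active swap"]

    options = []
    if full_outage_acceptable:
        options.append("cold swap (all domains down, breakers off)")
    # Deferral is always possible, at the cost of XSCF redundancy.
    options.append("defer replacement until the firmware fix is available "
                   "(platform runs with non-redundant XSCF)")
    return options


if __name__ == "__main__":
    for option in replacement_options(1071, full_outage_acceptable=False):
        print(option)
```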

All documentation for these products can be found at:



Resolution:

Install XCP firmware 1072 (or later), which can be downloaded from the Oracle download site.

For instructions on how to download XCP 1072 (or later), refer to "HOWTO" document ID 1002631.1.



For information about FAB documents, their release processes, implementation strategies and billing information, go to the following URL:

In addition to the above you may email:


Modification History
16-Sep-2008: Changed from Preliminary to Resolved. Updated Workaround and Resolution sections.


Internal Contributor/submitter
D.Campbell@Sun.COM

Internal Eng Responsible Engineer
Ben.Chang@Sun.COM
Responsible Manager: Mary.Vigil@Sun.COM

Internal Services Knowledge Engineer
Joe.Davis@Sun.COM

Internal Eng Business Unit Group
SSG ES (Enterprise Systems)

Internal Sun Alert & FAB Admin Info
06-Jun-2008: Initial draft completed and sent to Extended Review.
10-Jun-2008: Incorporated all feedback from Ext Rvw - sending to Publish.
16-Sep-2008: chgd from Preliminary to Resolved and rePublished.
17-Dec-2009: Replaced Product with Swordfish Nomenclature

Attachments
This solution has no attachment
  Copyright © 2011 Sun Microsystems, Inc.  All rights reserved.