Document Audience: | INTERNAL |
Document ID: | I0924-1 |
Title: | Main Fan 6 and Main Fan 7 failures can cause reduced system availability for Sun Fire V1280 and Netra 1280 servers. |
Copyright Notice: | Copyright © 2005 Sun Microsystems, Inc. All Rights Reserved |
Update Date: | 2003-01-29 |
---------------------------------------------------------
- Sun Proprietary/Confidential: Internal Use Only -
---------------------------------------------------------------------
FIELD INFORMATION NOTICE
(For Authorized Distribution by SunService)
FIN #: I0924-1
Synopsis: Main Fan 6 and Main Fan 7 failures can cause reduced system availability for Sun Fire V1280 and Netra 1280 servers.Create Date: Jan/21/02
SunAlert: No
Top FIN/FCO Report: No
Products Reference: Sun Fire V1280 and Netra 1280 servers
Product Category: Server / Service
Product Affected:
Systems Affected:
-----------------
Mkt_ID Platform Model Description Serial Number
------ -------- ----- ----------- -------------
- A40 ALL Sun Fire V1280 -
- N40 ALL Netra 1280 -
X-Options Affected:
-------------------
Mkt_ID Platform Model Description Serial Number
------ -------- ----- ----------- -------------
- - - - -
Parts Affected:
Part Number Description Model
----------- ----------- -----
- - -
References:
RFE: 4778183 - Apply blacklisting rules for SB4/P2 SB2/P2 with fan 6
failure.
DOC: 817-0510-10: Sun Fire V1280/Netra 1280 Service Manual.
816-7124-13: Sun Fire V1280/Netra 1280 Product Notes.
Issue Description:
Main Fan 6 and Main Fan 7 failures can cause reduced system
availability in Sun Fire V1280 and Netra 1280 servers. The impact to
system operation will depend upon the number of System Boards installed
and upon the ambient temperature. In the worst case scenario, the
system will reboot with one or more System Boards disabled. This FIN
describes the different failure modes and the service procedures
required for recovery.
This issue affects any Sun Fire V1280 or Netra 1280 server in which
Main Fan 6 or Main Fan 7 has slowed down or has failed.
When this issue occurs, a Fan will fail and LOM errors will be
displayed on the console and in the LOM log files. The fault LED
located on the faulty Fan will be lit. The Service Engineer should
determine appropriate action by inspecting the Fan to see if it is
still spinning or has stopped completely. Reference the Service Manual
for the location of the 8 Fans.
Systems Operating at up to 35�C Ambient Temperature:
====================================================
Configuration: 4 x CPUs (SB0)
Any main system fan failure can be resolved with hot-swap procedure.
There is no impact to system availability.
Configuration: 8 x CPUs (SB0 & SB2)
If main system Fan(s) 0, 1, 2, 3, 4, 5 or 7 are running slow or have
stopped, the system will report alarms but will continue to
operate. These failures can be resolved with a hot-swap procedure.
There is no impact to system availability.
If main system Fan 6 has stopped, the system will reboot with SB2/P2
disabled within 9 minutes. The customer should plan a service
action to replace the failed Fan. The Fan can be hot-swapped if
SB2/P2 has been disabled.
Customer will need to re-enable the blacklisted CPU SB2/P2.
# cfgadm -c unconfigure N0.SB2
SC> enablecomponent SB2/P2
# cfgadm -c configure N0.SB2
If main system Fan 6 is running slow, the system will report
alarms but will continue to operate. The customer should plan a
service action to bring the system down to replace the faulty fan.
Consideration should be given to exchanging the entire Fan Tray
assembly as a preventative maintenance action for any other
failures.
Configuration: 12 x CPUs (SB0, SB2 & SB4)
If main system Fan(s) 0, 1, 2, 3, 4, 5 or 7 are running slow or have
stopped, System will report alarms but will continue to operate.
These failures can be resolved with hot-swap procedure. There is no
impact to system availability.
If main system Fan 6 has stopped, the system will reboot with SB4/P2
disabled within 7 minutes and again reboot with SB2/P2 disabled
within 9 minutes. The customer should plan a service action to
replace the failed Fan. The Fan can be hot-swapped if SB4/P2 and
SB2/P2 has been disabled.
The customer will need to re-enable the blacklisted CPU(s) SB2/P2
and SB4/P2.
# cfgadm -c unconfigure NO.SB4
SC> enablecomponent SB4/P2
# cfgadm -c configure NO.SB4
# cfgadm -c unconfigure NO.SB2
SC> enablecomponent SB2/P2
# cfgadm -c configure NO.SB2
If main system Fan 6 is running slow, the system will report alarms
but will continue to operate. The customer should plan a service
action to bring the system down to replace the faulty fan.
Consideration should be given to exchanging the entire Fan Tray
assembly as a preventative maintenance action for any other
failures.
Systems Operating between 35�C and 40�C Ambient Temperature:
============================================================
Configuration: 4 x CPUs (SB0)
If main system Fan(s) 0, 1, 2, 3, 4, 5 or 6 are running slow or have
stopped, the system will report alarms but will continue to
operate. These failures can be resolved with hot-swap procedure.
There is no impact to system availability.
If main system Fan 7 has stopped, the system will reboot with SB0/P2
disabled within 9 minutes. The customer should plan a service
action to replace the failed Fan. The Fan can be hot-swapped if
SB0/P2 has been disabled.
The customer will need to re-enable the blacklisted CPU SB0/P2.
First, prepare services / databases / apps etc. for a Solaris
shutdown then
lom> shutdown
lom> enablecomponent SB0/P2
lom> poweron
Then restart services / databases / apps etc.
If main system Fan 7 is running slow, the system will report alarms
but will continue to operate. The customer should plan a service
action to bring the system down to replace the faulty fan.
Consideration should be given to exchanging the entire Fan Tray
assembly as a preventative maintenance action for any other
failures.
Configuration: 8 x CPUs (SB0 & SB2)
Affected by both Fan 7 and Fan 6 failures as previously described.
Configuration: 12 x CPUs (SB0, SB2 & SB4)
Affected by both Fan 7 and Fan 6 failures as previously described.
To aid in diagnosing and servicing the Fan issues described above,
information is being added to the following documents:
. The Product Notes will be updated in time for RR of the Sun Fire
V1280.
. The Service manual will be updated for RR of the Netra 1280.
. Service training material will be updated by GA of the product.
In addition, Caution Labels are to be fitted adjacent to Fan 6 and Fan 7.
_ ^
/!\ DISABLE SB4/P2 AND SB2/P2 BEFORE REMOVING FAN 6 |
---
_ ^
/!\ IF AMBIENT >35C DISABLE SB0/P2 BEFORE REMOVING FAN 7 |
---
Engineering are responding to an RFE to allow only a single system
reboot on Fan 6 failures. This enhancement is planned to be
implemented in release 10 of the SC Firmware, and release is planned
for GA of the product.
Implementation:
---
| | MANDATORY (Fully Proactive)
---
---
| | CONTROLLED PROACTIVE (per Sun Geo Plan)
---
---
| X | REACTIVE (As Required)
---
Corrective Action:
The following recommendation is provided as a guideline for authorized
Sun Services Field Representatives who may encounter the above
mentioned problem.
When Main Fan 6 or Main Fan 7 failures are discovered for Sun Fire
V1280 or Netra 1280 systems, follow the recovery procedures described
in the Problem Description above. For further information regarding
this issue, see Sun Fire V1280/Netra 1280 Service Manual, 817-0510-10,
and Sun Fire V1280/Netra 1280 Product Notes, 816-7124-13.
Comments:
None.
============================================================================
Implementation Footnote:
i) In case of MANDATORY FINs, Sun Services will attempt to contact
all affected customers to recommend implementation of the FIN.
ii) For CONTROLLED PROACTIVE FINs, Sun Services mission critical
support teams will recommend implementation of the FIN (to their
respective accounts), at the convenience of the customer.
iii) For REACTIVE FINs, Sun Services will implement the FIN as the
need arises.
----------------------------------------------------------------------------
All released FINs and FCOs can be accessed using your favorite network
browser as follows:
SunWeb Access:
--------------
* Access the top level URL of http://sdpsweb.central/FIN_FCO/
* From there, select the appropriate link to query or browse the FIN and
FCO Homepage collections.
SunSolve Online Access:
-----------------------
* Access the SunSolve Online URL at http://sunsolve.central/
* From there, select the appropriate link to browse the FIN or FCO index.
Internet Access:
----------------
* Access the top level URL of https://spe.Sun.COM
--------------------------------------------------------------------------
General:
--------
* Send questions or comments to finfco-manager@Sun.COM
--------------------------------------------------------------------------