Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition

Solution Type: Sun Alert Sure Solution

1021703.1: Potential for System Outages Due to Cooling Issues on Sun Fire V1280/E2900, Netra 1280/1290

PreviouslyPublishedAs: 273971
Bug Id: <SUNBUG: 6864515>
Product:
- Sun Fire V1280 Server
- Sun Fire E2900 Server
- Netra 1280 Server
- Sun Netra 1290 Server
Date of Workaround Release: 09-Dec-2009
Date of Resolved Release: 29-Oct-2010

1. Impact

A cooling issue has been found to occur on certain systems where, over time, increased temperature can contribute to system panic or reset, causing loss of availability to applications or to the system as a whole.

2. Contributing Factors

This issue can occur on the following platforms:

SPARC Platform

Note: Proper system maintenance in this case refers to air filters that may not have been properly cleaned or serviced over the life of the product, contributing to increased system temperature and likelihood of this condition.

3. Symptoms

A typical error scenario may include one or more of the following error messages in the System Controller's (SC) log files ('showlogs -v' or from the console):

    Path broken between CBH and SDC:SB# (where # is the board number)

4. Workaround

The following additional cooling actions may increase the life of the components over time, but will not eliminate the potential for component failure, as additional cooling may not have enough time to significantly alter the life expectancy of those components.

Perform the following actions:

STEP 1: Obtain output from the following commands from each system to collect baseline temperature readings (from the lom prompt on the System Controller). Archive and retain this information for your records.

    lom> showhostname
    lom> showdate
    lom> showenv -ltuvw

STEP 2: Improve ambient air temperature levels as much as possible for all Sun Fire E2900, V1280 and Netra 1280, 1290 systems by performing the following steps, if applicable:

- Position additional vented floor tiles or make other ventilation changes to reduce ambient air temperature.
- Move the system to a cooler environment, for example by relocating it or changing its rack mount position.
- Reduce the ambient air temperature via increased cooling if the environment can support this.
- Validate that any empty board slots have the proper filler panel installed to ensure correct chassis airflow.

STEP 3: Clean, remove or replace filters:

For Sun Fire V1280 and E2900 systems:

- Remove the left input air filter to increase the chassis airflow. Note: filters that have not been cleaned previously may stick to the door.
- If 1950MHz CPUs are installed, the left air filter should already be removed.
- Page 1 of the Sun Fire E2900/V1280 and Netra 1280 Systems Filter Installation Guide (http://dlc.sun.com/pdf/817-2680-12/817-2680-12.pdf) provides details on removing the filter.
- Install System Controller Application (ScApp) Firmware 5.20.14 patch 114527-15 to increase chassis airflow through an increase in fan speed. After the firmware is installed, reboot the SC and keyswitch the domains as recommended in the firmware installation instructions (which effectively reboots the system).

For Netra 1280/1290 systems:

- Clean or replace the left input air filter. Note: filters that have not been cleaned previously may stick to the door.
- Air Filter kits are Sun order number X6806A-Z.
- Filters should be inspected and cleaned or replaced as necessary every 3-6 months, as per the Periodic Maintenance instructions in the Service Manual for the 1280 at http://dlc.sun.com/pdf/817-0510-15/817-0510-15.pdf and for the 1290 at http://dlc.sun.com/pdf/819-4373-10/819-4373-10.pdf, section C-1.
- The Sun Fire E2900/V1280 and Netra 1280 Systems Filter Installation Guide at http://dlc.sun.com/pdf/817-2680-12/817-2680-12.pdf provides details on replacing the filter.
- Install System Controller Application (ScApp) Firmware 5.20.14 patch 114527-15 to increase chassis airflow through an increase in fan speed. After the firmware is installed, reboot the SC and keyswitch the domains as recommended in the firmware installation instructions (which effectively reboots the system).

STEP 4: As close as possible to 24 hours later, repeat STEP 1 to obtain post-change temperature readings. If the system is significantly busier or less busy for this second reading than for the first, temperature differences may also be significant. For the most accurate measurement of improvement, try to take this reading when the system state and data center environment are nearly identical to the baseline (i.e., at the same time of day). Archive and retain this post-change information for your records for future comparison.
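
The before-and-after comparison described in STEP 4 can be sketched in Python. This is a minimal illustration and not part of the Sun procedure. It assumes the temperature readings have already been copied by hand from the archived STEP 1 and STEP 4 output into dictionaries keyed by a sensor label. The labels, the sample values, and the function names `temperature_deltas` and `average_delta` are hypothetical, and the sketch makes no attempt to parse System Controller output.

```python
def temperature_deltas(baseline, post_change):
    """Return {sensor: post - baseline} for sensors present in both readings."""
    common = sorted(set(baseline) & set(post_change))
    return {name: round(post_change[name] - baseline[name], 1) for name in common}


def average_delta(deltas):
    """Mean change across all compared sensors; a negative value means cooler."""
    if not deltas:
        raise ValueError("no sensors in common between the two readings")
    return round(sum(deltas.values()) / len(deltas), 1)


# Hypothetical readings in degrees C, transcribed by hand from the archived
# baseline (STEP 1) and post-change (STEP 4) records.
baseline = {"SB0 board": 48.0, "SB2 board": 51.0, "IB6 board": 44.0}
post_change = {"SB0 board": 44.0, "SB2 board": 46.0, "IB6 board": 41.0}

deltas = temperature_deltas(baseline, post_change)
for sensor, change in deltas.items():
    print(f"{sensor}: {change:+.1f} C")
print(f"average change: {average_delta(deltas):+.1f} C")
```

Only sensors that appear in both readings are compared, so a board added or removed between the two measurements does not skew the average. As STEP 4 notes, differences in system load between the two readings can account for part of any change.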

Filter removal (Sun Fire E2900 and V1280) and replacement (Netra 1280 and 1290) should reduce average board temperatures (individual server temperature results may vary). Archive and retain the "showenvironment" data in case it is needed in the future as a benchmark of the impact of this procedure.

5. Resolution

This issue is addressed in the following release:

Modification History

29-Oct-2010: No further Engineering activity, issue is Resolved

References

<SUNPATCH: 114527-15>

Internal Comments (for SAs)

Support Personnel: see Field Action Bulletin (FAB) 270169 regarding system specific information for Sun Fire E2900/V1280 Systems and Netra 1280/1290 Systems in <document: 1021064.1>

For additional questions regarding this issue, email the d150-support@sun.com email alias. This is an internal only support alias.

Internal Contributor/submitter: Joshua.Freeman@sun.com, roy.stiles@sun.com
Internal Eng Responsible Engineer: Steve.Kurihara@Sun.COM, Jim.Ye@Sun.COM
Internal Services Knowledge Engineer: david.mariotto@sun.com
Internal Eng Business Unit Group: Systems Group - Enterprise Systems, Systems Group - Netra Systems and Networking
Internal Escalation ID: 1-554249604
Internal Resolution Patches: 114527-15
Attachments: This solution has no attachment