Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition | |||
|
|
Solution Type Sun Alert Sure Solution 1000338.1 : LAN Congestion may Result in Dual Controller Reboot on a Sun StorEdge 6130 Array
PreviouslyPublishedAs 200458 Product Sun StorageTek 6130 Array Date of Resolved Release 13-SEP-2006 Impact Both controllers in a Sun StorEdge 6130 array have an ethernet network port connected to a LAN for management of the array. If there is a very high volume of traffic on this LAN then this could result in both controllers rebooting. Examples of possible triggers for a high volume of LAN traffic are a "Denial of Service" attack, or a network misconfiguration causing a "routing loop". The reboot of both array controllers simultaneously will result in all data on the array going offline, and all attached data hosts losing access to that data. Contributing Factors This issue can occur on the following platform:
Symptoms If the described issue occurs, the array's log file (Major Event Log) will show a controller reboot at the same time on both the "A" and "B" controller: Date/Time: Tue Jul 25 13:36:09 BST 2006 Sequence number: 5914 Event type: 2606 Event category: Notification Priority: Informational Description: Start-of-day routine begun Event specific codes: 0/0/0 Component type: Controller Component location: Controller in slot B Logged by: Controller in slot B Raw data: ..... Date/Time: Tue Jul 25 13:36:13 BST 2006 Sequence number: 5926 Event type: 2606 Event category: Notification Priority: Informational Description: Start-of-day routine begun Event specific codes: 0/0/0 Component type: Controller Component location: Controller in slot A Logged by: Controller in slot A Raw data: ..... Workaround There is no workaround. Please see the "Resolution" section below. Resolution This issue is addressed in the following release:
Note: Controller firmware release 06.12.27.10 is supplied with Sun StorageTek Array Host Software (CAM) 5.0 available for download at: Previously Published As 102601 Internal Comments This is Engenio CR 98081. The bug is that the controllers spend too much time handling the excessive network traffic and this results in Watchdog Timeouts on both controllers. The fix in firmware 06.12.27.10 involves recognizing a state of "receive flooding" and not allowing this state to consume too much controller resources. Internal Contributor/submitter matt.ascroft@sun.com Internal Eng Business Unit Group NWS (Network Storage) Internal Eng Responsible Engineer gary.gaston@Sun.COM Internal Services Knowledge Engineer jeff.folla@sun.com Internal Escalation ID 1-18575298 Internal Sun Alert Kasp Legacy ID 102601 Internal Sun Alert & FAB Admin Info Critical Category: Availability ==> HA-Failure Significant Change Date: 2006-09-13 Avoidance: Upgrade Responsible Manager: larry.robinson@sun.com Original Admin Info: [WF 13-Sep-2006, Jeff Folla: Sent for release.] [WF 11-Sep-2006, Jeff Folla: Waiting for bus group approval.] [WF 09-Sep-2006, Jeff Folla: Sent for review.] [WF 31-Aug-2006, Jeff Folla: Sent e-mail to the submitter asking him if there is a Sun BugID that is tracking this issue.] Product_uuid 8252cb91-d771-11d8-ab52-080020a9ed93|Sun StorageTek 6130 Array Attachments This solution has no attachment |
||||||||||||
|