05-09-2013 01:46 AM
Could you please help me identify the problem with the msa2012fc storage system.
The system hanged blocking the access to data, failing 3 out of 6 hdds in a raid6 array. The first error logged in the event log related to the failure is A193790 Drive link down Chan0.
After this the controllers were down, was unable to connect through SMU. Had to reboot the system, after some time the system recovered and the drives/volumes seem ok, but the would like to understand the cause of this, and ways to prevent this occuring again.
05-08 16:42:30 19 A193803 Rescan bus done. Reason Code: 28. Found 0 drives, 1 Drive Enclosure
W 05-08 16:41:59 44 A193802 Unwritable cache data exists for volume (volume: , SN: 00c0ffd7a1440000e6691e4a01000000) comprising 1% of cache space
C 05-08 16:41:59 207 A193801 Vdisk scrub failed, error code 1. 1 error(s) found (Vdisk: datavd, SN: 00c0ffd7a1440000b9691e4a00000000)
W 05-08 16:38:10 1 A193800 Vdisk critical: datavd, SN: 00c0ffd7a1440000b9691e4a00000000
C 05-08 16:38:10 314 A193799 FRU type: drive, problem: encl 0 deviceID 6. Vendor: SEAGAT Product ID: ST3146356SS , S/N: 3QN0D0YK00009919P7SK rev: 0004. Related event ID: 193798, type: 8
W 05-08 16:38:10 8 A193798 Vdisk datavd drive down (Channel:0 ID:6 SN:3QN0D0YK00009919P7SK Encl:0 Slot:6)
W 05-08 16:38:10 1 A193797 Vdisk critical: datavd, SN: 00c0ffd7a1440000b9691e4a00000000
C 05-08 16:38:10 314 A193796 FRU type: drive, problem: encl 0 deviceID 5. Vendor: SEAGAT Product ID: ST3146356SS , S/N: 3QN0DD7F00009919P7XE rev: 0004. Related event ID: 193795, type: 8
W 05-08 16:38:10 8 A193795 Vdisk datavd drive down (Channel:0 ID:5 SN:3QN0DD7F00009919P7XE Encl:0 Slot:5)
W 05-08 16:38:10 1 A193794 Vdisk critical: datavd, SN: 00c0ffd7a1440000b9691e4a00000000
C 05-08 16:38:10 314 A193793 FRU type: drive, problem: encl 0 deviceID 3. Vendor: SEAGAT Product ID: ST3146356SS , S/N: 3QN0CZWK00009919BSS3 rev: 0004. Related event ID: 193792, type: 8
W 05-08 16:38:10 8 A193792 Vdisk datavd drive down (Channel:0 ID:3 SN:3QN0CZWK00009919BSS3 Encl:0 Slot:3)
05-08 16:36:39 59 A193791 Disk channel error (Channel:0 ID:129 SN:3QN0DC9R00009918AP1E Encl:0 Slot:1): Abort Timeout cdb:Rd 0fc19200 0080
05-08 16:36:39 114 A193790 Drive link down Chan0
05-09-2013 07:20 AM
In event log check the past events, typically Event code 58 is for drive errors. I would recommend replacement of the drives if there is any errors.
here's the link to event description guide,
05-14-2013 11:47 PM
Thanks for the reply. Unfortunately the log is somehow truncated, or cannot go back that much, so couldn't find any drive error in the past events.
Probabily have to wait for the new error in order to identify the failing disk.
05-15-2013 08:47 AM
Run ADU & Online Insight Diagnostics to verify drive health.
05-24-2013 03:21 AM
To show events older than those listed in the GUI telnet to the MSA and use the 'show events' command.
Hope this helps,
Kudos gratefully accepted - How to assign...