On our IBM DS3500 SAN we started seeing VDD errors and repairs. This was coupled with a drive returning a Check Condition and a destination drive error. After speaking with vendor support these problems are caused by the controller reading data off the hard drive which fails checksum. The array will automatically recover. While these can be transient in nature it can also preface a failing drive, especially when you see a drive error with it.
To be on the safe side we replaced the drive. When replacing the drive it's recommended to unassign an global hot spares until the new drive is inserted and the rebuild starts. If you have hot spares assigned when you fail the drive the array will immediately start rebuilding onto the hot spare.
No comments:
Post a Comment