Will risk of data loss during RAID rebuild time become major concern with increase in disk capacity?Are there any studies that looked at probability of second disk with uncorrectable errors during RAID reconstruction? If you know any studies or reliability model, send me a message through comments or via email.
How to find my email address? View my complete profile > My Web Page >Contact Us.As the disk capacity is increasing, it is taking longer to rebuild the RAID group. And during this reconstruction time, there is no protection in place for stored data against total loss other than the last good backup. With typical RAID5 rebuild rate of 10 - 15GB/hr, reconstruction of a RAID group with high capacity disk, such as 500GB disk, can even be longer than the 24 hour backup rotation.
How vulnerable and aware organizations are to data loss during RAID rebuild? What are they doing to protect themselves against the second disk failure during RAID reconstruction?
Previously, I considered several alternative but still looking at ways to mitigate this risk elegantly.
- RAID10 instead of RAID5 as default RAID group.
- Dual parity RAID techniques.
- Initiating snapshot and backup upon detection of first disk failure.