
I lost data to this kind of problem[1]. The Linux dm-raid handles this kind of failure extremely poorly (or did at the time), even when following all available tutorials. (When I reported my experience, one developer said I should have set a cronjob to recursively md5sum my / every week or so - not exactly user-friendly, and not mentioned in any of the tutorials.) When you attempt to rebuild a dm-raid array, even a raid6 one, expect to lose all your data.

Now I use ZFS (on FreeBSD), which handles this kind of error much more gracefully; if there's an isolated URE you might lose data in that particular file, but it won't destroy the whole array.
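For the curious, this is roughly what recovering from that looks like ("tank" is just a placeholder pool name):

    # Scrub the pool, then ask ZFS which files, if any, were affected.
    zpool scrub tank
    zpool status -v tank   # -v lists the files with permanent errors,
                           # so only those need restoring, not the whole pool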

[1] Yeah yeah, RAID is not a backup. I'm talking about data I didn't consider worth the cost of backing up, as a poor student at the time.

> I should have set a cronjob to recursively md5sum my / every week or so

If you use Debian, install the debsums program, which will do that for you for packaged (non-user) files and report any errors.

You should also install mdadm and set it to check the array every month.

And finally, install smartmontools and have it run a short self-test every day and a long one (i.e. a full disk read) every week.
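Roughly what that looks like on a stock Debian box - the schedules, device name and paths below are illustrative, not gospel:

    # /etc/cron.d/integrity-checks (illustrative)
    # Weekly debsums pass over installed packages; -s reports errors only
    0 4 * * 0  root  debsums -s
    # Monthly md array scrub (Debian's mdadm package ships a similar job
    # in /etc/cron.d/mdadm using the same checkarray helper)
    0 1 1 * *  root  /usr/share/mdadm/checkarray --all --idle --quiet

    # /etc/smartd.conf: short self-test daily at 02:00, long test Saturdays at 03:00
    /dev/sda -a -s (S/../.././02|L/../../6/03)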


Yeah, I now have nearly 120 TB of data under ZoL (ZFS on Linux, currently on master as of writing), replicated at a 5-minute interval between two datacenters...

Zero corruption, zero problems.
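For anyone curious, the basic building block for this kind of replication is an incremental snapshot send/receive every few minutes; a rough sketch ("tank/data" and "backup-dc" are placeholders, and tools like zrepl or syncoid automate the same idea):

    # One replication cycle: snapshot, then send the delta to the other DC
    PREV=$(zfs list -H -t snapshot -o name -s creation -d 1 tank/data | tail -1)
    NEW=tank/data@repl-$(date +%Y%m%d%H%M%S)
    zfs snapshot "$NEW"
    zfs send -i "$PREV" "$NEW" | ssh backup-dc zfs receive -F tank/data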


The main Linux raid impl (md) probably handles failures more robustly than dm-raid.
