Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
The most convoluted systems failure I've ever experienced - and how I prevailed (hdivider.com)
1 point by hdivider on Dec 6, 2012 | hide | past | favorite | 1 comment


This is t a systems failure, it's a process failure.

Why was the data unavailable? Hardware failure Why wasn't the most recent version available? The operator didn't manually do a "full" backup Why did the operator need to manually start a full backup? only some sub directories were periodically backed up Why was the most recent periodic backup unusable? The backup system recorded partial contents for an unknown reason. Why was the backup system recording partial content? There is no system for restore verification and content checksum comparison. Why was critical correspondence unacknowledged? The email reception is system is non redundant.

I'll let you right up the action items here.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: