Posts Tagged ‘post-incident response’

By: Tony Howlett
Date: Mar 21, 2017
The current readout on the eastern region outage of Amazon’s popular AWS cloud service is that it was caused by a single technician’s errant command. The four-hour outage caused downtime for many large websites and apps that depend on the service – including Netflix, Slack, and the SEC. It also made headlines in all the major news services. And while the popular view holds that the root cause was a techie typo, this was not the ultimate underlying problem. Sure, the sequence of events was put in… Read More