Awesome
Major outages and incidents
A sample of major outages for infrastructure-y services, as well as other incidents. These are almost entirely sourced from the excellent SREweekly, which is much more comprehensive.
2019-07-13 New York City power outage
- Power Restored to Manhattan’s West Side After Major Blackout
- Blackout: Con Edison Apologizes, but Offers Few Clues About ‘Root Cause’
2019-07-11 Twitter
2019-07-10 Stripe
- @stripestatus
- Root cause analysis: significantly elevated error rates on 2019-07-10
- Stripe Outage Smacked Businesses for Two Hours
2019-07-02 Cloudflare
2019-06-28 Slack
Outage: Degraded functionality with several features
2019-06-24 Verizon BGP
- Internet Disruption report tweet
- How Verizon and a BGP Optimizer Knocked Large Parts of the Internet Offline Today (Cloudflare blog)
2019-06-22 Dutch telephone outage
Dutch telephone outage takes out nation's emergency number
2019-06-16 Target stores point-of-sale outage
Target checkout systems back up after hourslong outage
2019-06-16 Argentina & Uruguay power outage
‘Massive Failure’ in Power Grid Causes Blackout in Argentina and Uruguay (NY Times)
2019-06-02 Google Cloud Platform
2019-05-17 Salesforce
Multi-Instance Core and Communities Service Disruption starting May 17, 2019
2019-05-07 Mozilla add-ons
2019-05-02 Azure, Microsoft 365
RCA - Network Connectivity - DNS Resolution (Scroll down to 5/2)
2019-04-17 Gmail
Gmail Suffers Two-Hour Global Outage: Reports
2019-04-04 Slack
Outage: Customers are experiencing degraded funcionality
2019-03-27 Travis CI
Incident review for slow booting Linux builds outage