Amazon Web Services (AWS) announced late Monday that it had repaired a massive technical fault that knocked out access to thousands of websites and apps around the world for much of the day.
More than 1,000 platforms — including Snapchat and major banks like Lloyds and Halifax — went offline because of failures within Amazon’s cloud network in the United States. Downdetector, which tracks service interruptions, recorded more than 11 million problem reports during the disruption.
Experts said the event highlighted how fragile the internet has become as global systems rely heavily on a few cloud providers.
Small glitch, global consequences
Professor Alan Woodward from the University of Surrey said the outage revealed the deep interdependence of the online world. He warned that even the biggest platforms rely on outside infrastructure that can fail. “A single small error can ripple across countless services,” he said.
The first signs of trouble appeared around 07:00 BST on Monday when users reported issues accessing platforms such as Fortnite and Duolingo.
By midday, Downdetector had received more than four million reports from 500 different sites — double its usual weekday average. That number later surged past 11 million as more services, including Reddit and Lloyds Bank, were hit.
By 23:00 BST, Amazon said all AWS systems were fully restored after engineers limited parts of their network to fix the root cause.
Engineers battle system chain reaction
Mike Chapple, an information technology professor at Notre Dame University, compared the event to a regional power failure. He said Amazon likely fixed initial symptoms before tackling deeper problems. “It’s like restoring lights before realising the grid itself is unstable,” he explained.
Amazon has not yet given a full explanation of the cause. In a brief update, it said the fault seemed tied to DNS resolution within the DynamoDB API in its US-EAST-1 region.
DNS, or Domain Name System, works like a phone book for the internet, converting website names into numerical addresses. When that process breaks, browsers can’t find sites, cutting off users instantly.
Dependence on big cloud providers sparks warnings
Cloudflare’s chief executive Matthew Prince said the incident exposed how much control a few cloud giants have over the internet. “Everyone has a bad day, and today was Amazon’s,” he said. “Cloud systems are powerful, but when one fails, the world feels it.”
Cori Crider, head of the Future of Technology Institute, compared the disruption to “a key bridge collapsing in the digital economy.” She said about 70% of global cloud computing depends on Amazon, Microsoft, and Google — a concentration she called “a structural risk.”
“When one major provider stumbles, entire sectors go down with it,” Crider warned. She called for governments and businesses to support local cloud providers to prevent a repeat of such a large-scale breakdown.
Companies told to improve digital resilience
Cornell University professor Ken Birman said businesses that rely on AWS must take part of the blame. “Many firms still don’t design proper protection systems for their applications,” he said.
Birman noted that cloud outages happen regularly, though not always at this scale. “We already know how to make these systems stronger,” he said. “But too many companies prioritise convenience over stability.”
Accountability questions and future risks
The fallout from the outage may not end with technical repairs. Legal disputes could follow as affected companies count their losses. After a major CrowdStrike failure last year, Delta Airlines is still seeking to recover more than $500 million in damages after being forced to manually restart 40,000 servers.
The Amazon outage has reignited debate about whether the world’s digital backbone is too concentrated in too few hands — and whether one company’s failure can once again bring much of the internet to a halt.
