Key Highlights
- AWS cloud service restored after a major outage that affected thousands of websites and applications globally.
- The issue originated in the US-EAST-1 region in Virginia, which also suffered outages in 2020 and 2021.
- Services such as Snapchat, Reddit, Zoom, and Venmo were among those affected by the outage.
- The root cause was identified as a network health monitoring issue within Amazon’s EC2 internal network.
A Major Cloud Outage Shakes Global Businesses
On October 20, 2025, AWS (Amazon Web Services), the world’s largest cloud provider, suffered a significant outage that disrupted operations for thousands of businesses and individuals worldwide. The service hosts applications and computing processes for companies around the globe. The problems originated in its US-EAST-1 region in Virginia, which has had several outages in recent years.
The Outage: A Global Impact
Companies such as Snapchat, Reddit, Zoom, Venmo, and even major financial institutions were affected by the AWS disruption. The outage highlighted the interconnected nature of digital services, with one provider’s failure impacting a wide array of businesses and everyday tasks.
“This outage once again highlights the dependency we have on relatively fragile infrastructures,” said Jake Moore, global cybersecurity advisor at European cybersecurity firm ESET. Beyond consumer apps, the disruption also hit financial platforms such as Lloyds Bank in Britain. According to Downdetector’s UK site, over 4 million users reported issues due to the incident.
Technical Details and Expert Insights
AWS traced the root cause of the outage to an underlying subsystem within its EC2 internal network that monitors the health of network load balancers, which distribute traffic across multiple servers. The problem was first observed in the early hours of the day, and AWS said that “all AWS services returned to normal operations” by 3 p.m. PT (2200 GMT).
However, some services such as AWS Config, Redshift, and Connect had a backlog of messages that would take a few hours to process.
Ken Birman, a computer science professor at Cornell University, said software developers need to build better fault tolerance into their applications.
“When people cut costs and cut corners to try to get an application up, and then forget that they skipped that last step and didn’t really protect against an outage, those companies are the ones who really ought to be scrutinized later,” Birman told Reuters. Birman and other experts stressed the importance of redundancy, including backup services with other cloud providers.
Future Implications
The incident serves as a stark reminder of the critical role that cloud services play in modern business operations. As more companies rely on these services for their digital infrastructure, the potential impact of outages becomes increasingly significant.
“For major businesses, hours of cloud downtime translate to millions in lost productivity and revenue,” said Ryan Griffin, U.S. cyber practice leader at insurance broker McGill and Partners. Wall Street’s reaction was muted: Amazon shares rose 1.6% to $216.48 after the outage.
Experts recommend that companies take proactive steps to ensure resilience in their cloud strategies, including regular backups, disaster recovery plans, and leveraging multiple cloud providers for redundancy.
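To make the redundancy recommendation concrete, the following is a minimal sketch of application-level failover, not a description of how any affected company or AWS itself handles outages. It tries a list of providers in order and returns the first successful response. The function `call_with_failover`, the provider names, and the stand-in callables `primary_down` and `secondary_ok` are all hypothetical; a real deployment would replace them with actual service clients and would likely add timeouts, retries, and health checks.

```python
from typing import Callable, Sequence, Tuple


class AllProvidersFailed(Exception):
    """Raised when every configured provider fails."""


def call_with_failover(
    providers: Sequence[Tuple[str, Callable[[], str]]],
) -> Tuple[str, str]:
    """Try each (name, fetch) pair in order; return the first success.

    Returns a tuple of (provider name that answered, its response).
    """
    errors = []
    for name, fetch in providers:
        try:
            return name, fetch()
        except Exception as exc:  # any failure triggers failover
            errors.append(f"{name}: {exc}")
    raise AllProvidersFailed("; ".join(errors))


# Hypothetical stand-ins for real service clients (assumptions for illustration).
def primary_down() -> str:
    raise ConnectionError("primary region unreachable")


def secondary_ok() -> str:
    return "ok"


if __name__ == "__main__":
    used, result = call_with_failover(
        [("primary", primary_down), ("secondary", secondary_ok)]
    )
    print(used, result)
```

The design choice here is deliberately simple: providers are tried in a fixed order, so the secondary only carries traffic when the primary fails. Keeping data in sync across providers, which is usually the harder part of a multi-cloud setup, is outside the scope of this sketch.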
The AWS outage underscores the vulnerability of global digital infrastructure and the importance of robust planning and preparation by both service providers and end-users.