Click any tag below to further narrow down your results
Links
Cloudflare experienced another major outage that lasted 25 minutes, affecting 28% of its HTTP traffic. The outage stemmed from a global configuration change intended to fix a React vulnerability, which led to HTTP 500 errors across its network. This incident follows a similar outage just weeks prior, raising concerns about Cloudflare's reliability.
This article analyzes the November 2025 outage that took down major websites, including Cloudflare, due to a configuration error. It explains how a small change in a configuration file led to a cascading failure across multiple services and provides strategies to prevent similar incidents in the future.
AWS faced a major outage on October 19-20 due to a race condition in DynamoDB’s DNS management, disrupting multiple services in the Northern Virginia region. While the incident was brief, many customers experienced issues for up to 15 hours, prompting discussions on AWS reliability and future improvements.
The article discusses the recent Google Cloud outage, detailing its causes, effects on businesses and users, and the broader implications for cloud reliability. It emphasizes the consequences of such disruptions on critical operations and highlights the need for better contingency planning in cloud services.