Click any tag below to further narrow down your results
Links
The article critiques Cloudflare's response to a recent global outage, highlighting flaws in their root cause analysis that overlook fundamental database issues. It argues that the outage stems from a mismatch between application logic and database schema, suggesting that Cloudflare needs to focus on logical design rather than just physical replication to prevent future incidents.
Cloudflare experienced another major outage that lasted 25 minutes, affecting 28% of its HTTP traffic. The outage stemmed from a global configuration change intended to fix a React vulnerability, which led to HTTP 500 errors across its network. This incident follows a similar outage just weeks prior, raising concerns about Cloudflare's reliability.
This article analyzes the November 2025 outage that took down major websites, including Cloudflare, due to a configuration error. It explains how a small change in a configuration file led to a cascading failure across multiple services and provides strategies to prevent similar incidents in the future.
The author reflects on their experience during the recent Cloudflare outage, highlighting how system limits and complex failures can lead to unexpected problems. They emphasize the importance of understanding the context behind decisions made during incidents and the value of detailed incident writeups for learning.
On December 5, 2025, Cloudflare experienced a significant outage lasting about 25 minutes due to a configuration change related to their Web Application Firewall. The issue arose from a bug triggered when turning off a testing tool, resulting in HTTP 500 errors for around 28% of customer traffic. Cloudflare is implementing measures to prevent similar incidents in the future.
Cloudflare experienced a widespread outage due to an update to its Web Application Firewall meant to address a vulnerability in React Server Components. The fix caused issues for various enterprise and consumer services, highlighting the risks of relying on single service providers.
On November 18, 2025, Cloudflare experienced a significant outage due to a change in database permissions that led to an oversized feature file for their Bot Management system. This caused widespread HTTP 5xx errors across various services until the issue was resolved later that day. The article details the incident, its impact, and steps for future prevention.
A Cloudflare outage on Tuesday affected major platforms like X and ChatGPT due to a spike in unusual traffic. The issue stemmed from a configuration file that exceeded its size limit, causing a software crash. Cloudflare confirmed there was no malicious activity involved.
Cloudflare experienced a global outage, impacting access to many websites and services. The issue stemmed from a configuration file that exceeded its size limit, causing a crash in the system that manages traffic. Although the outage was resolved within a few hours, it highlighted the vulnerability of internet infrastructure.
This article discusses the implications of Downdetector relying on Cloudflare for key services during a November 2025 outage. Despite being a multi-cloud service, Downdetector's use of Cloudflare for DNS and CDN helps manage traffic spikes and maintain performance, even if it introduces a single point of failure. The piece also highlights design considerations and potential improvements for the future.
Cloudflare experienced a significant outage due to a bad configuration, impacting many popular apps and services. This incident exposes the risks of centralization in internet infrastructure and emphasizes the need for more redundancy and resilience in our digital systems.
Cloudflare faced a global outage due to a database permission update that caused 5xx errors across its services. The issue stemmed from a regression that led to duplicate data in the Bot Management system, overwhelming memory limits and crashing the service. Cloudflare has since restored service and is reviewing its systems to prevent similar issues.
The article discusses a significant service outage that occurred at Cloudflare on June 12, 2025, affecting numerous websites and services globally. It details the causes of the outage, including technical failures and their impact on users and businesses. Additionally, the company outlines measures taken to prevent similar incidents in the future.
A significant incident occurred on July 14, 2025, involving Cloudflare's 1.1.1.1 DNS service, leading to widespread internet disruptions. The article details the nature of the incident, its impact on users, and the steps taken by Cloudflare to resolve the issues.
The article discusses an outage affecting services provided by GCP (Google Cloud Platform), Cloudflare, and Anthropic, highlighting the implications for users and businesses reliant on these platforms. It examines the causes of the outage and its impact on cloud computing reliability and security.
Cloudflare experienced a significant outage on September 12, 2023, affecting both their dashboard and API services. The incident caused disruptions for users relying on these tools, leading to increased scrutiny of the company's infrastructure and response mechanisms during downtime. Cloudflare's team worked to resolve the issues and restore services as quickly as possible.