8 links
tagged with all of: outage + dns
Links
A DNS race condition in Amazon's DynamoDB system caused a significant outage that disrupted major websites and services, resulting in potential damages reaching hundreds of billions of dollars. The issue stemmed from a failure in the automated DNS management system, leading to widespread DNS failures and affecting various AWS services. Amazon has since disabled the affected systems and is working to implement safeguards against a recurrence.
AWS experienced a significant outage on October 20, caused primarily by DNS issues, which the article links to the departure of senior engineers and the resulting loss of institutional knowledge. Many internet services were disrupted, illustrating the potential consequences of a talent drain within AWS. The situation raises questions about the company's ability to handle future incidents with a less experienced workforce.
Amazon Web Services experienced a significant outage on Monday, affecting numerous major websites including Disney+, Reddit, and United Airlines. Although most services were restored within hours, the outage highlighted the fragility of reliance on major cloud providers, with AWS confirming it was caused by DNS issues related to its DynamoDB service.
A significant incident occurred on July 14, 2025, involving Cloudflare's 1.1.1.1 DNS service, leading to widespread internet disruptions. The article details the nature of the incident, its impact on users, and the steps taken by Cloudflare to resolve the issues.
A single software bug in Amazon's DynamoDB DNS management system caused a significant outage of Amazon Web Services, affecting millions globally for over 15 hours. The failure stemmed from a race condition triggered by the interaction of two components within the system, which led to widespread service disruptions reported by thousands of organizations.
Amazon Web Services resolved a significant outage that affected over 1,000 apps and websites, including Snapchat and major banks, highlighting the risks of relying heavily on a single cloud provider. Experts emphasized the need for companies to build more resilient systems and questioned the sustainability of the current concentration of cloud services among a few major players. The outage, attributed to DNS resolution issues, sparked discussions on the vulnerabilities in the infrastructure of online services.
AWS experienced a significant outage caused by a major DNS failure linked to a race condition within DynamoDB's infrastructure, affecting users globally for over 14 hours. The incident led to the accidental deletion of all IP addresses for the database service's regional endpoint, causing widespread connectivity issues. In response, Amazon has implemented measures to prevent a recurrence and apologized to customers for the disruption.
The article discusses a significant 14-hour outage of AWS's us-east-1 region, which affected 140 services including EC2, due to a latent race condition in the DynamoDB DNS management system. The author analyzes the outage's causes and emphasizes the complexity and critical nature of AWS's infrastructure, suggesting that oversimplified explanations do not capture the depth of the incident.