3 min read
|
Saved February 14, 2026
|
Copied!
Do you care about this?
A software update at Snowflake led to a 13-hour outage affecting 10 global regions, preventing customers from querying data or ingesting files. The issue stemmed from a backward-incompatible database schema change, which created version mismatch errors across the platform.
If you do, here's more
Snowflake's recent software update led to a significant 13-hour outage affecting 10 of its 23 global regions on December 16. Customers faced issues with querying data and ingesting files, receiving “SQL execution internal error” messages. The root cause was a backwards-incompatible database schema change that introduced version mismatch errors, making operations fail or slow down. The outage impacted regions including Azure East US 2, AWS US West, and Google Cloud Platform Europe, among others. Snowflake initially estimated a restoration time of 15:00 UTC but extended it to 16:30 UTC due to delays in the Virginia region.
The incident highlighted vulnerabilities in Snowflake’s architecture. Sanchit Vir Gogia from Greyhound Research pointed out that the failure stemmed from a logical issue rather than a physical one, exposing flaws in how platforms test their updates against real-world conditions. The problem arose when metadata changes affected multiple regions simultaneously, which is often overlooked in regional redundancy planning. Snowflake's staged deployment process, designed to monitor activity and allow for rollbacks, failed to prevent widespread outages, as the changes propagated before they could be detected.
Gogia also drew parallels between this outage and a security incident earlier in 2024, where 165 customers were targeted due to stolen credentials. Both events reflect a deeper issue with control maturity under stress, pointing to weaknesses in compatibility and identity governance. He emphasized the need for CIOs to rethink operational resilience by focusing on behavioral questions about how platforms respond when failures occur, rather than just compliance metrics or uptime averages.
Questions about this article
No questions yet.