Click any tag below to further narrow down your results
Links
Cloudflare experienced significant network failures in November and December 2025, prompting them to launch a "Code Orange: Fail Small" initiative. This plan focuses on improving the resilience of their network by implementing controlled rollouts for configuration changes, enhancing failure handling, and streamlining emergency response processes.
This article introduces Sumo Logic's Dojo AI, a new approach to security operations that emphasizes resilience over reaction. It details how specialized AI agents streamline analyst workflows by summarizing alerts, generating queries, and providing context, allowing analysts to focus on significant threats rather than drowning in noise.
The article discusses recent cloud outages and their impact on businesses, emphasizing the importance of resilience in online services. It advocates for a multi-vendor strategy to enhance reliability and performance, ensuring platforms can handle unexpected disruptions without downtime.
This article discusses how to prevent malicious processes from shutting down eBPF agents using kernel-level hooks. It outlines strategies for securely managing shutdown requests to ensure agents can be safely updated without compromising security.