Click any tag below to further narrow down your results
Links
HolmesGPT is an open-source AI tool designed to streamline troubleshooting in Kubernetes environments. It aggregates logs, metrics, and traces, helping on-call engineers diagnose issues faster by providing clear, actionable insights. The tool is extensible and community-driven, promoting collaboration in observability practices.
Octopus has developed a Kubernetes Live Object Status tool aimed at helping developers troubleshoot their applications more effectively. The design process involved integrating live status into existing dashboards, utilizing established status indicators, and evolving the feature based on user feedback, ultimately enhancing usability and troubleshooting capabilities. The project highlights the importance of early user engagement and iterative design in software development.
Learn how to perform a precision recovery of specific resources from etcd snapshots in Kubernetes, allowing for targeted restorations without the need for a full cluster rollback. This guide outlines a five-step process to restore critical resources like ConfigMaps while minimizing downtime and maintaining cluster stability.