Showing 21-38 of 38 projects
A free and comprehensive DevOps learning roadmap for kickstarting your DevOps career in the cloud native era.
Chaos engineering toolkit and orchestration for developers to build more reliable and resilient systems.
A cloud-native DataOps and AIOps platform for building and operating data-intensive applications.
HolmesGPT is an AI agent that helps SREs and DevOps teams solve incidents faster with automatic correlations, investigations, and more.
A tool that traces the usage of the JNI API in Android apps, useful for reverse-engineering and security analysis.
This is a study plan for becoming a Site Reliability Engineer, not a developer discovery platform for vibe coders.
A web UI for the Jaeger distributed tracing system, built with React and JavaScript.
A curated list of Site Reliability and Production Engineering tools for developers.
A collection of postmortem templates for incident reporting and site reliability engineering.
An active monitoring software to detect failures before your customers do.
A tool that generates the Google SRE ebook in multiple formats (EPUB, MOBI, PDF) from the source content.
An operational handbook for DevOps professionals, with practical shell and Python scripts.
A Python-based textual UI for the Terraform infrastructure as code tool, providing a more interactive CLI experience.
An open-source runtime for building and managing intelligent agents across local, cloud, and edge environments.
A research and production-oriented toolkit for speaker verification, recognition, and diarization using AI and ML techniques.
Layerform helps engineers create reusable environment stacks using plain .tf files for multiple staging environments.
A collection of useful Git utilities and scripts for developers working in a shell environment.
A guide to the NixOS operating system and the Nix declarative expression language.
Get weekly updates on trending AI coding tools and projects.