Showing 1-18 of 18 projects
Distributed systems resilience library for fault tolerance and latency management
Chaos Monkey is a resiliency tool that helps applications tolerate random instance failures.
A Python library for retrying failed operations, useful for building robust and resilient applications.
A tool for building resilient cloud-based applications by randomly failing instances to test for failure tolerance.
A Go-based tool for monitoring hard drive SMART data, tracking historical trends, and identifying real-world failure thresholds.
A Swift and Objective-C testing framework that provides a matcher-based approach for writing expressive and readable tests.
A hackable HTTP proxy for resiliency testing and simulated network conditions
A backup program for disk arrays that stores parity information and recovers from up to six disk failures.
An open-source platform for evaluating and improving Generative AI applications with 20+ preconfigured checks and root cause analysis.
A pytest plugin for distributed testing and loop-on-failures testing modes.
A Rust library for managing errors and failures in your application.
A machine learning toolkit for log-based anomaly detection in AI/ML operations (AIOps)
An active monitoring software to detect failures before your customers do.
A Kotlin multiplatform library that provides a Result monad for modeling success or failure operations.
A library of malformed servers to help test the failure handling of HTTP clients.
A Go library that implements a circuit breaker pattern to handle failure in distributed systems.
A gamified chaos engineering tool for Kubernetes that lets developers inject controlled failures and observe system behavior.
A Swift library for debugging Auto Layout issues on iOS, with a focus on vibe coders building AI-powered apps.
Get weekly updates on trending AI coding tools and projects.