Showing 1-3 of 3 projects
A Python framework for evaluating and benchmarking large language models (LLMs) and their capabilities.
A framework for testing and evaluating large language models, prompts, and AI agents for security and performance.
An AI-powered security toolkit for LLM vulnerability scanning and red teaming.
Get weekly updates on trending AI coding tools and projects.