Showing 1-4 of 4 projects
A Python framework for evaluating and benchmarking large language models (LLMs) and their capabilities.
Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more for vibe coders.
An open-source guide for building a 'Tiny-Universe' of large language models and AI tools.
An open-source Python implementation for 3D multi-object tracking, with KITTI benchmarking and new evaluation metrics.
Get weekly updates on trending AI coding tools and projects.