Showing 61-80 of 222 projects
An open-source LLMOps platform for prompt playground, prompt management, LLM evaluation, and LLM observability.
A Python library for reinforcement learning environments and evaluations targeted at AI-focused developers.
Open-source toolkit for evaluating large multi-modal AI models, supporting 220+ models and 80+ benchmarks.
A PyTorch library for computing Frรฉchet Inception Distance (FID), a metric used to evaluate generative adversarial networks.
A multimodal evaluation toolkit for assessing AI models across text, image, video, and audio tasks.
A Java library for tracking and evaluating the performance of investment portfolios across stocks, crypto, and other assets.
An efficient similarity search library and toolkit for evaluating k-NN methods in non-metric spaces.
HElib is an open-source C++ library for homomorphic encryption, supporting BGV and CKKS schemes.
A GitHub repository providing comments for awesome courses on public universities' course evaluations.
A comprehensive benchmark to evaluate large language models (LLMs) as agents for various tasks.
A Python library for evaluating the capabilities of large language models trained on code.
MTEB is a benchmark for evaluating and comparing text embedding models across multiple tasks and languages.
Klipse is a JavaScript plugin for embedding interactive code snippets in tech blogs, supporting various programming languages.
An open platform for managing, monitoring, and optimizing large language models (LLMs) and AI workflows.
A Python library for evaluating the performance of neural networks for object detection.
An open-source visual programming environment for battle-testing prompts to large language models.
Provides data, models, and evaluation benchmark for large language models.
T5X is a flexible and extensible framework for training and evaluating T5 models, a popular family of language models.
Fast, portable expression evaluator with gradual typing for safe, non-Turing complete scripting in Go
Deep learning model for extracting & analyzing table structures from PDFs and images with datasets.
Get weekly updates on trending AI coding tools and projects.