Explore Projects

Discover 230 open source projects

Active filters (1):
Search: benchmarking

Showing 121-140 of 230 projects

mlcommons/training

Reference implementations of MLPerf® training benchmarks for evaluating machine learning performance.

1.7K
Stable
Python
ML Ops
API Frameworks
Python
#benchmark#machine-learning#performance-evaluation

stepjam/RLBench

RLBench is a large-scale benchmark and learning environment for reinforcement learning agents.

1.7K
Archived
Python
Agents & Orchestration
#reinforcement-learning#benchmark#learning-environment

GestaltCogTeam/BasicTS

A benchmarking toolkit for fair and scalable time series forecasting, focused on long-term traffic prediction.

1.7K
Stable
Python
ML Ops
Caching
#time-series-forecasting#traffic-forecasting#benchmarking

evalplus/evalplus

A rigorous benchmark for evaluating the correctness and efficiency of code generated by large language models such as GPT-4.

1.7K
Stable
Python
LLM Frameworks
Testing
Python
#benchmark#chatgpt#efficiency

RoboVerseOrg/RoboVerse

RoboVerse is a unified platform, dataset, and benchmark for scalable and generalizable robot learning.

1.7K
Active
Python
Robotics
Reinforcement Learning
Python
#robotics#reinforcement-learning#imitation-learning

martinus/nanobench

A simple, fast, and accurate C++ microbenchmarking library that can be included as a single header file.

1.7K
Archived
C++
Benchmarking
#benchmark#cpp#header-only

julienschmidt/go-http-routing-benchmark

A benchmark of Go HTTP request routers and web frameworks, comparing their performance.

1.7K
Archived
Go
API Frameworks
Express
#authentication#performance#benchmarking

decisionintelligence/TFB

A comprehensive and fair benchmark for time series forecasting methods, including deep learning and statistical techniques.

1.7K
Active
Shell
Benchmarking
Time Series
#time-series-forecasting#benchmarking#deep-learning

tczhangzhi/pytorch-distributed

A quickstart and benchmark for PyTorch distributed training, useful for ML/AI developers.

1.7K
Archived
Python
ML Ops
API Frameworks
PyTorch
#distributed-training#pytorch#benchmarking

open-mmlab/mmrazor

An open-source toolbox and benchmark for model compression and acceleration in PyTorch.

1.7K
Archived
Python
ML Ops
API Frameworks
PyTorch
#model-compression#model-acceleration#benchmark

harbor-framework/terminal-bench

A benchmark for evaluating the performance of large language models (LLMs) on complex terminal-based tasks.

1.7K
Active
Python
LLM Frameworks
CLI Tools
Python
#benchmark#llm#terminal

alecthomas/go_serialization_benchmarks

Benchmarks for evaluating Go serialization methods for performance and efficiency.

1.6K
Experimental
Go
Benchmarking
API Frameworks
#benchmarking#performance#serialization

yoshitomo-matsubara/torchdistill

A PyTorch-based framework for reproducible deep learning studies with 26 knowledge distillation methods.

1.6K
Stable
Python
ML Ops
Computer Vision
PyTorch
#deep-learning#computer-vision#natural-language-processing

MLGroupJLU/LLM-eval-survey

A survey paper on evaluating large language models (LLMs) for developers building AI-powered applications.

1.6K
Experimental
LLM Frameworks
Tutorials & Courses
#benchmark#evaluation#large-language-models

privatenumber/minification-benchmarks

A comprehensive benchmark for JavaScript minification tools, comparing performance and size metrics.

1.6K
Stable
TypeScript
Build Tools
Frontend Frameworks
React
#benchmarks#minification#performance

JonMagon/KDiskMark

A simple open-source disk benchmark tool for Linux distros, focused on performance testing.

1.6K
Stable
C++
CLI Tools
API Frameworks
#benchmarking#linux#disk-performance

ByteDance-Seed/Seed1.5-VL

A powerful vision-language foundation model designed to advance multimodal AI understanding and reasoning.

1.6K
Experimental
Jupyter Notebook
LLM Frameworks
Computer Vision
Jupyter Notebook
#multimodal-ai#vision-language-model#large-language-model

opendatalab/OmniDocBench

A comprehensive benchmark for evaluating document parsing, presented at CVPR 2025.

1.5K
Stable
Python
Computer Vision
Datasets
#computer-vision#document-parsing#benchmark

ar51an/iperf3-win-builds

A collection of prebuilt iperf3 binaries for Windows, enabling developers to easily benchmark their network limits.

1.5K
Stable
CLI Tools
API Clients & Testing
#network-benchmarking#iperf3#windows-binaries

Lifelong-Robot-Learning/LIBERO

A benchmark for evaluating knowledge transfer in lifelong robot learning.

1.5K
Experimental
Jupyter Notebook
Agents & Orchestration
API Frameworks
Jupyter Notebook
#benchmark#imitation-learning#lifelong-learning