Explore Projects

Discover 230 open-source projects

Active filters (1):

Search: benchmarking

Showing 121-140 of 230 projects

mlcommons/training

Reference implementations of MLPerf® training benchmarks for evaluating machine learning performance.

1.7K

Stable

Python

ML Ops

API Frameworks

#benchmark #machine-learning #performance-evaluation

stepjam/RLBench

RLBench is a large-scale benchmark and learning environment for reinforcement learning agents.

1.7K

Archived

Python

Agents & Orchestration

#reinforcement-learning #benchmark #learning-environment

GestaltCogTeam/BasicTS

A benchmarking toolkit for fair and scalable time series forecasting, focused on long-term traffic prediction.

1.7K

Stable

Python

ML Ops

Caching

#time-series-forecasting #traffic-forecasting #benchmarking

evalplus/evalplus

A rigorous benchmark for evaluating the quality and efficiency of code generated by large language models such as GPT-4.

1.7K

Stable

Python

LLM Frameworks

Testing

#benchmark #chatgpt #efficiency

RoboVerseOrg/RoboVerse

RoboVerse is a unified platform, dataset, and benchmark for scalable and generalizable robot learning.

1.7K

Active

Python

Robotics

Reinforcement Learning

#robotics #reinforcement-learning #imitation-learning

martinus/nanobench

A simple, fast, and accurate C++ microbenchmarking library that can be included as a single header file.

1.7K

Archived

C++

Benchmarking

#benchmark #cpp #header-only

julienschmidt/go-http-routing-benchmark

A benchmark of Go HTTP request routers and web frameworks.

1.7K

Archived

API Frameworks

Express

#authentication #performance #benchmarking

decisionintelligence/TFB

A comprehensive and fair benchmark for time series forecasting methods, including deep learning and statistical techniques.

1.7K

Active

Shell

Benchmarking

Time Series

#time-series-forecasting #benchmarking #deep-learning

tczhangzhi/pytorch-distributed

A quickstart and benchmark for PyTorch distributed training, useful for ML/AI developers.

1.7K

Archived

Python

ML Ops

API Frameworks

PyTorch

#distributed-training #pytorch #benchmarking

open-mmlab/mmrazor

An open-source toolbox and benchmark for model compression and acceleration in PyTorch.

1.7K

Archived

Python

ML Ops

API Frameworks

PyTorch

#model-compression #model-acceleration #benchmark

harbor-framework/terminal-bench

A benchmark for evaluating the performance of large language models (LLMs) on complex terminal-based tasks.

1.7K

Active

Python

LLM Frameworks

CLI Tools

#benchmark #llm #terminal

alecthomas/go_serialization_benchmarks

Benchmarks for evaluating Go serialization methods for performance and efficiency.

1.6K

Experimental

Benchmarking

API Frameworks

#benchmarking #performance #serialization

yoshitomo-matsubara/torchdistill

A PyTorch-based framework for reproducible deep learning studies with 26 knowledge distillation methods.

1.6K

Stable

Python

ML Ops

Computer Vision

PyTorch

#deep-learning #computer-vision #natural-language-processing

MLGroupJLU/LLM-eval-survey

A survey paper on evaluating large language models (LLMs) for developers building AI-powered applications.

1.6K

Experimental

LLM Frameworks

Tutorials & Courses

#benchmark #evaluation #large-language-models

privatenumber/minification-benchmarks

A comprehensive benchmark for JavaScript minification tools, comparing performance and size metrics.

1.6K

Stable

TypeScript

Build Tools

Frontend Frameworks

React

#benchmarks #minification #performance

JonMagon/KDiskMark

A simple open-source disk benchmark tool for Linux distros, focused on performance testing.

1.6K

Stable

C++

CLI Tools

API Frameworks

#benchmarking #linux #disk-performance

ByteDance-Seed/Seed1.5-VL

A powerful vision-language foundation model designed to advance multimodal AI understanding and reasoning.

1.6K

Experimental

Jupyter Notebook

LLM Frameworks

Computer Vision

#multimodal-ai #vision-language-model #large-language-model

opendatalab/OmniDocBench

A comprehensive benchmark for evaluating document parsing, accepted at CVPR 2025.

1.5K

Stable

Python

Computer Vision

Datasets

#computer-vision #document-parsing #benchmark

ar51an/iperf3-win-builds

A collection of prebuilt iperf3 binaries for Windows, enabling developers to easily benchmark their network limits.

1.5K

Stable

CLI Tools

API Clients & Testing

#network-benchmarking #iperf3 #windows-binaries

Lifelong-Robot-Learning/LIBERO

A benchmark for evaluating knowledge transfer in lifelong robot learning using AI tools.

1.5K

Experimental

Jupyter Notebook

Agents & Orchestration

API Frameworks

#benchmark #imitation-learning #lifelong-learning
