Explore Projects

Discover 222 open source projects

Active filters (1):
Search: evaluation

Showing 161-180 of 222 projects

silentmatt/expr-eval

A JavaScript library for parsing and evaluating mathematical expressions.

1.3K
Archived
JavaScript
General Utilities
Frontend Frameworks
JavaScript
#math #expressions #parsing

google/adk-java

An open-source Java toolkit for building, evaluating, and deploying sophisticated AI agents.

1.3K
Active
Java
Agents & Orchestration
API Frameworks
#ai #agents #multi-agent-systems

hendrycks/math

A dataset of mathematical reasoning problems for evaluating AI systems.

1.3K
Stable
Python
Logic, Math & Reasoning
Python
#math #logic #reasoning

abo-abo/lispy

A Lisp editing library for Emacs focused on providing a streamlined and efficient coding experience.

1.3K
Active
Emacs Lisp
IDE Extensions
Backend Frameworks
Emacs
#lisp #emacs #coding-efficiency

Topdu/OpenOCR

An open-source toolkit for general OCR research and applications, with integrated training, evaluation, and production-ready OCR systems.

1.3K
Active
Python
Computer Vision
Backend Frameworks
PyTorch
#ocr #document-processing #computer-vision

tensorflow/model-analysis

This is a set of model analysis tools for TensorFlow, enabling developers to evaluate and optimize their machine learning models.

1.3K
Stable
Python
ML Ops
#tensorflow #model-evaluation #model-analysis

aws-solutions-library-samples/guidance-for-training-an-aws-deepracer-model-using-amazon-sagemaker

Guidance for training an AWS DeepRacer model using Amazon SageMaker, providing developers full control over the process.

1.3K
Archived
Jupyter Notebook
ML Ops
API Frameworks
Jupyter Notebook
#aws-deepracer #amazon-sagemaker #model-training

refreshdotdev/web-eval-agent

An autonomous web application evaluation agent powered by MCP and Playwright for vibe coders.

1.2K
Active
Python
MCP Servers
AI Code Editors
React
#debugging #qa #vibe-coding

cyberark/FuzzyAI

A powerful tool for automated LLM fuzzing to help developers and security researchers identify and mitigate potential jailbreaks.

1.2K
Stable
Jupyter Notebook
LLM Frameworks
Security Research
Jupyter Notebook
#ai #fuzzing #jailbreak

yueliu1999/Awesome-Jailbreak-on-LLMs

A collection of novel jailbreak methods for large language models (LLMs) focused on privacy and safety.

1.2K
Active
LLM Frameworks
Privacy Tools
#llms #privacy #safety

EthicalML/xai

An explainability toolbox for developers building machine learning models with interpretability and fairness in mind.

1.2K
Stable
Python
Explainability
Evaluation
Python
#ai-explainability #machine-learning-interpretability #bias-evaluation

aesara-devs/aesara

Aesara is a Python library for defining, optimizing, and efficiently evaluating mathematical expressions involving multi-dimensional arrays.

1.2K
Archived
Python
AI SDKs & Wrappers
API Frameworks
#aesara #automatic-differentiation #optimizing-compiler

open-edge-platform/training_extensions

Train, evaluate, optimize, and deploy computer vision models with OpenVINO, a toolkit for accelerating deep learning on edge devices.

1.2K
Active
Python
Computer Vision
API Frameworks
PyTorch
#computer-vision #openvino #deep-learning

Barca0412/Introduction-to-Quantitative-Finance

A collection of resources for quantitative finance, including factor-based stock quantitative framework and AI-finance related materials.

1.2K
Active
Python
LLM Frameworks
Databases
Python
#quantitative-finance #finance #investment

KMnP/vpt

A Python library for fine-tuning and evaluating large language models with visual prompts.

1.2K
Archived
Python
Fine-tuning
Inference
Python
#visual-prompts #language-models #fine-tuning

safety-research/bloom

A Python library for AI safety research that lets you evaluate any behavior immediately.

1.2K
Active
Python
LLM Frameworks
CLI Tools
Python
#safety-research #ai-safety #llm-evaluation

STMicroelectronics/STM32CubeF4

This is an MCU framework for the STM32F4 series, providing a complete set of drivers, middleware, and sample projects.

1.2K
Active
C
API Frameworks
CLI Tools
#stm32 #mcu #embedded

JonathonLuiten/TrackEval

A Python library for evaluating multi-object tracking algorithms using HOTA and other metrics.

1.2K
Archived
Python
Computer Vision
#tracking #evaluation #metrics

uzh-rpg/rpg_trajectory_evaluation

A toolbox for quantitative trajectory evaluation of visual odometry and visual-inertial odometry algorithms.

1.2K
Archived
Python
API Frameworks
Databases
#trajectory-evaluation #visual-odometry #visual-inertial-odometry

google/fuzzbench

FuzzBench is a benchmarking framework for evaluating fuzzer performance and security.

1.2K
Active
Python
Testing
Security Research
Python
#fuzzing #benchmark #security