Explore Projects

Discover 222 open source projects

Active filters (1):

Search: evaluation×

Clear all

Showing 161-180 of 222 projects

silentmatt/expr-eval

A JavaScript library for parsing and evaluating mathematical expressions.

1.3K

Archived

JavaScript

General Utilities

Frontend Frameworks

JavaScript

#math#expressions#parsing

google/adk-java

An open-source Java toolkit for building, evaluating, and deploying sophisticated AI agents.

1.3K

Active

Java

Agents & Orchestration

API Frameworks

#ai#agents#multi-agent-systems

hendrycks/math

A dataset of mathematical reasoning problems for evaluating AI systems.

1.3K

Stable

Python

Logic, Math & Reasoning

Python

#math#logic#reasoning

abo-abo/lispy

A Lisp editing library for Emacs focused on providing a streamlined and efficient coding experience.

1.3K

Active

Emacs Lisp

IDE Extensions

Backend Frameworks

Emacs

#lisp#emacs#coding-efficiency

Topdu/OpenOCR

An open-source toolkit for general OCR research and applications, with integrated training, evaluation, and production-ready OCR systems.

1.3K

Active

Python

Computer Vision

Backend Frameworks

PyTorch

#ocr#document-processing#computer-vision

tensorflow/model-analysis

This is a set of model analysis tools for TensorFlow, enabling developers to evaluate and optimize their machine learning models.

1.3K

Stable

Python

ML Ops

#tensorflow#model-evaluation#model-analysis

aws-solutions-library-samples/guidance-for-training-an-aws-deepracer-model-using-amazon-sagemaker

Guidance for training an AWS DeepRacer model using Amazon SageMaker, providing developers full control over the process.

1.3K

Archived

Jupyter Notebook

ML Ops

API Frameworks

Jupyter Notebook

#aws-deepracer#amazon-sagemaker#model-training

refreshdotdev/web-eval-agent

An autonomous web application evaluation agent powered by MCP and Playwright for vibe coders.

1.2K

Active

Python

MCP Servers

AI Code Editors

React

#debugging#qa#vibe-coding

cyberark/FuzzyAI

A powerful tool for automated LLM fuzzing to help developers and security researchers identify and mitigate potential jailbreaks.

1.2K

Stable

Jupyter Notebook

LLM Frameworks

Security Research

Jupyter Notebook

#ai#fuzzing#jailbreak

yueliu1999/Awesome-Jailbreak-on-LLMs

A collection of novel jailbreak methods for large language models (LLMs) focused on privacy and safety.

1.2K

Active

LLM Frameworks

Privacy Tools

#llms#privacy#safety

EthicalML/xai

An explainability toolbox for developers building machine learning models with interpretability and fairness in mind.

1.2K

Stable

Python

Explainability

Evaluation

Python

#ai-explainability#machine-learning-interpretability#bias-evaluation

aesara-devs/aesara

Aesara is a Python library for defining, optimizing, and efficiently evaluating mathematical expressions involving multi-dimensional arrays.

1.2K

Archived

Python

AI SDKs & Wrappers

API Frameworks

#aesara#automatic-differentiation#optimizing-compiler

open-edge-platform/training_extensions

Train, evaluate, optimize, and deploy computer vision models with OpenVINO, a toolkit for accelerating deep learning on edge devices.

1.2K

Active

Python

Computer Vision

API Frameworks

PyTorch

#computer-vision#openvino#deep-learning

Barca0412/Introduction-to-Quantitative-Finance

A collection of resources for quantitative finance, including factor-based stock quantitative framework and AI-finance related materials.

1.2K

Active

Python

LLM Frameworks

Databases

Python

#quantitative-finance#finance#investment

KMnP/vpt

A Python library for fine-tuning and evaluating large language models with visual prompts.

1.2K

Archived

Python

Fine-tuning

Inference

Python

#visual-prompts#language-models#fine-tuning

safety-research/bloom

bloom is a Python library for evaluating any behavior immediately, focused on AI safety research.

1.2K

Active

Python

LLM Frameworks

CLI Tools

Python

#safety-research#ai-safety#llm-evaluation

STMicroelectronics/STM32CubeF4

This is an MCU framework for the STM32F4 series, providing a complete set of drivers, middleware, and sample projects.

1.2K

Active

API Frameworks

CLI Tools

#stm32#mcu#embedded

JonathonLuiten/TrackEval

A Python library for evaluating multi-object tracking algorithms using HOTA and other metrics.

1.2K

Archived

Python

Computer Vision

#tracking#evaluation#metrics

uzh-rpg/rpg_trajectory_evaluation

A toolbox for quantitative trajectory evaluation of visual odometry and visual-inertial odometry algorithms.

1.2K

Archived

Python

API Frameworks

Databases

#trajectory-evaluation#visual-odometry#visual-inertial-odometry

google/fuzzbench

FuzzBench is a benchmarking framework for evaluating fuzzer performance and security.

1.2K

Active

Python

Testing

Security Research

Python

#fuzzing#benchmark#security

1...810...12

Stay in the loop

Get weekly updates on trending AI coding tools and projects.