Showing 201-220 of 222 projects
A Kickstarter project that provides an Autodesk 3D printer evaluation for developers.
A dataset of real user questions and answers for training and evaluating question answering systems.
EvalEx is a Java library for evaluating simple mathematical and boolean expressions.
Evaluation tool for building and testing LLM-powered QA chains in Python.
Open-source computer use agents that can operate on cross-platform environments for AI-focused developers.
A collaborative spreadsheet-like platform for building and experimenting with AI applications.
A challenging, contamination-free benchmark for large language models (LLMs) to evaluate their performance.
An index of algorithms for offline reinforcement learning (offline-rl) targeting AI and ML researchers.
A simple, customizable, and fast static site generator in the Julia programming language.
A fast and lightweight .NET expression evaluator library for math and logical operations.
A Python library to evaluate the response of large language models like GPT-4 using Prometheus metrics.
A benchmark to evaluate language models on various tasks, useful for vibe coders building AI-powered apps.
This Python repository provides code for training, evaluating, and predicting deep learning models.
A powerful 2D function plotting library for developers to visualize mathematical expressions and data.
A Python CLI tool for multi-cloud and multi-SaaS asset management, security posture monitoring, and attack surface reduction.
RAFT-Stereo is a PyTorch library for training and evaluating stereo matching models.
A framework for evaluating autoregressive code generation language models for developers building AI-powered coding tools.
A powerful Swift framework for evaluating natural language math expressions
An open-source post-building layer for AI agents, providing environment data and evaluations to power agent post-training and monitoring.
An open-source benchmark suite for evaluating PyTorch performance across various use cases.
Get weekly updates on trending AI coding tools and projects.