A rigorous benchmark for evaluating the code quality and efficiency of large language models like GPT-4.
A large language model (LLM) focused on mental health, with tools for pre-training, post-training, datasets, evaluation, and deployment.
A Neovim plugin that allows running code snippets independently, supporting multiple languages.
A benchmark for evaluating the performance of large language models (LLMs) on complex terminal-based tasks.
A Python library that provides a collection of commonly used machine learning evaluation metrics.
A massively multiagent game environment for training and evaluating intelligent agents.
A repository for ImageReward, an approach for learning and evaluating human preferences in text-to-image generation.
Build, enrich, and transform datasets using AI models without writing code.
Benchmarks for evaluating Go serialization methods for performance and efficiency.
General Assembly's 2015 Data Science course covering topics like machine learning, data analysis, and data visualization.
A Python library that provides building blocks for rapid development of generative AI applications.
A CSS file that helps developers detect accessibility issues in HTML code.
A survey paper on evaluating large language models (LLMs) for developers building AI-powered applications.
A free, open-source HRM system for vibe coders with features like recruitment management and performance evaluation.
A popular library for training, using, and evaluating word embeddings, a fundamental building block for natural language processing.
A comprehensive benchmark for document parsing and evaluation, designed for CVPR 2025.
A Python tool for quickly evaluating IAM permissions in AWS.
A benchmark for evaluating knowledge transfer in lifelong robot learning using AI tools.
This Python repository provides an evaluation framework for text-to-speech models, focusing on enabling vibe coder development with AI tools.
Comprehensive collection of resources and tools for building awesome search experiences.