A.S.E (AICGSecEval) is a repository-level benchmark for evaluating the security of AI-generated code, developed by the Tencent Wukong Code Security Team.
Elastic Malware Benchmark for Empowering Researchers, a Jupyter Notebook project for malware analysis.
An open-source research platform for developing AI-powered enterprise applications using LLMs and multi-agent systems.
This GitHub repository provides performance benchmarks for running Xcode on various Mac hardware.
A TensorFlow-based rotation detection benchmark for computer vision and AI models.
Tau-Bench is a Python library for benchmarking and evaluating AI language models and tools.
A benchmarking tool for measuring performance of deep learning operations on different hardware.
LongBench is a benchmark for evaluating large language models on long-context tasks.
A Python benchmark suite for evaluating text-to-3D generation models and techniques.
A library that helps developers working with AI tools validate and benchmark large language models (LLMs).
TableBank is a benchmark dataset for table detection and recognition, useful for building computer vision models.
An extensive benchmark for scientific machine learning, focused on physics-informed neural networks and partial differential equations.
A benchmarking infrastructure for .NET applications to measure performance and resource utilization.
A challenging, contamination-free benchmark for evaluating the performance of large language models (LLMs).
OmniSafe is an infrastructural framework for accelerating safe reinforcement learning research.
A high-performance C++ library for generating prime numbers, optimized for modern CPU architectures.
A comprehensive benchmark for spatio-temporal predictive learning, with a focus on AI-powered weather forecasting and video prediction.
An open-source security guide covering security standards, frameworks, threat models, encryption, and benchmarks.
Open-source optical flow toolbox and benchmark for computer vision tasks powered by PyTorch.
A benchmark to evaluate language models on various tasks, useful for vibe coders building AI-powered apps.