Showing 401-450 of 6,802 trending projects
A PyTorch-based reference implementation and models for the DINOv3 self-supervised vision transformer.
KrillinAI is a video translation and dubbing tool powered by LLMs, supporting translation across 100 languages and one-click deployment.
A toolkit for fine-tuning diffusion models.
A comprehensive collection of resources for large language models, including AI coding tools, MCP frameworks, and more.
A repository for a course on mastering large language model (LLM) engineering for AI-powered coding tools.
This repository provides a comprehensive course on Agentic AI Engineering, covering various AI coding tools and frameworks.
A Python framework for generating AI-powered PowerPoint presentations using large language models.
Multi-agent framework where personal AI agents collaborate to solve tasks together
A Python proxy exposing an OpenAI-style API, allowing developers to use the Claude language model without direct access to the OpenAI API.
A Python framework for high-performance auto-regressive diffusion model-based image and video generation.
A natural language interface for computers
On-device multimodal LLM for vision, speech, and live streaming on phones
High-performance mobile-optimized neural network inference framework for deploying AI models on mobile devices
Simulates human-like AI agents in a game environment
Fully automated AI video subtitle team with one-click subtitle cutting, translation, alignment, and dubbing.
An AI-powered business intelligence tool that generates SQL, charts, and insights from natural language queries.
FinRL is a financial reinforcement learning library that helps developers build AI-driven trading agents.
An open-source project that uses deep learning and OCR to translate text in manga/images
A comprehensive CLI tool for using large language models (LLMs) such as OpenAI models, Claude, and more for a variety of tasks.
A Python-based handbook for moving beyond prompt engineering to the wider discipline of context design and optimization.
A Python library for fast, high-quality monocular view synthesis, useful for computer vision and 3D applications.
A repository containing summaries and resources for the book 'Designing Machine Learning Systems' by Chip Huyen.
An open-source robotics simulation platform for developing and testing AI-driven robots in virtual environments.
verl-agent is a Python framework for training LLM/VLM agents using reinforcement learning.
A Python library that uses Transformers to remove refusal behavior from language models.
A reinforcement learning-based approach to scaling the training of vision-language-action (VLA) models.
Voice conversion framework with web UI for training and real-time voice models
CLIP is a neural network for zero-shot image-text matching and understanding
DeepSeek-OCR for visual-text compression and OCR tasks
A flexible framework for optimizing heterogeneous LLM inference and fine-tuning workflows.
Unstructured is an open-source ETL solution for transforming complex documents into structured data for language models.
An open-source, scalable, and high-performance RL framework for building AI-powered applications and tools.
A Python library for solving and discovering nonlinear partial differential equations using physics-informed neural networks.
Open-source implementation of AlphaEvolve, a coding agent for iterative code optimization and discovery.
Open-source self-hosted sandboxes for AI agents, enabling secure and scalable AI development.
A powerful local research tool that leverages AI to search and retrieve information from multiple sources.
A tutorial and code repository for using the Hugging Face Transformers library for NLP tasks.
A semantic router system for deploying and managing a mixture of AI models across cloud, data center, and edge environments.
SimpleMem: An efficient lifelong memory module for large language model agents.
A highly configurable and extensible Rime input scheme for Chinese writing with AI-powered word suggestions.
Biomni is a general-purpose biomedical AI agent that can be used for a variety of tasks in the healthcare and life sciences domains.
A curated list of state-of-the-art research in embodied AI, focusing on VLA, VLN, and related multimodal learning approaches.
A web app for interacting with any LangGraph agent (PY & TS) via a chat interface.
Autonomously trade on Polymarket using AI Agents built with Python
A powerful AI-driven tool to query, understand, and edit complex codebases across multiple languages in a monorepo.
LeetCode for PyTorch, a Jupyter Notebook-based coding practice platform for AI/ML developers.
A lightweight suite of motion imitation methods for training controllers.
Demos for the Claude Agent SDK, which developers can use to build AI-powered applications and tools.
Segment Anything Model for image/video segmentation with AI prompts
🐸TTS is a deep learning toolkit for advanced Text-to-Speech generation with 1100+ languages and tools for training and fine-tuning models.