Category
Showing 1451-1500 of 6,802 trending projects
A 3D-informed video generation model with precise camera control for high-quality, consistent video content.
A curated list of resources on large language model-based text-to-SQL for developer productivity and database access.
NVIDIA DLSS is a deep learning neural network that boosts frame rates and generates sharp images for games.
An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.
A CVPR 2025 video diffusion model that enables fast autoregressive video generation from slow bidirectional models.
A Python library for efficient autonomous driving using vectorized scene representation.
A tutorial to build a RAG (Retrieval Augmented Generation) system from scratch using local LLMs and no black boxes.
AudioX is a Python library for audio processing and machine learning, designed for vibe coders.
A research and production-oriented toolkit for speaker verification, recognition, and diarization using AI and ML techniques.
A collection of awesome test-time (domain/batch/instance) adaptation methods for machine learning models.
A Python library for Soft Actor-Critic (SAC), a powerful reinforcement learning algorithm.
An on-device LLM execution library for React Native, compatible with Vercel AI SDK.
An open-source protocol for building an efficient collaboration network for intelligent agents.
An open-source 3D generation model that uses Spatial Sparse Attention for efficient and scalable 3D creation.
A Python library for solving inverse kinematics using the MuJoCo physics simulation engine.
A web app that uses AI to index videos, enable semantic search, and export scenes for developers building with AI tools.
An open-source speech dialogue generation model that enables expressive dialogue speech synthesis in Chinese and English.
A systematic framework for interactive world modeling with real-time latency and geometric consistency, built with Python.
This repository contains code for a research paper on a new diffusion-based text generation model.
A Python library for deep learning to decode EEG, ECG, and MEG signals for neuroscience research.
A C++ library for simulation and deployment of robot reinforcement learning algorithms for quadruped, wheeled, and humanoid robots.
Parallax is a distributed model serving framework that lets you build your own AI cluster anywhere.
A Python library for building AI-powered visual agents and tools for developers focused on AI coding workflows.
An open-source framework for building, evaluating, and training general multi-agent assistance systems using AI tools.
An open-source research platform for developing AI-powered enterprise applications using LLMs and multi-agent systems.
Pruna is a model optimization framework that helps developers deliver faster, more efficient AI models with minimal overhead.
A SQL-driven RAG engine that automatically builds a knowledge graph during querying, enabling knowledge-enhanced applications.
ChatdollKit enables developers to create virtual AI companions by integrating 3D models and chatbot functionality.
Official PyTorch implementation of a scalable transformer-based generative model for image generation and manipulation.
Resources for phase recovery, a computational imaging technique using deep learning and interferometry.
A high-performance and lightweight PyTorch-based license plate recognition framework.
HYPIR is a Python library for image restoration and super-resolution using diffusion-based priors.
An extensive benchmark for scientific machine learning, focused on physics-informed neural networks and partial differential equations.
Bayesian marketing toolbox in PyMC with models for media mix, customer lifetime value, and buy-till-you-die.
A collection of notes and projects on machine learning, deep learning, computer vision, NLP, and web scraping.
Utilities for decoding deep representations (like sentence embeddings) back to text.
Adds support for very large language models (vLLMs) to IndexTTS, enabling faster AI-powered text-to-speech inference.
A curated list of papers on trajectory and motion prediction, a key topic in computer vision and robotics.
A project that bridges the gap between large language models (LLMs) and recommender systems.
A repository containing diagrams for visualizing neural network architectures, useful for AI/ML developers.
Sequence-to-sequence toolkit for NLP tasks like translation and summarization
Roadmap to becoming an Artificial Intelligence Expert in 2022
Study plan for software engineers to become machine learning engineers
Bayesian methods and probabilistic programming in Python with PyMC
AudioCraft is a PyTorch library for audio generation with deep learning models like MusicGen and AudioGen.
A comprehensive tutorial on Kalman filters, extended Kalman filters, and more using Jupyter Notebooks.
Get weekly updates on trending AI coding tools and projects.