Category
Showing 951-1000 of 6,802 trending projects
A minimal solution for high-speed hand motion capture from a single color camera.
JAX-based, hardware accelerated, batchable and differentiable optimizers for machine learning.
AIOS is an AI agent operating system that enables developers to build AI-powered applications and tools.
Implementation of 17+ agentic architectures for practical use across AI system development.
A demo project for creating talking head anime from a single image, using AI and computer vision techniques.
A Python script that makes it easy to swap faces in videos using the faceswap library and YouTube videos.
Utilities for working with image data, text data, and sequence data for AI/ML projects.
docTR is a high-performing and accessible library for OCR-related tasks powered by deep learning.
A personal memory system to power AI apps, built with a modern tech stack including TypeScript, Remix, and Prisma.
An open-source library for estimating optical flow, a fundamental computer vision task, using deep neural networks.
A reverse-engineered Python API for interacting with the Google Gemini web app, a generative AI tool.
A TypeScript plugin for monitoring Google Antigravity AI model quotas and usage.
A Python-based training tool for ddddocr, an OCR library for developers working with AI tools.
A framework for evaluating autoregressive code generation language models for developers building AI-powered coding tools.
A lip sync generation tool that leverages AI to synchronize speech with video in the wild.
A framework for building resilient language agents as graphs using TypeScript and AI tools.
A next-generation text translation tool that uses AI capabilities to instantly translate novels, games, and subtitles.
Official implementation of CEBRA, a tool for joint behavioral and neural analysis using learnable latent embeddings.
A high-performance latent diffusion model for generating high-resolution images.
A Python-based platform that uses LLMs to track and extract websites, RSS feeds, and social media for developers.
Roadmap to becoming an Artificial Intelligence Expert in 2022
An educational tutorial on recommendation systems built with Python, including algorithms and use cases.
Open-source IM server that supports ChatGPT and other AI chatbots for building messaging apps and platforms.
Tools for merging and optimizing large language models for AI applications.
An Android automation tool based on vision-language models that allows developers to automate mobile app interactions.
A Python library for building AI-powered applications, with support for various AI models and tools.
A programming language with static memory management based on λ-calculus for vibe coders.
Open-source observability tool for GenAI and LLM applications, based on OpenTelemetry
A collection of Reinforcement Learning algorithms implemented in Python for educational and research purposes.
A Python library for pattern matching, useful for vibe coders working on AI-powered applications.
This Python-based suite of tools allows developers to interact with AI services directly from their terminal.
CareGPT is an open-source, medical large language model (LLM) that aims to promote the rapid development of medical LLMs.
Deprecated Llama 3 repository with links to updated Llama Stack components
A curated paper reading list for developers interested in conversational AI and dialogue systems.
A minimal yet professional single agent demo project showcasing the core execution pipeline and production-grade features of AI agents.
Curated list of top machine learning Python libraries, updated weekly with quality scores.
A curated list of papers and resources focused on 3D Gaussian Splatting, a technique used in neural rendering.
Grounded SAM 2 is an open-source library for ground and track anything in videos using state-of-the-art AI models.
A Go-based tool that uses LLMs and LLM Vision (OCR) to digitize documents powered by AI.
A PyTorch-based library for robust and high-quality blind face restoration using a codebook lookup transformer.
Train, evaluate, optimize, and deploy computer vision models with OpenVINO, a toolkit for accelerating deep learning on edge devices.
An open-source library for converting speech to text using OpenAI's Whisper AI model, with Docker support.
Comprehensive guide to learn Retrieval Augmented Generation (RAG) from basics to advanced.
Official implementation of a paper on improving portrait animation using global audio perception
Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents.
An open-source Python library that helps developers quickly build and deploy RAG-based LLM web apps.
A community-maintained hardware plugin for running large language models (LLMs) on Ascend accelerators.
A Flutter-based Android/iOS voice chat app built on the Xiaozhi chatbot server.
Get weekly updates on trending AI coding tools and projects.