Category
Showing 301-350 of 6,799 trending projects
An automated document analyzer for Paperless-ngx using various AI services to tag your documents.
This repository contains the releases for the Eden emulator, a tool for developers working with AI tools and technologies.
A media player for language learning with AI-powered features like dual subtitles, real-time translation, and more.
Official code repo for the O'Reilly book on LLMs with Jupyter Notebooks
A set of Jupyter notebooks for learning the fundamentals of Machine Learning and Deep Learning in Python.
Effortless data labeling with AI support from Segment Anything and other powerful models.
AIInfra is a platform for building and managing AI infrastructure, including hardware, software, and frameworks.
Open-source AI-powered presentation generator and API for creating stunning presentations without PowerPoint.
A library for generating 3D models using compact, structured latent representations.
A comprehensive interview prep resource for AIGC, LLM, and AI-related engineering roles.
An AI-powered document image analysis package designed specifically for the Japanese language.
RAG & Agent app for LLMs with local knowledge base support
Build and deploy AI agent workflows with visual design and Copilot integration.
Open-source voice AI models for speech synthesis and recognition
Open-source infrastructure for AI agents that can control full desktops (macOS, Linux, Windows).
A Python library for building AI-powered applications and integrating with large language models.
An open-source on-device speech recognition library for Apple Silicon devices, built with Swift and Transformers.
A multimodal evaluation toolkit for assessing AI models across text, image, video, and audio tasks.
DORA is middleware for creating AI-powered robotic applications with low latency, composable, and distributed dataflow capabilities.
DeepSpeed optimizes deep learning training and inference with distributed computing techniques.
OpenChat is a hackable Next.js AI chatbot template for building chatbot apps with AI SDK, supporting multiple model providers and Vercel deployment.
A curated collection of prompts and examples for mastering the Nano Banana Pro AI image model.
AI observability and evaluation tooling for developers building with large language models and AI agents.
An open-source Go toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
Unlimited-length talking video generation that supports image-to-video and video-to-video generation
Real-time AI assistant for Ray-Ban smart glasses using vision, voice, and agentic actions via Gemini Live.
A robotics simulation platform for training generalist robots on everyday tasks.
Practical image/video restoration with Real-ESRGAN for developers
FaceFusion is an industry-leading face manipulation platform for deepfake and face-swapping tasks.
A general-purpose physics simulator for robotics and dynamic simulation.
An open-source deep research agent optimized for research and prediction with state-of-the-art AI capabilities.
RF-DETR is a SOTA real-time object detection and segmentation model architecture designed for fine-tuning.
This is a stock prediction AI project that uses a GAN with LSTM and CNN to forecast stock price movements.
Spring AI Alibaba DataAgent is a Java-based library for integrating AI-powered features into applications.
Real-time voice cloning using deep learning
A TypeScript library for building context-aware reasoning applications using large language models.
Trae Agent is an LLM-based agent for general purpose software engineering tasks.
An AI-powered personalized learning assistant that leverages large language models and multi-agent systems.
LeetCUDA is a comprehensive collection of modern CUDA learning resources, including 200+ CUDA kernels, Tensor Cores, HGEMM, and FA-2 MMA.
A local-first AI coworker with memory, helping developers build with AI tools and agents.
NarratoAI is a Python-based tool that uses AI models to automatically provide commentary and edit videos with a single click.
Data transformation framework for AI with ultra-fast, incremental processing capabilities.
This repository allows developers to train their own medical language models using the ChatGPT training pipeline.
A curated catalogue of awesome agentic AI patterns for developers building with AI tools.
LimiX is a Python library that enables structured-data modeling capability for generalist intelligence.
GLM-5 LLM framework enabling agentic AI development and autonomous coding workflows
State-of-the-art diffusion models for image, audio, and video generation in PyTorch.
FishAudio-S1 is a high-quality open-source TTS model with voice cloning capabilities.
AI-powered image inpainting tool for removing/erasing objects and replacing them with realistic content
Get weekly updates on trending AI coding tools and projects.