Category
Showing 551-600 of 6,802 trending projects
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
All-in-one AI framework for semantic search, LLM orchestration and language model workflows
A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.
Open-source trading OS with pluggable AI brain for market data analysis, AI reasoning, and trade execution
OptiScaler bridges upscaling/frame gen across GPUs, supporting DLSS2+, XeSS, FSR2+ and more.
docTR is a high-performing and accessible library for OCR-related tasks powered by deep learning.
A Python library for serving large language models (LLMs) with high performance, including GPU acceleration and distributed inference.
A Python library for converting PDF files into various formats, focused on processing scanned book PDFs.
Optimize AI inference performance on GPUs with this Python library for selecting and tuning inference engines.
An open-source, modern-design AI training tracking and visualization tool with support for various ML frameworks.
A library that provides insights into the internals of Apple's Neural Engine for iOS developers.
An integrated solution for building and evaluating knowledge graphs using AI tools like GraphRAG and LightRAG.
An all-in-one AI companion with features like AI desktop girlfriend, virtual streamer, chatbot, browser, and smart home.
A Kotlin library that runs Stable Diffusion on Android devices with Snapdragon NPU acceleration.
A C++ fork of the llama.cpp project with additional SOTA quants and improved performance for AI-focused developers.
OpenAgents is a Python library for building AI agent networks for open collaboration and community-driven AI development.
Efficient visual geometry learning for AI-powered applications with permutation-equivariant representations.
Swift API for MLX, a platform focused on enabling developers to build with AI tools.
A Python-based desktop assistant for the Limbus Company game, aimed at 'vibe coders' who build with AI tools.
Build production-ready LLM applications and advanced agents using Python, LangChain, and LangGraph.
Framework for building AI skills using expert methodologies & proven patterns
Minimal and annotated implementations of key ideas from modern deep learning research.
Token compression library for AI agents—5 techniques to cut LLM costs by ~50%.
DeepSeek-R1 is a reasoning model series with open-source versions and distillations for enhanced performance in math, code, and reasoning tasks.
Instant voice cloning model with tone color cloning and multi-lingual support
Industrial-strength NLP library for Python with pretrained models and fast processing
Curated list of deep learning resources including books, courses, tutorials, and tools for developers.
Stability AI's advanced 4D video generation model for high-fidelity novel-view synthesis
Kortix is a platform for building and managing autonomous AI agents with capabilities in browser automation, file management, and system operations.
A customizable, multi-modal AI chatbot that can be integrated with various chat platforms and leverages LLMs like ChatGPT, Bard, and GPT-3.
State-of-the-art text embedding library for building advanced natural language processing applications.
OpenVINO is an open-source toolkit for optimizing and deploying AI inference on a variety of hardware.
A simultaneous speech-to-text model powered by the Whisper AI library for real-time transcription.
This is a Python script that provides an automated assistant for the video game Honkai: Star Rail.
LangChain for Go, the easiest way to write LLM-based programs in Go
nnUNet is a powerful medical image segmentation library that uses state-of-the-art deep learning techniques.
A multilingual document layout parsing model that can extract text, images, and structure from documents in a single vision-language model.
A highly capable foundation model for monocular depth estimation, a key component in computer vision.
A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.
This repository provides a curated collection of resources for Prompt Engineering with a focus on large language models like ChatGPT and GPT-3.
A small, fast GPT model (124M) for quick prototyping and experimentation with AI-powered coding tools.
HelixDB is an open-source graph-vector database built from scratch in Rust for AI coding tools and workflows.
This repository provides the Chinese Simplified documentation for the TVM (Tensor Virtual Machine) framework, a popular open-source deep learning compiler stack.
HolmesGPT is an AI agent that helps SREs and DevOps teams solve incidents faster with automatic correlations, investigations, and more.
Implementation of a real-time audio-driven avatar generation system for vibe coders.
A comprehensive benchmark for document parsing and evaluation, designed for CVPR 2025.
A project-based course on developing AI agents using LangChain v1+ and LangGraph for search, RAG, reflection, and code interpreters.
An open framework for blind navigation based on ESP32 and Python, focused on AI-powered accessibility solutions.
Get weekly updates on trending AI coding tools and projects.