Category
Showing 1701-1750 of 6,802 trending projects
A roadmap to become a Visual SLAM (Simultaneous Localization and Mapping) developer in 2023.
An intelligent trading bot that generates signals and trades using machine learning and feature engineering.
Bailing is an open-source AI voice assistant built with ASR, LLM, and TTS, supporting low-latency response on low-end devices.
A TypeScript framework for building AI-powered applications and workflows using a modular, agent-based architecture.
Official repository for a paper on a large vision-language model for medical applications
Implementation of the state-of-the-art YOLOv13 object detection model with hypergraph-enhanced visual perception.
A collection of papers and code for CVPR conferences focused on low-level computer vision tasks.
A C++ library for real-time behavior synthesis using the MuJoCo physics simulation engine and model predictive control.
Measuring Massive Multitask Language Understanding with GPT-3 and few-shot learning techniques.
A Unity plugin for MFCC-based lip sync using the Job System and Burst Compiler.
A Python framework that makes it easy to build AI applications using large language models like GPT and LLMs.
An offline AI knowledge base and agent built with .NET, Blazor, and Semantic Kernel, supporting local AI models.
A collection of resources on using large language models for recommender systems
A TypeScript SDK for the LM Studio platform, an AI-powered developer tools ecosystem.
A C# library for creating AI-powered characters and chatbots in the Unity game engine.
A Python library for building AI-powered applications using LangChain, a framework for building applications with large language models.
A collection of content from a Chinese internet personality, including livestream transcripts and text analysis.
MultiAgentPPT is an AI-powered presentation generation system that leverages multi-agent collaboration and streaming concurrency.
A community-driven platform that transforms study materials into interactive resources like quizzes, flashcards, notes, and podcasts.
Adapts Meta AI's Segment Anything model to downstream tasks using adapters and prompts.
A knowledge graph-based question answering system for medical diagnosis using Python.
Wrapper around tool using LLMs for agentic workflows for vibe coders (AI-focused developers)
A curated list of awesome resources for building recommender systems, including papers, libraries, and more.
A C++ powered OCR tool that outputs JSON for easy integration with other programs.
A curated list of representative LLMs text datasets for AI/ML developers to explore and leverage.
A collection of must-read papers on Physics-Informed Neural Networks (PINNs), a powerful approach for solving differential equations.
An open-source project for Windows developers to build AI-powered apps with local models and APIs.
An AI-powered framework to evaluate the safety and alignment of large language models.
A GPT-SoVITS ONNX Inference Engine & Model Converter to enable voice cloning and text-to-speech for developers.
A lightweight, open-source speech synthesis library written in C for embedded devices like the Commodore 64.
A WebUI tool to create song covers using RVC v2 AI voices from audio files or YouTube videos.
An open-source package that allows game creators, AI researchers and hobbyists to build complex behaviors for non-player characters or agents.
An AI-powered assistant that generates PowerPoint presentations from natural language prompts.
A Linux app for speech-to-text, text-to-speech, and machine translation, with offline capabilities.
Distributed compiler based on Triton for parallel systems, focused on AI and high-performance computing.
A high-performance, AI-powered Riichi Mahjong engine written in Rust for developers interested in game AI.
Generate music based on natural language prompts using local LLM models.
An efficient one-step diffusion framework for streaming video super-resolution with AI-powered tools.
Next-gen AI+IoT framework for fast IoT and AI agent hardware integration
A large-scale text-to-image prompt gallery dataset based on Stable Diffusion for AI art and prompt engineering.
Official YOLOv8 model training and deployment library for computer vision tasks.
A framework to red team LLMs and LLM systems, focused on improving the safety and reliability of large language models.
A platform for accelerating embodied AI research, with a focus on robotics and simulation.
Comprehensive repository showcasing the latest advances in System-2 reasoning and LLM-based AI models.
A curated collection of real-world ML & LLM system design case studies from 100+ companies, helping developers learn how top tech firms implement GenAI.
A C# library that provides the easiest way to use the Ollama AI language model in .NET applications.
A tool for segmenting 3D objects in scenes using AI-powered computer vision techniques.
An AI-powered video agent framework for next-generation video interactions and workflows.
Truncated diffusion model for real-time end-to-end autonomous driving, using AI tools.
Access OpenAI models programmatically through your ChatGPT subscription.
Get weekly updates on trending AI coding tools and projects.