Category
Showing 3501-3550 of 6,802 trending projects
A large language model framework for reasoning in a continuous latent space.
Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation using PyTorch.
An open-source stock trading algorithm backtesting platform for training AI models on financial data.
A comprehensive Python library for evaluating object detection models using various metrics like mAP, AR, and STT-AP.
A C# framework for computer vision, artificial intelligence, and other research-oriented tasks.
Official code and checkpoint release for mobile robot foundation models like GNM, ViNT, and NoMaD.
A collection of cool computer vision, learning, and graphics papers focused on cats.
A neural network-based OCR library for JavaScript, useful for building document scanning and text extraction features.
A C++ inference library for various SVC/TTS models, including DiffSinger, DiffSVC, HiFiGAN, and VITS.
A fast, scalable library for Factorization Machines, a powerful machine learning model for recommendation systems.
A collection of Jupyter Notebooks for a Machine Learning course, focused on Python and AI/ML concepts.
Algorithm to texture 3D reconstructions from multi-view stereo images, useful for computer vision and 3D graphics projects.
Aim is an open-source experiment tracker that makes it easy to track and visualize machine learning experiments.
OpenChat is a JavaScript-based library for building custom chatbots and conversational AI assistants.
A repository with summaries and notes on deep learning research papers for developers interested in AI.
A task-aware, agent-driven prompt optimization framework for prompt engineering and fine-tuning language models.
Zero-shot voice conversion and singing voice conversion library with real-time support for vibe coders.
A plugin for the Neural Amp Modeler, a tool for AI-powered audio processing and generation.
A curated collection of resources about graph-related large language models (LLMs) for developers.
A fast and controllable text-to-speech library supporting over 7000 languages using deep learning and PyTorch.
Open-source library for detecting 20,000+ classes using image-level supervision, useful for computer vision applications.
A Rust library for interacting with the OpenAI API, enabling vibe coders to leverage AI capabilities in their projects.
An official repository for a paper on using executable code actions to elicit better LLM agents
Implementation of the state-of-the-art YOLOv13 object detection model with hypergraph-enhanced visual perception.
AgentTuning: A Python library for enabling generalized agent abilities in large language models (LLMs).
Chat with your PDFs using AI, allowing you to easily extract information from documents.
A library for using ONNX, an open format for machine learning models, with the TensorFlow deep learning framework.
Instant-NGP in PyTorch+CUDA with PyTorch Lightning for high-quality, high-speed 3D reconstruction and novel view synthesis.
Build production-ready LLM applications and advanced agents using Python, LangChain, and LangGraph.
A critical perspective on understanding R1-Zero-Like Training, a technique for large language models.
A multilingual word vector library for natural language processing and machine translation tasks.
Hephaestus is a semi-structured, agentic framework that allows workflows to build themselves as agents discover what needs to be done.
A Python library for fast reconstruction of neural radiance fields from direct voxel grid optimization.
Neva is a dataflow programming language and compiler that enables parallel computing with static typing.
An efficient multi-scale 3D convolutional neural network for medical image segmentation tasks.
A real-time AI image generator library powered by TypeScript and modern web technologies.
Kimi-Audio is an open-source audio foundation model for understanding, generating, and conversing with audio.
An end-to-end multimodal AI model that can understand and generate text, audio, vision, and video in real-time.
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
A multi-task multi-sensor fusion library for bird's-eye view perception in 3D computer vision applications.
An automated translation solution for visual novels (Galgames) that supports major language models like GPT-4 and Claude.
PiLiDAR is a Python library for processing and analyzing data from LiDAR (Light Detection and Ranging) sensors.
Underthesea is a powerful Vietnamese NLP toolkit for developers working with natural language processing tasks.
A Python framework for GPR underground hazard detection using reservoir computing and SAM techniques.
AI-powered news aggregator that summarizes Twitter/RSS feeds with structured summaries and web dashboard
A differentiable rendering library that enables Monte Carlo ray tracing without approximation for computer graphics and vision applications.
OctoTools is an agentic framework with extensible tools for complex reasoning, targeting vibe coders.
A TypeScript library that integrates ChatGPT into the Obsidian note-taking app, enabling AI-powered note-taking and writing.
A C# library that provides the easiest way to use the Ollama AI language model in .NET applications.
An extension for the AUTOMATIC1111 Stable Diffusion web UI that enables creating videos using img2img and ebsynth.
Get weekly updates on trending AI coding tools and projects.