Category
Showing 801-850 of 6,802 trending projects
An open-source project that uses deep learning and OCR to translate text in manga/images
An offline Android SDK for face recognition, liveness detection, and 1:N & M:N face search
MSF is a modular framework for multi-sensor fusion based on an Extended Kalman Filter, useful for robotics and computer vision applications.
Industrial-strength NLP library for Python with pretrained models and fast processing
Facebook AI Research's end-to-end speech recognition toolkit written in C++.
Configure Caffe, a deep learning framework, for Windows users in a quick and easy way.
A robotics simulation platform for training generalist robots on everyday tasks.
3D Point Cloud Annotation Platform for Autonomous Driving
A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.
A browser automation framework and ecosystem for vibe coders.
Safe Rust wrapper around the CUDA toolkit for GPU acceleration in AI/ML applications.
A general SLAM framework supporting different sensors and methods for 3D reconstruction and localization.
A2V is a next-gen AI value compute protocol for building AI agent networks and decentralized AI applications.
Run state-of-the-art 🤗 Transformers AI models directly in the browser, without a server.
Optuna is a hyperparameter optimization framework that enables efficient and scalable model tuning for machine learning.
A collection of Variational Autoencoders (VAEs) implemented in PyTorch for deep learning research and applications.
ViMax is an all-in-one tool for agentic video generation, allowing developers to act as directors, screenwriters, producers, and video generators.
A collection of research papers and resources on computer vision tasks like image classification, object detection, and face recognition.
Mooncake is a serving platform for Kimi, a leading LLM service provided by Moonshot AI, focused on disaggregation, inference, and RDMA.
An open-source TypeScript library that provides a wallet-like interface for managing AI agents and their resources.
This C++ framework provides a powerful and flexible command-and-control (C2) infrastructure for developers building AI-powered applications.
A curated list of state-of-the-art research in embodied AI, focusing on VLA, VLN, and related multimodal learning approaches.
A curated collection of papers on generative information extraction using large language models.
Comprehensive TensorFlow tutorials and best practices for deep learning and machine learning developers.
CVAT is an industry-leading data engine for machine learning, trusted by teams for annotating data at scale.
A robust, efficient, and low-latency speech-to-text library with advanced voice activity detection and wake word activation.
This is a curriculum for learning Natural Language Processing (NLP) from Siraj Raval's YouTube channel.
AI Crash Course to help busy builders catch up to the public frontier of AI research in 2 weeks
A Python library that trains a large language model on data from specific time periods to reduce modern bias.
An open-source project that enables voice control for Xiaomi AI speakers, unlocking new possibilities for developers.
A Python script to automate actions in the popular mobile game Onmyoji, targeted at 'vibe coders' who use AI tools.
A free MLOps course that covers machine learning model deployment and monitoring
A comprehensive collection of best practices and examples for natural language processing (NLP) using Python.
A Python-based automation tool for the game Zenless Zone Zero, with features like auto-dodge, daily tasks, and hollow farming.
A benchmark for evaluating the performance of large language models (LLMs) on complex terminal-based tasks.
Coloring black and white images using deep learning techniques, including a Jupyter Notebook tutorial.
A PyTorch library for building CNN-based text classification models.
High-performance Node.js image processing library for resizing and converting images quickly
A cross-platform asynchronous chatbot framework written in Python for building AI-powered conversational applications.
A deep learning-powered Chinese couplet generation tool using the seq2seq model.
High-performance mobile-optimized neural network inference framework for deploying AI models on mobile devices
Official code for a text-to-speech model that generates fluent and faithful speech with flow matching.
A TypeScript SDK for building AI-powered web applications and tools
An audio communication library in Python that enables secure data transfer over air-gapped systems.
Highly performant CUDA-accelerated library for gaussian splatting, useful for AI and computer vision applications.
High-performance LLM deployment engine for cross-platform AI model execution
One-stop solution for creating your digital avatar from chat history and fine-tuning LLMs to capture your unique style
A tutorial for building Neural Machine Translation models using TensorFlow.
A web app for interacting with any LangGraph agent (PY & TS) via a chat interface.
A Python library for generating multi-view-consistent images from a single-view image using diffusion models.
Get weekly updates on trending AI coding tools and projects.