Category
Showing 5851-5900 of 6,802 trending projects
A Rust library for image similarity comparison, simulating human perception using multiscale SSIM.
A voice-powered AI assistant that can answer questions about any application, in context and in audio.
Automatically find issues in image datasets and practice data-centric computer vision.
An AI inference operator for Kubernetes that makes it easy to serve ML models in production.
A tutorial repository for PyTorch Geometric, a library for deep learning on graphs and other structured data.
A PyTorch library for implementing 2D Discrete Wavelet Transform and Dual Tree Complex Wavelet Transform, useful for vibe coders working with AI and signal processing.
A collection of state-of-the-art metaheuristic optimization algorithms in Python for developers working with AI tools.
A comprehensive survey of deep learning-based image fusion techniques for computer vision applications.
A Python library for deep learning-based object pose estimation, with ROS inference capabilities.
A Solution Accelerator for the RAG pattern running in Azure, using Azure AI Search and OpenAI for chat and Q&A.
This Jupyter Notebook series covers the fundamentals of NLP and Computer Vision, leading to cutting-edge Vision-Language Models.
An open-source C library for developers building AI-powered applications and tools.
A tutorial showing how to set up TensorFlow's Object Detection API on the Raspberry Pi for computer vision projects.
SlimYOLOv3 is a narrower, faster and better object detection model for real-time applications on UAVs.
A knowledge graph attention network for building explainable recommendation systems.
A plug-and-play language model implementation that allows steering the topic and attributes of GPT-2 models.
An open-source implementation of a state-of-the-art unsupervised image denoising algorithm for developers working with computer vision.
A curated collection of recent advances in vision-language pretrained models (VL-PTMs) for AI and multimodal applications.
A repository containing puzzles and challenges for training large language models (LLMs).
A robust dense feature matcher for estimating pixel-dense warps and reliable certainties between image pairs.
A high-quality lip sync tool using deep learning techniques like GFPGAN and Wav2Lip.
A Python framework for building and training neural networks using the Theano library.
Minimal solvers for calibrated camera pose estimation, useful for computer vision applications.
A curated list of ML videos, links, projects and datasets to help developers learn and master machine learning.
Open-source implementation of the YOLOv5 object detection model in PyTorch for training custom models.
Provides a collection of workflows for the ComfyUI AI tool, designed for developers working with AI tools.
A PyTorch library for image synthesis and editing using stochastic differential equations.
This GitHub repository is a open-source TTS (Text-to-Speech) tracking tool for developers.
Federated learning library for distributed AI model training across multiple devices or servers.
A Python library that combines the latest version of YOLOv5 and DeepSort for object detection and tracking.
A C# framework for computer vision, artificial intelligence, and other research-oriented tasks.
Official PyTorch repo for GAN's N' Roses, providing diverse im2im and vid2vid selfie to anime translation.
A Python CLI tool that generates and runs shell commands using LangChain and large language models.
This repository provides a Python-based chatbot using Streamlit, OpenAI GPT-3.5-turbo, and Activeloop's Deep Lake for vibe coders.
Official code and checkpoint release for mobile robot foundation models like GNM, ViNT, and NoMaD.
A point-based neural radiance field for 3D reconstruction and rendering from multi-view images.
An open framework for blind navigation based on ESP32 and Python, focused on AI-powered accessibility solutions.
A dataset and methods for word-level sign language recognition from video, useful for developers building sign language applications.
A website that provides free API tokens for testing AI products, updated daily.
MeZO: A novel fine-tuning method for language models that requires just forward passes, ideal for vibe coders.
A Python library that integrates Scikit-learn into the Apache Spark distributed computing framework.
A comprehensive collection of deep learning paper reviews and code practices for AI developers.
A Java library for developers building AI-powered applications and tools.
An open-source machine learning framework for developers to build AI-powered applications.
A Jupyter Notebook for visualizing 3D object detection and LiDAR point clouds from the KITTI dataset.
A Jupyter Notebook library for applying data augmentation techniques to improve object detection models.
A generative and self-guided robotic agent that endlessly proposes and masters new skills.
A vehicle detection project using machine learning and computer vision for self-driving cars.
Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project, focused on AI and computer vision.
A Spark accelerator for Apache DataFusion, a SQL query engine written in Rust, aimed at vibe coders.
Get weekly updates on trending AI coding tools and projects.