Category
Showing 5901-5950 of 6,802 trending projects
PixelLib is a Python library for image and video segmentation using deep learning models like Mask R-CNN, DeepLab, and PointRend.
Provides a powerful object detection and tracking solution for vehicle and pedestrian counting using YOLOv5 and DeepSort.
RStan is an R interface to the Stan probabilistic programming language, used for Bayesian data analysis and inference.
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
A curated list of resources for image captioning and related areas, useful for developers working on AI projects.
PyTorch code for a CVPR 2018 paper on few-shot learning, a technique for training ML models with limited data.
A Python-based face editing tool for Stable Diffusion, a popular AI image generation model.
Vocos is a high-quality audio synthesis library that bridges the gap between time-domain and Fourier-based neural vocoders.
A curated list of practical natural language processing tools and libraries in Ruby for developers.
Powerful workflow engine and end-to-end pipeline solutions implemented with native Kubernetes resources.
A Python library for generating expressive talking video using memory-guided diffusion models.
An interactive visualization tool to explore the geometric intuition behind diffusion models.
A lightweight collaborative text span annotation tool for named entity recognition and event extraction.
A ChatGPT plugin for the Yunzai bot framework, enabling AI-powered chatbots on QQ platforms.
VideoGPT is a Jupyter Notebook-based project for generating videos from text prompts using AI models.
A library of recommendation algorithms based on Graph Neural Networks for information retrieval and recommendation systems.
Deprecated Scikit-learn integration package for Apache Spark, useful for machine learning on big data.
A Python library for sentiment analysis, text classification, and other machine learning tasks on Weibo data.
A research project focused on open-world object detection using continual and contrastive learning techniques.
A learning platform that leverages LLMs to assist students, scholars, and lifelong learners.
A JavaScript library for creating neural style transfer images, a type of AI-powered image generation.
A Python library based on PaddlePaddle for developing virtual anchors (vtubers) with AI tools.
Unsupervised language modeling and robust sentiment classification for AI/ML developers.
Tutorials on implementing image classification architectures using PyTorch and TorchVision.
An open-source SLAM (Simultaneous Localization and Mapping) library for 3D LiDAR odometry and mapping.
A blueprint for building production-ready RAG systems that minimize hallucination, with switchable pipelines.
Repository for the AdaBelief Optimizer, a NeurIPS 2020 Spotlight paper on an adaptive optimizer for AI/ML models.
A CVPR 2024 and TPAMI 2025 AI-powered multimodal learning architecture for vibe coders.
A high-quality, open-source text-to-speech library in Rust for developers to build AI-powered voice applications.
A universal, AI-powered chat application built with Go for developers to build on top of.
A PyTorch implementation of a neural radiance field model for talking head synthesis driven by audio.
A library for explaining the decisions made by Vision Transformers, a type of AI model used for computer vision tasks.
An open-source app that allows users to search, read, bookmark, and summarize academic papers from arXiv using AI-powered features.
This repository contains a list of speech synthesis papers for developers interested in AI-powered voice and speech technology.
Algorithm to texture 3D reconstructions from multi-view stereo images, useful for computer vision and 3D graphics projects.
A series of math-focused large language models for AI-powered coding and analysis
A collection of machine learning tutorials covering various topics like anomaly detection, time series forecasting, and object detection.
A deep learning-based library for efficient lane detection, using self-attention distillation.
Real-time audio analysis in Python, with visualizations and FFT feature extraction from streaming audio.
A general representation model for cross-modal learning across vision, audio, and language.
Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation.
An open-source Chinese medical multimodal model that can summarize chest radiographs.
A Python library for less-is-more reasoning with large language models, focused on the COLM 2025 conference.
A PyTorch implementation of Prototypical Networks for Few-Shot Learning, a powerful technique for training AI models on small datasets.
A comprehensive guide for developers to stay up-to-date with the latest advancements in AI, ML, DL, and computer vision.
Open source deep learning framework for building AI-powered iOS, macOS, and tvOS apps in Swift.
A PyTorch implementation of the TernausNet model for image segmentation, pre-trained on the Kaggle Carvana dataset.
A natural language detection library for Rust that can identify the language of text samples.
A next-generation recommendation system library for building AI-powered content discovery platforms.
A Python library for training deep neural networks with weights and activations constrained to +1 or -1.
Get weekly updates on trending AI coding tools and projects.