PyDMD is a Python library for dynamic mode decomposition, a data-driven method for model reduction.
Official PyTorch implementation of BigVGAN, a neural vocoder for generating high-quality audio, music, and speech.
English pronunciation correction tool built with the Gemini language model.
An implementation of the "Attention Is All You Need" paper, built with PyTorch and Jupyter Notebooks.
This is an AI-focused development platform for building with AI tools and services.
A curated list of resources related to domain adaptation, a technique used to improve AI model performance on new datasets.
ArrayFire is a general-purpose GPU library that provides a high-performance, cross-platform programming interface for developers.
MT3 is a Python library for multi-task multitrack music transcription, a powerful tool for audio analysis.
A review of Neural Style Transfer, a technique for applying the artistic style of one image to the content of another.
ROSA is an AI agent that helps robot developers inspect, diagnose, understand, and operate ROS1- and ROS2-based robotics systems using natural language.
A curated collection of research papers and resources for natural language processing (NLP) practitioners.
PyTorch implementation of a lossless image compression technique using super-resolution.
A library for building and training discrete world models for Atari game environments using reinforcement learning.
Official TensorFlow implementation of StyleGAN2, a generative adversarial network for image synthesis.
A Python tool for the Stable Diffusion WebUI that allows for the visualization of model differences.
Official implementation of a CVPR 2020 paper for video-based 3D human pose and shape estimation.
An open-source deep learning toolkit for building Speech-to-Text models and deploying them easily.
Notebooks for a course on Deep Learning with TensorFlow 2 and Keras, focused on AI and machine learning.
Example code and machine learning models for the Teachable Machine web-based AI training tool.
A deep learning library for maximal update parametrization.
Reference implementations of several LangChain agents as Streamlit apps.
A Python library for deep probabilistic analysis of single-cell and spatial omics data.
A fast and tightly-coupled LiDAR-Inertial-Visual Odometry (LIVO) system for 3D reconstruction and sensor fusion.
Official implementation of a paper on a unified Transformer-based framework for object detection and segmentation.
A comprehensive Python library for time series forecasting using machine learning models.
StableVideo is a Python library for text-driven, consistency-aware diffusion-based video editing, presented at ICCV 2023.
A Python library implementing various multi-robot path-planning algorithms.
A PyTorch implementation of MotionBERT, a unified approach for learning human motion representations.
A curated collection of resources for mixture-of-experts models, a powerful AI technique.
PantoMatrix is a Python library for generating facial and body animations from speech.
AnomalyGPT is a powerful tool for detecting industrial anomalies using large vision-language models.
A high-resolution network (HRNet) model for image classification trained on the ImageNet dataset.
A curated list of efficient attention modules for transformers.
A PyTorch implementation of a real-time scene text detection model with differentiable binarization.
A real-time approach for mapping 2D images to a 3D surface-based model of the human body.
A C++ SQLite extension that provides efficient vector search capabilities using the Faiss library.
A Python library for computing BERT-based text generation evaluation metrics.
SunoAPI allows developers to create music and audio in seconds using AI-powered tools.
A graph convolutional network for text classification.
A tutorial to build a RAG (Retrieval Augmented Generation) system from scratch using local LLMs and no black boxes.
A collection of research papers from the Computer Vision and Pattern Recognition (CVPR) conference in 2024.
A C++ tutorial for the TensorRT deep learning inference engine optimized for NVIDIA GPUs.
A collection of AI-related tutorials and resources for developers, focused on topics like machine learning, NLP, and data science.
AudioGPT is a powerful tool for understanding and generating speech, music, sound, and talking heads using AI.
A PyTorch baseline implementation for person re-identification and vehicle re-identification tasks.
A Python library that implements Graph Attention Networks, a powerful neural network architecture for graph-structured data.
A C++ library for detecting custom wake words using a deep neural network, useful for AI voice assistants.
SUSI.AI Smart Box is an open-source virtual assistant platform for building conversational AI agents.
The specification for the KDL document language, a serialization format.
SALMONN is a suite of advanced multi-modal large language models (LLMs) for audio, speech, and video understanding.