Category
Showing 2051-2100 of 6,802 trending projects
A roadmap to learn Generative AI tools and technologies in 2025
An end-to-end multimodal AI model that can understand and generate text, audio, vision, and video in real-time.
A Python library for building recommender systems using popular deep learning techniques like DeepFM, NCF, and more.
Deprecated Tinder automation tool using AI that has been redirected towards Bernie AI.
A powerful text-to-image diffusion model that can be used for recaptioning, planning, and generating with multimodal LLMs.
Create chatbots with ease using a TypeScript-based framework that supports various AI language models.
This repository provides code for a paper on using Python execution for visual reasoning tasks.
A Java library that provides a set of examples for using the LangChain framework, which helps build applications with large language models.
A codebase for Time-series Generative Adversarial Networks (TimeGAN), a deep learning model for time series data generation.
PyTorch3D is a library of reusable components for deep learning with 3D data, developed by Facebook AI Research.
A Python library for video inpainting, outpainting, and object removal using propagation and transformer models.
The official implementation of RAPTOR, a framework for Recursive Abstractive Processing for Tree-Organized Retrieval.
An all-in-one video production workstation with AI-powered content planning, generation, and automation.
A Python library for fast reconstruction of neural radiance fields from direct voxel grid optimization.
An open-source app that allows users to search, read, bookmark, and summarize academic papers from arXiv using AI-powered features.
Clean, robust, and unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms.
An open-source C++ library for Bayesian inference and data analysis using Markov Chain Monte Carlo (MCMC) methods.
An easy-to-use natural language processing library built on the Gluon deep learning framework.
A generic U-Net implementation in TensorFlow for image segmentation tasks.
Official repository of Point Transformer V3 (PTv3), a computer vision AI model for point cloud processing.
Envoy-based API gateway that manages unified access to generative AI services like GPT, DALL-E, and Stable Diffusion.
A CVPR 2025 video diffusion model that enables fast autoregressive video generation from slow bidirectional models.
JoyCaption is an open, uncensored image captioning Visual Language Model (VLM) for training Diffusion models.
A sample repository showcasing workflows and experiences built on top of Microsoft Cognitive Services.
Go client for OpenAI's ChatGPT, GPT-5, DALL-E, and Whisper APIs, enabling AI-powered applications in Go.
A production-ready multi-agent AI framework for automating tasks and solving complex problems using LLMs.
A curated list of resources related to domain adaptation, a technique used to improve AI model performance on new datasets.
A retargetable MLIR-based machine learning compiler and runtime toolkit for AI/ML developers.
A Python library for grounding image matching in 3D using the MASt3R algorithm.
A large collection of system log datasets for AI-driven log analytics.
An open-source library that simplifies the use of Retrieval-Augmented Generation (RAG) with small language models.
Underthesea is a powerful Vietnamese NLP toolkit for developers working with natural language processing tasks.
Python implementation of the reinforcement learning concepts from the classic textbook.
An open-source real-time object detection library powered by the YOLOv10 neural network model.
Official implementation of a paper on multilingual visual text generation and editing using AI.
MTEB is a benchmark for evaluating and comparing text embedding models across multiple tasks and languages.
PyTorch compiler for NVIDIA GPUs using TensorRT, enabling efficient deep learning inference on CUDA hardware.
An integrated federated learning library for research and production use cases.
A powerful WYSIWYG interface for creating Machine Learning models without writing code.
Swift API for MLX, a platform focused on enabling developers to build with AI tools.
A collection of machine learning projects for developers to learn and apply various ML techniques.
PyMuPDF4LLM is a Python library for working with PDF documents, optimized for use with Large Language Models (LLMs).
A SQL-driven RAG engine that automatically builds a knowledge graph during querying, enabling knowledge-enhanced applications.
Gorilla is a Python tool for training and evaluating large language models (LLMs) for API/function calls.
A framework to enable multimodal AI models to control a computer, automating various tasks.
A private & local AI personal knowledge management app for high entropy people with a focus on vibe coders.
A collection of scripts for the Stable Diffusion AI image generation model, useful for vibe coders.
A 3D visualization tool for exploring and understanding the inner workings of GPT-style large language models (LLMs).
A Python-based webservice API that provides an easy-to-use interface for the OpenAI Whisper automatic speech recognition model.
This is a fully automated (audio) video translation project using Whisper for speech recognition and AI models for subtitling.
Get weekly updates on trending AI coding tools and projects.