Open-source bilingual chat language model for developers building AI-powered applications.
A curated list of deep learning resources for computer vision developers.
Efficiently computes derivatives of NumPy code, a useful tool for AI/ML developers.
A high-quality video frame interpolation library that can significantly improve video smoothness and quality.
Zadig is an AI-powered, cloud-native DevOps platform designed to improve developer productivity.
An API for extracting, anonymizing, and parsing text from various document formats using state-of-the-art OCR and LLM models.
A high-performance, auto-diff neural network library for 3D and 4D sparse tensor computations.
Official implementation of the UNet++ medical image segmentation model, useful for computer vision projects.
A Python implementation of DragGAN, an interactive point-based tool for manipulating generative image models.
OminiControl is a minimal and universal control framework for diffusion transformer models like FLUX.
A Python library for creating invisible watermarks on images, useful for protecting digital content.
A Python library for ensembling object detection models using the Weighted Boxes Fusion (WBF) method.
zetane/viewer is a Python-based 3D visualizer for ML models and internal tensors.
This repository contains a dataset of Chinese medical dialogues for NLP and conversational AI research.
Live Transcribe is an Android app that provides real-time captioning for people who are deaf or hard of hearing.
Lightweight, tightly coupled lidar-inertial odometry using parallel sparse incremental voxels.
M2Det is a single-shot object detection model based on a multi-level feature pyramid network.
An open-source, multi-tenant, self-building knowledge graph for developers building with AI tools.
An open-source 6DoF head tracking software for gaming, simulations, and virtual experiences.
A Python package for chatting with an AI model and executing the InstructLab workflow to train a model using custom taxonomy data.
An OpenMMLab toolkit for 3D human parametric model development and benchmarking.
LLM-powered fuzzing tool that integrates with the OSS-Fuzz security platform to find bugs in open-source projects.
A fundamental toolkit for music, song, and audio generation using PyTorch.
An open-source DevOps tool for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI Artifact.
This repository contains mini machine learning projects with Jupyter Notebook files.
An open-source, AI-powered web monitoring platform that helps developers automate web searches and email alerts.
An efficient 3D Gaussian splatting method for novel view synthesis from sparse multi-view images.
Prompty is a Python library that makes it easy to create, manage, debug, and evaluate LLM prompts for AI applications.
A comprehensive survey of deep learning-based image fusion techniques for computer vision applications.
A book on SLAM (Simultaneous Localization and Mapping) that covers geometric methods and deep learning approaches.
A 3D Gaussian Splatting framework with various derived algorithms and an interactive web viewer.
TensorFlow's visualization toolkit for machine learning model debugging and exploration.
A framework for arbitrary face swapping in images and videos using a single trained model.
A PyTorch repository for practicing image classification on the CIFAR-100 dataset using various deep learning models.
A comprehensive guide to prompt engineering, generative AI, and large language models for developers.
A lite C++ AI toolkit with 100+ models for computer vision tasks like detection, segmentation, and image generation.
A comprehensive library for audio and music analysis, feature extraction, and deep learning applications.
A high-performance library for efficient neural network pruning and compression across LLMs, vision models, and more.
A collection of computer vision and AI projects in Python, C++, and embedded systems for developers.
A large language model optimized for coding tasks and autonomous agent workflows.
This repository provides an official implementation of a paper on using transformers for time series forecasting.
Open-source platform for evaluating state-of-the-art AI and machine learning models and running challenges.
Robust real-time face and facial landmark tracking on CPU with Unity integration for virtual youtubers/VTubers.
An AI-powered image generation tool that allows users to create custom images using DALL-E 3.
The Alan AI SDK for Flutter enables building conversational AI-powered apps and voice interfaces.
A Python library for advanced audio and music signal processing tasks, useful for vibe coders building AI-powered music apps.
A powerful vision-language foundation model designed to advance multimodal AI understanding and reasoning.
A TypeScript library for building better AI agents and tools in the MCP ecosystem.
An AI-powered video super-resolution model that enhances real-world videos using text-to-video generation.
Multiversal tree writing interface for human-AI collaboration on creative coding projects.