Category
Showing 1651-1700 of 6,802 trending projects
Curated computer vision resources for developers
A tutorial on deep learning by renowned professor Li Hongy, covering a wide range of AI and machine learning topics.
Keras implementation of RetinaNet object detection, a powerful deep learning-based object detection model.
This repository provides examples and tutorials to help developers build AI systems using popular AI tools and frameworks.
Official repository for the OFA (Unifying Architectures, Tasks, and Modalities) AI model, supporting various vision-language tasks.
Compares the performance of multiple NVIDIA GPUs and Apple Silicon for running large language model inference.
This is a minimal machine learning study plan for developers interested in learning ML concepts and techniques.
A blueprint for building production-ready RAG systems that minimize hallucination, with switchable pipelines.
A curated list of resources and projects related to quantum machine learning, algorithms, and frameworks.
An offline iOS and macOS library for running large language models like LLAMA, GPT-2, and RWKV using the GGML library.
A Python script that generates high-resolution depth maps for the Stable Diffusion WebUI, a popular AI-powered image generation tool.
Caption-Anything is a versatile AI-powered tool for generating tailored image captions with diverse controls.
This repository contains materials, slides, and notebooks for Andrew Ng's deeplearning.ai course on machine learning and AI.
Unofficial API wrapper for Perplexity.ai with a web interface for generating Perplexity AI accounts
VideoMamba is a state space model for efficient video understanding, focused on AI and machine learning.
Fay is an agent framework that helps connect digital humans and large language models to business systems.
A collection of examples demonstrating the usage of the MLX framework for building AI-powered applications.
This is a code repository for learning deep learning with PyTorch, a popular machine learning library.
An AI-powered platform for building machine learning models from natural language prompts.
Cross-modal lip reading using 3D convolutional neural networks for speech recognition.
A robust Python tool for text-based AI training and generation using GPT-2.
A collection of deep learning tutorials in Jupyter Notebooks for developers interested in AI tools.
Real-time speech recognition and voice activity detection for offline use on multiple platforms.
An intelligent document parsing tool that extracts and converts data from various document formats to structured data like Markdown, JSON, CSV, and HTML.
A powerful multi-resolution diffusion transformer for fine-grained Chinese text understanding and generation.
A C++ toolbox for calibrating multi-sensor systems in autonomous driving applications.
A library for running Caffe models in TensorFlow, enabling developers to leverage existing Caffe models in their TensorFlow-based projects.
A CVPR'24 highlighted Python library for building Gaussian Splatting SLAM systems for robotics.
A high-performance 3D rendering library for ray tracing and hybrid rasterization of Gaussian particles.
A diffusion transformer model for generating high-quality 4K text-to-image art, focused on vibe coders and AI developers.
A library for running machine learning models on FPGAs using high-level synthesis (HLS) tools.
Scalable reinforcement learning solution for advanced reasoning of language models.
Bender is a Swift library that makes it easy to build fast neural networks on iOS using TensorFlow models with Metal under the hood.
A free, open-source browser extension to filter NSFW content using TypeScript and TensorFlow.js.
An AI-powered library for extracting and searching information from scientific and medical papers.
A C++ library for rendering and synthesizing light fields, useful for computer vision and graphics applications.
Highly performant, modular, and production-ready inference, ingestion, and indexing library built in Rust for AI-powered applications.
A fully functional poker bot that uses computer vision, neural networks, and genetic algorithms to play poker on various platforms.
A Python library for finding big moving stocks using machine learning and anomaly detection.
A simple API for the VITS text-to-speech model, with additional features for vibe coders.
OpenWash is a format and specification for defining and sharing AI model deployment packaging.
An open-source, large language model-based multimodal dialogue system that achieves near-GPT-4o performance.
StarCoder is a Python library for fine-tuning and inference of large language models.
A full-featured AI app that supports GPT, Tongyiqianwen, Wenxinyiyan, Stable Diffusion and more for vibe coders.
A computationally efficient and robust LiDAR-inertial odometry (LIO) package for robotics and autonomous systems.
Snips NLU is a Python library for extracting meaning from text using natural language processing and machine learning.
A high-performance deep learning architecture that improves upon the classic VGG model.
This is a fully automated (audio) video translation project using Whisper for speech recognition and AI models for subtitling.
A Python library that enables creating web demos using OpenAI's GPT-3 API with just a few lines of code.
A Python library for deep and online learning with spiking neural networks (SNNs) using PyTorch.
Get weekly updates on trending AI coding tools and projects.