Showing 221-240 of 848 projects
Motion is a software motion detector, a useful tool for developers working on computer vision projects.
An end-to-end multimodal AI model that can understand and generate text, audio, vision, and video in real-time.
TorchGeo is a Python library for working with geospatial data using PyTorch, providing datasets, samplers, transforms, and pre-trained models.
A repository to research and share machine learning articles, with a focus on computer vision, NLP, and reinforcement learning.
openMVS is a C++ library for 3D reconstruction from multi-view stereo images.
Sing-box is a multi-protocol network proxy tool with support for various protocols and platforms.
List of satellite image training datasets with annotations for computer vision and deep learning
Open-source toolkit for evaluating large multi-modal AI models, supporting 220+ models and 80+ benchmarks.
A computer vision toolbox for state-of-the-art person re-identification methods.
A pre-training toolbox and benchmark for vision AI models, including self-supervised learning and state-of-the-art architectures.
A highly efficient visual representation learning framework for AI-powered coding tools and applications.
JAX library for computer vision research with transformers, attention mechanisms, and vision models
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
An open-source cookbook for getting started with Phi, a family of high-performance small language models from Microsoft.
A comprehensive Ruby API for integrating with various AI tools and platforms, including OpenAI, Anthropic, Gemini, and more.
A Python library for self-supervised learning on images, with a focus on computer vision and AI-powered tooling.
A high-resolution audio-driven portrait image animation tool for AI and generative vision applications.
A PyTorch-based library for 3D facial alignment, useful for computer vision and AI-powered applications.
A comprehensive collection of AI-powered tools, techniques, and resources for building advanced data-driven applications.
A TensorFlow library for the YOLOv3 object detection model, enabling vibe coders to build cutting-edge computer vision applications.
Get weekly updates on trending AI coding tools and projects.