Computer Vision

Explore 2,275 open source projects in Computer Vision

Showing 241-260 of 2,275 projects

PaddlePaddle/PaddleGAN

PaddleGAN is a high-performance GAN library for developers working with AI-powered applications like image editing, style transfer, and motion transfer.

8.1K
Archived
Python
Computer Vision
ML Ops
React
#image-generation#style-transfer#motion-transfer

mikel-brostrom/boxmot

A powerful multi-object tracking library with modular SOTA tracking modules for segmentation, detection, and pose estimation.

8.0K
Active
Python
Computer Vision
CLI Tools
Python
#multi-object-tracking#segmentation#object-detection

LiheYoung/Depth-Anything

A foundation model for monocular depth estimation that leverages large-scale unlabeled data.

8.0K
Archived
Python
Computer Vision
Python
#depth-estimation#image-synthesis#metric-depth-estimation

alex-damian/pulse

A Python library for upsampling images using a self-supervised generative model approach.

8.0K
Archived
Python
Computer Vision
#image-upsampling#generative-models#self-supervised-learning

rednote-hilab/dots.ocr

A multilingual document layout parsing model that can extract text, images, and structure from documents in a single vision-language model.

7.9K
Stable
Python
Computer Vision
Component Libraries (React)
React
#document-parsing#ocr#layout-extraction

google/model-viewer

An open-source 3D model viewer library that enables interactive 3D visualizations on the web and in AR.

7.9K
Stable
TypeScript
Animation & Motion
Frontend Frameworks
Three.js
#3d-visualization#augmented-reality#webxr

Project-MONAI/MONAI

An open-source AI toolkit for medical imaging and healthcare applications built on PyTorch.

7.9K
Active
Python
Computer Vision
API Frameworks
PyTorch
#medical-imaging#healthcare#deep-learning

jwyang/faster-rcnn.pytorch

A faster PyTorch implementation of the Faster R-CNN object detection algorithm.

7.9K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#faster-rcnn#object-detection#computer-vision

MochiDiffusion/MochiDiffusion

A Swift-based library for running Stable Diffusion AI models natively on Apple Silicon Macs.

7.8K
Active
Swift
Computer Vision
iOS
Swift
#stable-diffusion#apple-silicon#macos

exadel-inc/CompreFace

An open-source face recognition system with a rich set of computer vision features and APIs.

7.8K
Archived
Java
Computer Vision
API Development
#face-detection#face-recognition#face-verification

apple/ml-sharp

A Python library for fast, high-quality monocular view synthesis, useful for computer vision and 3D applications.

7.8K
Stable
Python
Computer Vision
#computer-vision#3d-reconstruction#view-synthesis

cartographer-project/cartographer

Cartographer is a real-time SLAM system for 2D and 3D localization and mapping across multiple platforms and sensors.

7.8K
Archived
C++
Computer Vision
API Frameworks
#robotics#self-driving#localization

nadermx/backgroundremover

An open-source Python tool that uses AI to remove backgrounds from images and videos with a simple command line interface.

7.8K
Stable
Python
Computer Vision
Photo Editing
PyTorch
#ai#background-removal#image-editing

XavierXiao/Dreambooth-Stable-Diffusion

This is an implementation of the Dreambooth technique for fine-tuning Stable Diffusion models.

7.8K
Archived
Jupyter Notebook
Fine-tuning
Inference
PyTorch
#stable-diffusion#text-to-image#machine-learning

kroma-network/tachyon

A modular ZK backend accelerated by GPU for building blockchain and cryptocurrency applications.

7.7K
Archived
C++
Smart Contracts
API Frameworks
#blockchain#cryptocurrency#cryptography

NVlabs/SPADE

SPADE is a Python library for semantic image synthesis, enabling high-quality generation of images from semantic segmentation maps.

7.7K
Archived
Python
Computer Vision
Animation & Motion
PyTorch
#computer-vision#image-synthesis#semantic-segmentation

wang-xinyu/tensorrtx

TensorRT implementation of popular deep learning networks for efficient inference on GPUs

7.7K
Active
C++
Computer Vision
API Frameworks
#computer-vision#tensorrt#deep-learning

google-deepmind/alphafold3

AlphaFold 3 is a Python-based inference pipeline for protein structure prediction using deep learning.

7.7K
Active
Python
Inference
Computer Vision
#protein-structure-prediction#deep-learning#computer-vision

DepthAnything/Depth-Anything-V2

A highly capable foundation model for monocular depth estimation, a key component in computer vision.

7.7K
Archived
Python
Computer Vision
NeurIPS Submissions
Python
#depth-estimation#monocular-depth#foundation-model

HumanAIGC/EMO

This repository contains a diffusion model for generating expressive portrait videos from audio.

7.6K
Archived
Computer Vision
AI Image & Video
#computer-vision#video-generation#audio-to-video
1...1214...114

Stay in the loop

Get weekly updates on trending AI coding tools and projects.