Computer Vision

Explore 2,275 open source projects in Computer Vision

Showing 241-260 of 2,275 projects

PaddlePaddle/PaddleGAN

PaddleGAN is a high-performance GAN library for developers working with AI-powered applications like image editing, style transfer, and motion transfer.

8.1K

Archived

Python

Computer Vision

ML Ops

React

#image-generation#style-transfer#motion-transfer

mikel-brostrom/boxmot

A powerful multi-object tracking library with modular SOTA tracking modules for segmentation, detection, and pose estimation.

8.0K

Active

Python

Computer Vision

CLI Tools

Python

#multi-object-tracking#segmentation#object-detection

LiheYoung/Depth-Anything

A foundation model for monocular depth estimation that leverages large-scale unlabeled data.

8.0K

Archived

Python

Computer Vision

Python

#depth-estimation#image-synthesis#metric-depth-estimation

alex-damian/pulse

A Python library for upsampling images using a self-supervised generative model approach.

8.0K

Archived

Python

Computer Vision

#image-upsampling#generative-models#self-supervised-learning

rednote-hilab/dots.ocr

A multilingual document layout parsing model that can extract text, images, and structure from documents in a single vision-language model.

7.9K

Stable

Python

Computer Vision

Component Libraries (React)

React

#document-parsing#ocr#layout-extraction

google/model-viewer

An open-source 3D model viewer library that enables interactive 3D visualizations on the web and in AR.

7.9K

Stable

TypeScript

Animation & Motion

Frontend Frameworks

Three.js

#3d-visualization#augmented-reality#webxr

Project-MONAI/MONAI

An open-source AI toolkit for medical imaging and healthcare applications built on PyTorch.

7.9K

Active

Python

Computer Vision

API Frameworks

PyTorch

#medical-imaging#healthcare#deep-learning

jwyang/faster-rcnn.pytorch

A faster PyTorch implementation of the Faster R-CNN object detection algorithm.

7.9K

Archived

Python

Computer Vision

API Frameworks

PyTorch

#faster-rcnn#object-detection#computer-vision

MochiDiffusion/MochiDiffusion

A Swift-based library for running Stable Diffusion AI models natively on Apple Silicon Macs.

7.8K

Active

Swift

Computer Vision

iOS

Swift

#stable-diffusion#apple-silicon#macos

exadel-inc/CompreFace

An open-source face recognition system with a rich set of computer vision features and APIs.

7.8K

Archived

Java

Computer Vision

API Development

#face-detection#face-recognition#face-verification

apple/ml-sharp

A Python library for fast, high-quality monocular view synthesis, useful for computer vision and 3D applications.

7.8K

Stable

Python

Computer Vision

#computer-vision#3d-reconstruction#view-synthesis

cartographer-project/cartographer

Cartographer is a real-time SLAM system for 2D and 3D localization and mapping across multiple platforms and sensors.

7.8K

Archived

C++

Computer Vision

API Frameworks

#robotics#self-driving#localization

nadermx/backgroundremover

An open-source Python tool that uses AI to remove backgrounds from images and videos with a simple command line interface.

7.8K

Stable

Python

Computer Vision

Photo Editing

PyTorch

#ai#background-removal#image-editing

XavierXiao/Dreambooth-Stable-Diffusion

This is an implementation of the Dreambooth technique for fine-tuning Stable Diffusion models.

7.8K

Archived

Jupyter Notebook

Fine-tuning

Inference

PyTorch

#stable-diffusion#text-to-image#machine-learning

kroma-network/tachyon

A modular ZK backend accelerated by GPU for building blockchain and cryptocurrency applications.

7.7K

Archived

C++

Smart Contracts

API Frameworks

#blockchain#cryptocurrency#cryptography

NVlabs/SPADE

SPADE is a Python library for semantic image synthesis, enabling high-quality generation of images from semantic segmentation maps.

7.7K

Archived

Python

Computer Vision

Animation & Motion

PyTorch

#computer-vision#image-synthesis#semantic-segmentation

wang-xinyu/tensorrtx

TensorRT implementation of popular deep learning networks for efficient inference on GPUs

7.7K

Active

C++

Computer Vision

API Frameworks

#computer-vision#tensorrt#deep-learning

google-deepmind/alphafold3

AlphaFold 3 is a Python-based inference pipeline for protein structure prediction using deep learning.

7.7K

Active

Python

Inference

Computer Vision

#protein-structure-prediction#deep-learning#computer-vision

DepthAnything/Depth-Anything-V2

A highly capable foundation model for monocular depth estimation, a key component in computer vision.

7.7K

Archived

Python

Computer Vision

NeurIPS Submissions

Python

#depth-estimation#monocular-depth#foundation-model

HumanAIGC/EMO

This repository contains a diffusion model for generating expressive portrait videos from audio.

7.6K

Archived

Computer Vision

AI Image & Video

#computer-vision#video-generation#audio-to-video

1...1214...114

Stay in the loop

Get weekly updates on trending AI coding tools and projects.