Explore Projects

Discover 49 open source projects

Active filters (1):
Search: iccv

Showing 1-20 of 49 projects

zziz/pwc

This repository is an archived collection of papers and code related to computer vision and machine learning.

15.4K
Archived
Computer Vision
#computer-vision #machine-learning #research

sczhou/ProPainter

A Python library for video inpainting, outpainting, and object removal using propagation and transformer models.

6.6K
Experimental
Python
Computer Vision
Backend Frameworks
Python
#video-inpainting #video-outpainting #object-removal

nianticlabs/monodepth2

A PyTorch implementation of Monodepth2 for depth estimation from a single image.

4.5K
Archived
Jupyter Notebook
Computer Vision
Depth Estimation
PyTorch
#computer-vision #depth-estimation #neural-network

cvg/LightGlue

LightGlue is a high-performance local feature matching library for computer vision tasks like pose estimation.

4.4K
Experimental
Python
Computer Vision
CLI Tools
Python
#computer-vision #pose-estimation #feature-matching

showlab/Tune-A-Video

Tune-A-Video is a one-shot text-to-video generation tool that fine-tunes image diffusion models.

4.4K
Archived
Python
Computer Vision
Fine-tuning
Python
#text-to-video #diffusion-models #fine-tuning

Picsart-AI-Research/Text2Video-Zero

A zero-shot text-to-video generation model that turns text prompts into videos.

4.2K
Archived
Python
Computer Vision
AI Image & Video
Python
#text-to-video #video-generation #diffusion-models

clovaai/deep-text-recognition-benchmark

A benchmark of deep learning methods for scene text recognition (OCR).

3.9K
Archived
Jupyter Notebook
Computer Vision
#ocr #text-recognition #deep-learning

ali-vilab/VACE

Official implementation of the VACE paper, an AI model for video creation and editing.

3.7K
Stable
Python
Computer Vision
Video Generation
Python
#video-editing #video-generation #computer-vision

JiahuiYu/generative_inpainting

An image inpainting model using deep neural networks and attention mechanisms.

3.5K
Archived
Python
Computer Vision
Deep Neural Networks
TensorFlow
#image-inpainting #generative-adversarial-network #deep-learning

tianzhi0549/FCOS

A PyTorch implementation of the FCOS one-stage object detection model for computer vision tasks.

3.3K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#object-detection #computer-vision #deep-learning

extreme-assistant/ICCV2023-Paper-Code-Interpretation

A curated collection of ICCV conference papers, code, and interpretations for computer vision developers.

2.3K
Archived
Computer Vision
Tutorials & Courses
#computer-vision #deep-learning #image-recognition

mit-han-lab/temporal-shift-module

A highly efficient module for temporal modeling in video understanding tasks.

2.2K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#acceleration #efficient-model #low-latency

apple/ml-fastvit

Official implementation of the FastViT research paper, a fast hybrid vision transformer for AI/ML applications.

2.0K
Archived
Python
Computer Vision
ML Ops
Python
#vision-transformer #deep-learning #computer-vision

Yuanshi9815/OminiControl

OminiControl is a minimal and universal control framework for diffusion transformer (DiT) models.

1.9K
Experimental
Python
LLM Frameworks
Inference
Python
#diffusion-models #computer-vision #image-generation

junshutang/Make-It-3D

A tool for creating high-fidelity 3D content from a single image using diffusion models.

1.9K
Archived
Python
Computer Vision
Generative Art
Python
#3d-generation #computer-vision #deep-learning

KlingAIResearch/ReCamMaster

ReCamMaster is a novel video generation model that enables camera-controlled generative rendering from a single input video.

1.8K
Stable
Python
Computer Vision
Video Generation
Python
#computer-vision #video-generation #camera-control

svip-lab/impersonator

PyTorch implementation of a unified framework for human motion imitation, appearance transfer, and novel view synthesis.

1.7K
Archived
Python
React
#gan #pose #pytorch

ramprs/grad-cam

Grad-CAM is a deep learning visualization technique for interpreting and explaining CNN-based models.

1.6K
Archived
Lua
Computer Vision
Documentation
Lua
#convolutional-neural-networks #deep-learning #interpretability

yoshitomo-matsubara/torchdistill

A PyTorch-based framework for reproducible deep learning studies with 26 knowledge distillation methods.

1.6K
Stable
Python
ML Ops
Computer Vision
PyTorch
#deep-learning #computer-vision #natural-language-processing

hkchengrex/Tracking-Anything-with-DEVA

An open-vocabulary video segmentation model that tracks any object in a video, with applications in video editing and processing.

1.5K
Experimental
Python
Computer Vision
Video Segmentation
Python
#object-tracking #open-vocabulary #video-editing
