Explore Projects

Discover 49 open source projects

Active filter: search for "iccv"

Showing 1-20 of 49 projects

zziz/pwc

This repository is an archived collection of papers and code related to computer vision and machine learning.

15.4K

Archived

Computer Vision

#computer-vision #machine-learning #research

sczhou/ProPainter

A Python library for video inpainting, outpainting, and object removal using propagation and transformer models.

6.6K

Experimental

Python

Computer Vision

Backend Frameworks

Python

#video-inpainting #video-outpainting #object-removal

nianticlabs/monodepth2

This repository provides a PyTorch implementation for monocular depth estimation from a single image.

4.5K

Archived

Jupyter Notebook

Computer Vision

Depth Estimation

PyTorch

#computer-vision #depth-estimation #neural-network

cvg/LightGlue

LightGlue is a high-performance local feature matching library for computer vision tasks like pose estimation.

4.4K

Experimental

Python

Computer Vision

CLI Tools

Python

#computer-vision #pose-estimation #feature-matching

showlab/Tune-A-Video

Tune-A-Video is a one-shot text-to-video generation tool that fine-tunes image diffusion models.

4.4K

Archived

Python

Computer Vision

Fine-tuning

Python

#text-to-video #diffusion-models #fine-tuning

Picsart-AI-Research/Text2Video-Zero

A zero-shot text-to-video generation method that turns text prompts into videos using pretrained text-to-image diffusion models.

4.2K

Archived

Python

Computer Vision

AI Image & Video

Python

#text-to-video #video-generation #diffusion-models

clovaai/deep-text-recognition-benchmark

A benchmark for scene text recognition (OCR) with deep learning methods.

3.9K

Archived

Jupyter Notebook

Computer Vision

#ocr #text-recognition #deep-learning

ali-vilab/VACE

Official implementation of VACE, a model for video creation and editing.

3.7K

Stable

Python

Computer Vision

Video Generation

Python

#video-editing #video-generation #computer-vision

JiahuiYu/generative_inpainting

An image inpainting model using deep neural networks and attention mechanisms.

3.5K

Archived

Python

Computer Vision

Deep Neural Networks

TensorFlow

#image-inpainting #generative-adversarial-network #deep-learning

tianzhi0549/FCOS

A PyTorch implementation of the FCOS one-stage object detection model for computer vision tasks.

3.3K

Archived

Python

Computer Vision

API Frameworks

PyTorch

#object-detection #computer-vision #deep-learning

extreme-assistant/ICCV2023-Paper-Code-Interpretation

A curated collection of ICCV conference papers, code, and interpretations for computer vision developers.

2.3K

Archived

Computer Vision

Tutorials & Courses

#computer-vision #deep-learning #image-recognition

mit-han-lab/temporal-shift-module

A highly efficient module for temporal modeling in video understanding tasks.

2.2K

Archived

Python

Computer Vision

API Frameworks

PyTorch

#acceleration #efficient-model #low-latency

apple/ml-fastvit

Official implementation of the FastViT research paper, a fast hybrid vision transformer.

2.0K

Archived

Python

Computer Vision

ML Ops

Python

#vision-transformer #deep-learning #computer-vision

Yuanshi9815/OminiControl

OminiControl is a minimal and universal control framework for diffusion transformer models such as FLUX.

1.9K

Experimental

Python

LLM Frameworks

Inference

Python

#diffusion-models #computer-vision #image-generation

junshutang/Make-It-3D

A tool for high-fidelity 3D creation from a single image using diffusion models.

1.9K

Archived

Python

Computer Vision

Generative Art

Python

#3d-generation #computer-vision #deep-learning

KlingAIResearch/ReCamMaster

ReCamMaster is a novel video generation model that enables camera-controlled generative rendering from a single input video.

1.8K

Stable

Python

Computer Vision

Video Generation

Python

#computer-vision #video-generation #camera-control

svip-lab/impersonator

PyTorch implementation of a unified framework for human motion imitation, appearance transfer, and novel view synthesis.

1.7K

Archived

Python

PyTorch

#gan #pose #pytorch

ramprs/grad-cam

Grad-CAM is a deep learning visualization technique for interpreting and explaining CNN-based models.

1.6K

Archived

Lua

Computer Vision

Documentation

Lua

#convolutional-neural-networks #deep-learning #interpretability

yoshitomo-matsubara/torchdistill

A PyTorch-based framework for reproducible deep learning studies with 26 knowledge distillation methods.

1.6K

Stable

Python

ML Ops

Computer Vision

PyTorch

#deep-learning #computer-vision #natural-language-processing

hkchengrex/Tracking-Anything-with-DEVA

An open-vocabulary video segmentation model that can track any object in a video, for video editing and processing.

1.5K

Experimental

Python

Computer Vision

Video Segmentation

Python

#object-tracking #open-vocabulary #video-editing
