Computer Vision

Explore 2,275 open source projects in Computer Vision

Showing 2061-2080 of 2,275 projects

xemle/home-gallery

Self-hosted open-source web gallery with AI-powered image discovery and tagging for photos and videos.

1.1K
Active
JavaScript
Component Libraries (React)
Computer Vision
React
#photo-gallery#mobile-friendly#tagging

jeeliz/jeelizWeboji

Real-time face tracking, expression detection, and animated emoticons for web applications.

1.1K
Archived
JavaScript
Animation & Motion
Computer Vision
Three.js
#face-tracking#expression-detection#animated-emoticons

jhansireddy/AndroidScannerDemo

Android document scanning library built on OpenCV, allowing cropping and perspective transformation of scanned documents.

1.1K
Archived
C++
Android
CLI Tools
Android
#document-scanning#opencv#image-processing

NVlabs/NVAE

Official PyTorch implementation of the NVAE deep hierarchical variational autoencoder for AI and ML developers.

1.1K
Archived
Python
ML Ops
Computer Vision
PyTorch
#variational-autoencoder#deep-learning#computer-vision

satellite-image-deep-learning/datasets

A collection of datasets for deep learning with satellite and aerial imagery.

1.1K
Active
Computer Vision
Datasets
#earth-observation#remote-sensing#satellite-data

o0o0o0o0o0o0o/image-processing-from-scratch

This project contains image processing algorithms written from scratch in Python and C++.

1.1K
Archived
C++
Backend Frameworks
Computer Vision
#image-processing#computer-vision#algorithms

yu4u/noise2noise

An unofficial Keras implementation of the Noise2Noise image denoising algorithm, useful for vibe coders working with AI-powered tools.

1.1K
Archived
Python
Computer Vision
Caching
Keras
#denoising#image-processing#deep-learning

sml2h3/ddddocr-fastapi

A simple API built with FastAPI and ddddocr for solving captchas, with Docker support.

1.1K
Archived
Python
API Frameworks
Computer Vision
FastAPI
#captcha#ddddocr#docker

OpenGVLab/SAM-Med2D

Official implementation of SAM-Med2D, a tool for medical image segmentation using transformers.

1.1K
Archived
Jupyter Notebook
Computer Vision
Jupyter Notebook
#medical-imaging#image-segmentation#transformers

ShareGPT4Omni/ShareGPT4Video

An official implementation of a system for improving video understanding and generation with better captions.

1.1K
Archived
Python
LLM Frameworks
Computer Vision
PyTorch
#chatgpt#gpt-4#computer-vision

ethanhe42/channel-pruning

A library for accelerating deep neural networks through channel pruning, a model compression technique.

1.1K
Archived
Python
Computer Vision
Build Tools
Python
#acceleration#channel-pruning#deep-neural-networks

LeonLok/Multi-Camera-Live-Object-Tracking

Multi-camera live object tracking and traffic counting using YOLO v4, Deep SORT, and Flask.

1.1K
Archived
Python
Computer Vision
API Frameworks
Flask
#object-detection#object-tracking#traffic-monitoring

murtazahassan/Learn-OpenCV-in-3-hours

A Python library for learning and using OpenCV, a popular computer vision library.

1.1K
Archived
Python
Computer Vision
Tutorials & Courses
#computer-vision#opencv#python

feima09/GMTalker

GMTalker is a 3D digital human system that integrates speech recognition, speech synthesis, natural language understanding, and mouth animation for fast deployment on Windows, Linux, and Android.

1.1K
Active
Python
AI Voice & Speech
Computer Vision
Python
#3d-human#speech-recognition#speech-synthesis

stereolabs/zed-sdk

The spatial perception framework for rapidly building smart robots and spaces

1.1K
Stable
C++
Computer Vision
API Frameworks
#3d-reconstruction#depth-estimation#object-detection

Rock-100/FaceKit

A C++ library for real-time, rotation-invariant face detection using progressive calibration networks.

1.1K
Archived
C++
Computer Vision
API Frameworks
#face-detection#computer-vision#real-time

IDEA-Research/Grounding-DINO-1.5-API

A powerful open-world object detection model for computer vision tasks, leveraging the DINO framework.

1.1K
Archived
Python
Computer Vision
Inference
PyTorch
#object-detection#open-world#zero-shot

qiucheng025/zao-

An open-source AI-powered face swap tool, focused on the Chinese market.

1.1K
Archived
Python
Computer Vision
Backend Frameworks
Python
#ai#faceswap#computer-vision

fpgaminer/joycaption

JoyCaption is an open, uncensored image captioning Visual Language Model (VLM) for training Diffusion models.

1.1K
Stable
Jupyter Notebook
LLM Frameworks
Computer Vision
Jupyter Notebook
#captioning#joycaption#vlm

andyzeng/visual-pushing-grasping

A deep reinforcement learning library for training robotic agents to plan pushing and grasping actions for manipulation tasks.

1.1K
Archived
Python
Computer Vision
Deep Learning
Python
#computer-vision#deep-learning#deep-reinforcement-learning
1...103105...114

Stay in the loop

Get weekly updates on trending AI coding tools and projects.