Computer Vision

Explore 2,275 open source projects in Computer Vision

Showing 1661-1680 of 2,275 projects

tianzhi0549/CTPN

A Jupyter Notebook project for detecting text in natural images using the Connectionist Text Proposal Network (CTPN) algorithm.

1.3K
Archived
Jupyter Notebook
Computer Vision
Backend Frameworks
Jupyter Notebook
#ocr #text-detection #computer-vision

google/neuroglancer

Neuroglancer is a WebGL-based viewer for volumetric data, enabling visualization and analysis of 3D biological and scientific data.

1.3K
Active
TypeScript
Animation & Motion
Charts & Visualization
TypeScript
#visualization #3d-data #scientific-data

fahadshamshad/awesome-transformers-in-medical-imaging

A curated collection of resources on applying Transformers to medical imaging tasks like segmentation, classification, and synthesis.

1.3K
Archived
Computer Vision
Tutorials & Courses
#medical-imaging #transformers #computer-vision

apple/ml-neuman

Official repository for NeuMan, which reconstructs a neural human radiance field from a single video.

1.3K
Archived
Python
Computer Vision
#computer-vision #neural-networks #3d-reconstruction

jbilcke-hf/ai-comic-factory

A TypeScript library that generates comic panels using a large language model and SDXL, powered by Hugging Face.

1.3K
Stable
TypeScript
LLM Frameworks
Computer Vision
TypeScript
#comics #llm #computer-vision

leofan90/Awesome-World-Models

A comprehensive list of papers on World Models, an approach applied to general video generation, embodied AI, and autonomous driving.

1.3K
Active
Agents & Orchestration
Computer Vision
#world-models #video-prediction #autonomous-driving

s9roll7/ebsynth_utility

An extension for the AUTOMATIC1111 Stable Diffusion web UI that enables creating videos using img2img and ebsynth.

1.3K
Archived
Python
Computer Vision
Animation & Motion
React
#stable-diffusion #computer-vision #animation

LinXueyuanStdio/LaTeX_OCR_PRO

A math formula OCR tool that supports handwritten formulas, formulas mixed with Chinese text, and simple symbolic reasoning.

1.3K
Archived
Jupyter Notebook
Computer Vision
OCR
Jupyter Notebook
#ocr #math-formulas #handwritten

zeusees/License-Plate-Detector

A fast and accurate license plate detection library built with YOLOv5 and ncnn.

1.3K
Archived
Python
Computer Vision
API Frameworks
#deep-learning #plate-detection #ncnn

YuanxunLu/LiveSpeechPortraits

Real-time photorealistic talking-head animation system built with Python and deep learning.

1.3K
Archived
Python
Computer Vision
AI Voice & Speech
React
#computer-vision #talking-head #speech-animation

nv-tlabs/GEN3C

A 3D-informed video generation model with precise camera control for high-quality, consistent video content.

1.3K
Stable
Jupyter Notebook
Computer Vision
Video Diffusion Model
Jupyter Notebook
#3d-graphics #camera-control #video-generation

jashkenas/ruby-processing

A Ruby library for creating interactive art and visuals using the Processing language.

1.3K
Archived
Ruby
Backend Frameworks
CLI Tools
#ruby #processing #computer-vision

ImprintLab/Medical-SAM-Adapter

A lightweight adapter that bridges the Segment Anything Model (SAM) with medical imaging applications.

1.3K
Stable
Python
Computer Vision
API Frameworks
Python
#medical-imaging #segmentation #deep-learning

MasterBin-IIAU/UNINEXT

A universal instance perception model for object detection, segmentation, and tracking in images and videos.

1.3K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#computer-vision #object-detection #object-tracking

ali-vilab/TeaCache

A Python library for accelerating inference of video diffusion models using timestep embedding caching.

1.3K
Experimental
Python
Inference
Computer Vision
Python
#video-generation #diffusion-models #inference-acceleration

jayleicn/animeGAN

A simple PyTorch implementation of Generative Adversarial Networks for generating anime-style faces.

1.3K
Archived
Jupyter Notebook
Computer Vision
Example Projects
PyTorch
#generative-adversarial-network #anime #computer-vision

soeaver/caffe-model

A collection of Caffe models and deployment files for popular networks covering tasks such as classification, detection, and segmentation.

1.3K
Archived
Python
Computer Vision
API Frameworks
Caffe
#caffe #caffemodel #classification

ShichenLiu/SoftRas

A differentiable renderer for 3D reasoning and reconstruction, useful for AI-driven 3D applications.

1.3K
Stable
Python
Computer Vision
API Frameworks
PyTorch
#3d-reconstruction #differentiable-rendering #computer-graphics

streamlit/demo-self-driving

A Streamlit app that demonstrates real-time object detection on the Udacity self-driving-car dataset.

1.3K
Active
Python
Computer Vision
Charts & Visualization
Streamlit
#computer-vision #object-detection #yolo

DAMO-NLP-SG/VideoLLaMA2

VideoLLaMA 2 is a Python library that advances spatial-temporal modeling and audio understanding in video-based large language models.

1.3K
Archived
Python
LLM Frameworks
Computer Vision
Python
#large-language-models #video-modeling #audio-understanding