Computer Vision

Explore 2,275 open source projects in Computer Vision

Showing 1661-1680 of 2,275 projects

tianzhi0549/CTPN

A Jupyter Notebook project for detecting text in natural images using the Connectionist Text Proposal Network (CTPN) algorithm.

1.3K

Archived

Jupyter Notebook

Computer Vision

Backend Frameworks

Jupyter Notebook

#ocr #text-detection #computer-vision

google/neuroglancer

Neuroglancer is a WebGL-based viewer for volumetric data, enabling visualization and analysis of 3D biological and scientific data.

1.3K

Active

TypeScript

Animation & Motion

Charts & Visualization

TypeScript

#visualization #3d-data #scientific-data

fahadshamshad/awesome-transformers-in-medical-imaging

A curated collection of resources on applying Transformers to medical imaging tasks like segmentation, classification, and synthesis.

1.3K

Archived

Computer Vision

Tutorials & Courses

#medical-imaging #transformers #computer-vision

apple/ml-neuman

Official repository for NeuMan, a model that builds a neural human radiance field from a single video.

1.3K

Archived

Python

Computer Vision

#computer-vision #neural-networks #3d-reconstruction

jbilcke-hf/ai-comic-factory

A TypeScript library that generates comic panels using a large language model and SDXL, powered by Hugging Face.

1.3K

Stable

TypeScript

LLM Frameworks

Computer Vision

TypeScript

#comics #llm #computer-vision

leofan90/Awesome-World-Models

A comprehensive list of papers on world models for general video generation, embodied AI, and autonomous driving.

1.3K

Active

Agents & Orchestration

Computer Vision

#world-models #video-prediction #autonomous-driving

s9roll7/ebsynth_utility

An extension for the AUTOMATIC1111 Stable Diffusion web UI that creates videos using img2img and EbSynth.

1.3K

Archived

Python

Computer Vision

Animation & Motion

React

#stable-diffusion #computer-vision #animation

LinXueyuanStdio/LaTeX_OCR_PRO

A math formula OCR tool that supports handwritten formulas, formulas mixed with Chinese text, and simple symbol reasoning.

1.3K

Archived

Jupyter Notebook

Computer Vision

OCR

Jupyter Notebook

#ocr #math-formulas #handwritten

zeusees/License-Plate-Detector

A fast, accurate license plate detection library built with YOLOv5 and ncnn.

1.3K

Archived

Python

Computer Vision

API Frameworks

#deep-learning #plate-detection #ncnn

YuanxunLu/LiveSpeechPortraits

Real-time photorealistic talking-head animation system built with Python and deep learning.

1.3K

Archived

Python

Computer Vision

AI Voice & Speech

React

#computer-vision #talking-head #speech-animation

nv-tlabs/GEN3C

A 3D-informed video generation model with precise camera control for high-quality, consistent video content.

1.3K

Stable

Jupyter Notebook

Computer Vision

Video Diffusion Model

Jupyter Notebook

#3d-graphics #camera-control #video-generation

jashkenas/ruby-processing

A Ruby library for creating interactive art and visuals using the Processing language.

1.3K

Archived

Ruby

Backend Frameworks

CLI Tools

#ruby #processing #computer-vision

ImprintLab/Medical-SAM-Adapter

A lightweight adapter that bridges the Segment Anything Model (SAM) with medical imaging applications.

1.3K

Stable

Python

Computer Vision

API Frameworks

Python

#medical-imaging #segmentation #deep-learning

MasterBin-IIAU/UNINEXT

Universal instance perception model for object detection, segmentation, and tracking in videos.

1.3K

Archived

Python

Computer Vision

API Frameworks

PyTorch

#computer-vision #object-detection #object-tracking

ali-vilab/TeaCache

A Python library for accelerating inference of video diffusion models using timestep embedding caching.

1.3K

Experimental

Python

Inference

Computer Vision

Python

#video-generation #diffusion-models #inference-acceleration

jayleicn/animeGAN

A simple PyTorch implementation of Generative Adversarial Networks for generating anime-style faces.

1.3K

Archived

Jupyter Notebook

Computer Vision

Example Projects

PyTorch

#generative-adversarial-network #anime #computer-vision

soeaver/caffe-model

A collection of Caffe models and deployment files for popular networks used in tasks such as classification, detection, and segmentation.

1.3K

Archived

Python

Computer Vision

API Frameworks

Caffe

#caffe #caffemodel #classification

ShichenLiu/SoftRas

A differentiable renderer for 3D reasoning and reconstruction, useful for AI-driven 3D applications.

1.3K

Stable

Python

Computer Vision

API Frameworks

PyTorch

#3d-reconstruction #differentiable-rendering #computer-graphics

streamlit/demo-self-driving

A Streamlit app that demonstrates real-time object detection on the Udacity self-driving-car dataset.

1.3K

Active

Python

Computer Vision

Charts & Visualization

Streamlit

#computer-vision #object-detection #yolo

DAMO-NLP-SG/VideoLLaMA2

VideoLLaMA 2 is a Python library that advances spatial-temporal modeling and audio understanding in video-based large language models.

1.3K

Archived

Python

LLM Frameworks

Computer Vision

Python

#large-language-models #video-modeling #audio-understanding
