Computer Vision

Explore 2,275 open source projects in Computer Vision

Showing 681-700 of 2,275 projects

InternLM/InternLM-XComposer

A comprehensive multimodal system for long-term streaming video and audio interactions using large language models.

2.9K
Experimental
Python
LLM Frameworks
Computer Vision
PyTorch
#chatgpt#gpt-4#multimodal

victordibia/handtrack.js

A JavaScript library for prototyping real-time hand detection and tracking in the browser.

2.9K
Archived
JavaScript
Computer Vision
Animation & Motion
JavaScript
#handtracking#computer-vision#animation

mazzzystar/Queryable

Run OpenAI's CLIP and Apple's MobileCLIP model on iOS to search photos using semantic search.

2.9K
Archived
Swift
Computer Vision
iOS
Swift
#clip-model#mobileclip#natural-language-image-search

meagmohit/EEG-Datasets

A curated collection of public EEG datasets for researchers and developers working with brain-computer interfaces.

2.9K
Stable
Computer Vision
Tutorials & Courses
#eeg#brain-computer-interface#neuroscience

iscyy/ultralyticsPro

A Python library focused on improving YOLO models for object detection and computer vision tasks.

2.9K
Stable
Python
Computer Vision
API Frameworks
PyTorch
#yolo#computer-vision#object-detection

zhaipro/easy12306

A Python library that uses machine learning to automatically recognize 12306 (Chinese railway) captchas.

2.9K
Archived
Python
Computer Vision
API Frameworks
#captcha#machine-learning#deep-learning

Tencent/PocketFlow

An automatic model compression framework for developing smaller and faster AI applications.

2.9K
Archived
Python
Model Compression
Computer Vision
Python
#automl#model-compression#deep-learning

ndrplz/self-driving-car

This GitHub repository contains projects for the Udacity Self-Driving Car Engineer Nanodegree, focusing on computer vision, deep learning, and autonomous vehicle technology.

2.9K
Archived
C++
Computer Vision
Deep Learning
#computer-vision#deep-learning#autonomous-vehicles

biometrics/openbr

Open-source biometrics and face recognition library written in C++.

2.9K
Active
C++
Computer Vision
#biometrics#face-recognition#computer-vision

TylerYep/torchinfo

A Python library that provides a simple way to view and summarize PyTorch models.

2.9K
Active
Python
Computer Vision
CLI Tools
PyTorch
#pytorch#computer-vision#model-visualization

Saiyan-World/goku

A video generation foundation model for AI-powered developers and creators.

2.9K
Experimental
Python
LLM Frameworks
Computer Vision
Python
#video-generation#foundation-model#machine-learning

sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning

A PyTorch tutorial for building an image captioning model using the Show, Attend, and Tell technique.

2.9K
Archived
Python
Computer Vision
Tutorials & Courses
PyTorch
#image-captioning#attention-mechanism#encoder-decoder

zhulf0804/3D-PointCloud

A comprehensive collection of papers and datasets for 3D point cloud processing, useful for developers working on autonomous driving and computer vision.

2.9K
Archived
Python
Computer Vision
Datasets
Python
#point-cloud#autonomous-driving#classification

jeeliz/jeelizFaceFilter

A lightweight JavaScript library for real-time multi-face detection, tracking and AR face filters

2.9K
Stable
JavaScript
Animation & Motion
Computer Vision
Three.js
#face-detection#face-tracking#augmented-reality

sxyu/svox2

An efficient, neural-network-free 3D radiance field renderer for virtual view synthesis and reconstruction.

2.9K
Archived
Python
Computer Vision
Backend Frameworks
Python
#computer-vision#3d-rendering#neural-networks

nickliqian/cnn_captcha

A Python library that uses Tensorflow and convolutional neural networks to recognize character-based image captchas.

2.9K
Archived
Python
Computer Vision
API Frameworks
Tensorflow
#captcha-recognition#image-recognition#neural-networks

Dicklesworthstone/llm_aided_ocr

Corrects OCR errors via LLM post-processing, smart chunking & markdown formatting for PDFs

2.9K
Active
Python
Computer Vision
LLM Wrappers & SDKs
Tesseract
#ocr-correction#llm-postprocessing#pdf-parsing

NVIDIA/MinkowskiEngine

A high-performance, auto-diff neural network library for 3D and 4D sparse tensor computations.

2.9K
Archived
Python
Computer Vision
ML Ops
PyTorch
#3d-convolutional-network#4d-convolutional-neural-network#sparse-tensor-network

MashiroSaber03/Saber-Translator

AI-powered manga translator with OCR & bubble detection for Japanese comics to Chinese

2.9K
Active
Python
Computer Vision
LLM Wrappers & SDKs
Python
#manga-translation#ocr#ai-detection

microsoft/table-transformer

Deep learning model for extracting & analyzing table structures from PDFs and images with datasets.

2.9K
Archived
Python
Computer Vision
ETL & Pipelines
PyTorch
#table-extraction#computer-vision#document-processing
1...3436...114

Stay in the loop

Get weekly updates on trending AI coding tools and projects.