Explore Projects

Discover 29 open source projects

Active filters (1):
Search: perceptionร—
Clear all

Showing 1-20 of 29 projects

google-ai-edge/mediapipe

Cross-platform ML framework for real-time media processing

34.0K
Active
C++
Inference
Computer Vision
MediaPipe
#ml-framework#real-time-processing#android

isl-org/Open3D

Open3D is a modern C++ library for 3D data processing, including reconstruction, registration, and visualization.

13.4K
Active
C++
Computer Vision
React
#3d-perception#computer-graphics#mesh-processing

OpenDriveLab/UniAD

An open-source autonomous driving framework that focuses on planning-oriented autonomous driving.

4.5K
Stable
Python
Computer Vision
API Frameworks
Python
#autonomous-driving#motion-planning#perception-prediction-planning

fundamentalvision/BEVFormer

An open-source, camera-only framework for autonomous driving perception tasks like 3D object detection and semantic map segmentation.

4.3K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#autonomous-driving#object-detection#semantic-segmentation

open-mmlab/mmtracking

An open-source video perception toolbox for object detection, tracking, and instance segmentation tasks.

3.9K
Archived
Python
Computer Vision
API Frameworks
Python
#multi-object-tracking#single-object-tracking#video-instance-segmentation

mit-han-lab/efficientvit

Efficient vision foundation models for high-resolution generation and perception.

3.3K
Stable
Python
Computer Vision
ML Ops
Python
#deep-learning#computer-vision#image-generation

jixiaozhong/Sonic

Official implementation of a paper on improving portrait animation using global audio perception

3.2K
Active
Python
Computer Vision
API Frameworks
Python
#computer-vision#portrait-animation#audio-perception

mit-han-lab/bevfusion

A multi-task multi-sensor fusion library for bird's-eye view perception in 3D computer vision applications.

3.0K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#3d-perception#sensor-fusion#object-detection

Pointcept/Pointcept

Pointcept is a codebase for point cloud perception research, featuring the latest works on 3D computer vision.

2.9K
Active
Python
Computer Vision
API Frameworks
PyTorch
#3d-vision#point-cloud#computer-vision

hustvl/YOLOP

YOLOP is a Python-based autonomous-driving perception project for panopitic driving perception.

2.2K
Archived
Python
React
#authentication#streaming#real-time

opendatalab/DocLayout-YOLO

An open-source library that enhances document layout analysis using diverse synthetic data and adaptive perception.

2.0K
Experimental
Python
Computer Vision
API Frameworks
Python
#document-layout-analysis#computer-vision#synthetic-data

ZHOUYI1023/awesome-radar-perception

A curated list of radar datasets, detection, tracking and fusion for autonomous driving and vehicles.

1.8K
Experimental
React
#radar#autonomous-driving#fusion

nv-tlabs/vipe

Open-source video pose engine for geometric 3D perception using Python.

1.8K
Stable
Python
React
#3D#camera#depth estimation

iMoonLab/yolov13

Implementation of the state-of-the-art YOLOv13 object detection model with hypergraph-enhanced visual perception.

1.6K
Stable
Python
Computer Vision
API Frameworks
Python
#object-detection#real-time#hypergraph-learning

ShisatoYano/AutonomousVehicleControlBeginnersGuide

This Python project is a technical guide for beginners to study algorithms and software architectures for autonomous vehicle control.

1.4K
Active
Python
Computer Vision
API Frameworks
#autonomous-vehicles#computer-vision#algorithm

OpenDriveLab/Birds-eye-view-Perception

OpenDriveLab's Birds-eye-view Perception project provides a cookbook and research for autonomous driving, with Python implementation.

1.4K
Experimental
Python
React
#authentication#perception-algorithm#autonomous-driving

CUT3R/CUT3R

Continuous 3D Perception Model with Persistent State, a Python library for 3D computer vision tasks.

1.3K
Stable
Python
Computer Vision
#computer-vision#3d-perception#machine-learning

a-real-ai/pywinassistant

An open-source AI agent that interacts with graphical user interfaces using natural language

1.3K
Experimental
Python
Agents & Orchestration
Graphical-User-Interface
Python
#artificial-intelligence#natural-language-processing#graphical-user-interface

MasterBin-IIAU/UNINEXT

Universal instance perception model for object detection, segmentation, and tracking in videos.

1.3K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#computer-vision#object-detection#object-tracking

NVIDIA-ISAAC-ROS/isaac_ros_visual_slam

Visual SLAM/odometry package for NVIDIA-accelerated cuVSLAM

1.3K
Stable
C++
ROS
#ros2#visual-odometry#gpu
2

Stay in the loop

Get weekly updates on trending AI coding tools and projects.