Showing 1441-1460 of 2,275 projects
Audio Spectrogram Transformer (AST) for audio classification and representation learning tasks.
SimpleRecon is a 3D reconstruction library that uses a novel approach without 3D convolutions, optimized for computer vision tasks.
A research framework for easy and efficient training of Generative Adversarial Networks (GANs) based on PyTorch.
A C++ library for accelerating YOLO-based computer vision models using NVIDIA's TensorRT framework.
GVHMR is a Jupyter Notebook project for recovering human motion via gravity-view coordinates.
A PyTorch-based object detection library with a Receptive Field Block Network for fast and accurate performance.
A PyTorch toolbox for domain generalization, domain adaptation and semi-supervised learning.
A unified UI and API for processing and training images for facial recognition across various AI tools.
An open-source differentiable dense SLAM library for PyTorch, focused on 3D reconstruction and robotics.
A Python script that enables swapping facial features between images, useful for developers working on computer vision projects.
Object tracking implementation with YOLOv4, DeepSort, and TensorFlow for computer vision applications.
An open-source 6DoF head tracking software for gaming, simulations, and virtual experiences.
This is a dataset of character animation and motion capture data for developers working on AI-powered animation tools.
Curated collection of Brain-Computer Interface (BCI) resources for developers and researchers.
A Python library for processing and analyzing scientific audio data, particularly for bird song detection and recognition.
A labeling extension for Automatic1111's Stable Diffusion web UI, focused on AI-assisted content tagging.
A curation of the latest CVPR (Computer Vision and Pattern Recognition) papers, code, and demos for AI-powered developers.
A Python script that automatically updates daily Computer Vision papers from the ArXiv using GitHub Actions.
A simple image captioning model built using the CLIP neural network for generating captions for images.
StableAnimator is an end-to-end video diffusion framework for synthesizing high-quality, ID-preserving videos from a reference image and pose sequence.
Get weekly updates on trending AI coding tools and projects.