Showing 101-120 of 848 projects
XGo is a programming language that lets you leverage assets from C/C++, Go, Python, and JavaScript/TypeScript.
A tiny vision language model for developers building AI-powered applications.
A robust and highly performant video matting library for PyTorch, TensorFlow, TensorFlow.js, ONNX, and CoreML.
A collection of tutorials and notebooks on state-of-the-art computer vision models and techniques for developers.
A powerful, high-performance React Native camera library for building mobile apps with advanced camera features.
A PyTorch library with pre-trained convolutional networks for various computer vision tasks.
Versatile database for AI, supporting storage, querying, versioning, and visualization of any AI data.
Curated list of Machine Learning, NLP, Vision, and Recommender Systems project ideas for developers.
A Python library that empowers developers to build applications and systems with self-contained computer vision capabilities.
A guide to deploying deep-learning inference networks and computer vision primitives with NVIDIA Jetson hardware and TensorRT.
A comprehensive collection of study materials for deep learning interviews, covering topics such as machine learning, computer vision, and NLP.
Animates portrait images using hierarchical audio-driven visual synthesis.
An ultra-simple, state-of-the-art codebase for autoregressive image generation using advanced AI models.
A comprehensive computer vision and robotics library for tasks like SLAM, object detection, and more.
A comprehensive SDK for Intel's RealSense depth cameras, enabling 3D computer vision applications.
Lab materials for an introductory course on deep learning from MIT, covering computer vision, music generation, and more.
DAIN is a Python library for depth-aware video frame interpolation, a computer vision technique.
A Java library that provides a high-level API to access computer vision and multimedia functionality from OpenCV, FFmpeg, and other libraries.
Effortless data labeling with AI support from Segment Anything and other powerful models.
A multilingual document layout parsing model that can extract text, images, and structure from documents in a single vision-language model.