Showing 1-20 of 46 projects
TensorRT LLM provides a Python API and optimizations to efficiently run large language models on NVIDIA GPUs.
NVIDIA TensorRT is an SDK for high-performance deep learning inference on NVIDIA GPUs.
YOLOX is a high-performance anchor-free YOLO model for object detection
A guide to deploying deep-learning inference networks and computer vision primitives with NVIDIA Jetson hardware and TensorRT.
A powerful multi-object tracking library with modular SOTA tracking modules for segmentation, detection, and pose estimation.
TensorRT implementation of popular deep learning networks for efficient inference on GPUs
A curated list of awesome papers and code for optimizing LLM/VLM inference performance
An easy-to-use PyTorch to TensorRT converter for optimizing AI model inference on NVIDIA Jetson devices.
A tutorial repository for learning PyTorch, covering computer vision, NLP, and deep learning model deployment.
A lite C++ AI toolkit with 100+ models for computer vision tasks like detection, segmentation, and image generation.
A nearly-live implementation of OpenAI's Whisper, a powerful speech recognition and translation tool.
Collection of generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
ONNX-TensorRT is a C++ library that provides a TensorRT backend for the ONNX deep learning framework.
A fast and accurate object detection method with new technologies like NAS backbones and efficient RepGFPN.
A powerful object detection and instance segmentation library built on top of YOLOv7 and transformers, with TensorRT acceleration.
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM.
OpenMMLab's model deployment framework for optimizing and deploying computer vision models across various backends.
PyTorch compiler for NVIDIA GPUs using TensorRT, enabling efficient deep learning inference on CUDA hardware.
A collection of computer vision and AI projects in Python, C++, and embedded systems for developers.
A C++ API and server for deep learning that supports popular frameworks like PyTorch, TensorFlow, and XGBoost.
Get weekly updates on trending AI coding tools and projects.