Showing 141-160 of 351 projects
ONNX-TensorRT is a C++ library that provides a TensorRT backend for the ONNX deep learning framework.
A high-performance Transformer library for accelerating AI models on NVIDIA GPUs, including low-precision support.
A tactical manual to systematically maximize the performance of your deep learning models.
A sparsity-aware deep learning inference runtime for CPUs, optimized for performance and efficiency.
A unified inference and post-training framework for accelerated video generation powered by AI.
A recurrent neural network for generating little stories about images.
A distributed inference engine for large language models and StableDiffusion on mobile, desktop and server.
A Python framework for efficient model inference with omni-modality AI models.
High-performance vector graph neural network database in Rust for real-time AI inference and graph ML.
A lightweight, self-contained Rust library for running Tensorflow and ONNX models with no dependencies
Easily compute CLIP embeddings and build a CLIP-based retrieval system with this Jupyter Notebook library.
A simple framework for accelerating LLM generation with multiple decoding heads
This repository provides an AI-powered toolkit for developing applications with the Rockchip RKNN inference engine.
This repository provides code and models for running inference with the SAM 3D Body Model, a tool for 3D body reconstruction.
An AI-powered tool for training supervised models without manual labeling, using foundation models and multimodal learning.
RamaLama simplifies local serving of AI models and enables their use for inference in production via containers.
A probabilistic programming library powered by NumPy and JAX for Bayesian inference and MCMC sampling.
Achieve state-of-the-art inference performance on modern accelerators with this Kubernetes-based solution.
Official PyTorch implementation of U-GAT-IT, an unsupervised generative adversarial network for image-to-image translation.
Simplified implementations of deep learning related works for developers interested in AI and machine learning.
Get weekly updates on trending AI coding tools and projects.