Showing 61-80 of 321 projects
A guide to deploying deep-learning inference networks and computer vision primitives with NVIDIA Jetson hardware and TensorRT.
An accelerator for local LLM inference and fine-tuning on Intel XPUs, with seamless integration into popular LLM frameworks.
Ultra-efficient large language models (LLMs) for end devices, enabling fast on-device reasoning and inference.
BentoML is an easy-to-use framework for building and deploying production-ready machine learning models as APIs.
An extensible toolkit for finetuning and inference of large language models, enabling 'large models for all'.
The repository provides code and tools for running inference and fine-tuning with the Meta Segment Anything Model 3 (SAM 3).
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions.
TensorRT implementation of popular deep learning networks for efficient inference on GPUs
AlphaFold 3 is a Python-based inference pipeline for protein structure prediction using deep learning.
LMDeploy is a toolkit for compressing, deploying, and serving large language models (LLMs).
Supercharge your large language models (LLMs) with the fastest key-value cache layer for lightning-fast inference.
StarCoder is a Python library for fine-tuning and inference of large language models.
A lightweight face detection model optimized for inference on edge devices.
PaddlePaddle Lite is a high-performance deep learning inference engine for mobile and edge devices.
A runtime type system for IO decoding/encoding in TypeScript, providing a flexible and type-safe way to work with data.
A lightweight, standalone C++ inference engine for Google's Gemma AI models.
A Rust library for blazingly fast LLM inference, useful for AI coding and ML applications.
A comprehensive collection of best practices and examples for natural language processing (NLP) using Python.
Modeling, training, evaluation, and inference code for OLMo, a large language model.
A distributed inference serving framework for AI applications, built with Rust for high performance and scalability.
Get weekly updates on trending AI coding tools and projects.