Showing 201-220 of 321 projects
A WebGPU-accelerated ONNX inference run-time written 100% in Rust, ready for native and the web.
A community-maintained hardware plugin for running large language models (LLMs) on Ascend accelerators.
A Python library that provides a unified interface for communicating with large language models (LLMs).
Vitis AI is Xilinx's development stack for AI inference on Xilinx hardware platforms, including both edge devices and Alveo cards.
A toolkit for easier and faster deployment of YOLO (You Only Look Once) object detection models using NVIDIA TensorRT.
This repository provides code for a paper on using Python execution for visual reasoning tasks.
Highly optimized library for accelerating neural network inference on multi-core CPUs
Malli is a high-performance, data-driven data specification library for Clojure and ClojureScript.
Efficient CPU/GPU inference server for Hugging Face transformer models
OpenMLDB is an open-source machine learning database that provides a feature platform for training and inference.
A lightweight OCR system based on PaddleOCR, with ultra-fast inference speed and decoupled from the PaddlePaddle deep learning framework.
Efficient context window extension for large language models, enabling faster and more accurate inference.
A large-scale LLM inference engine built in C++ with support for various AI hardware accelerators.
Simple samples for TensorRT programming, a powerful GPU-accelerated inference optimization library from NVIDIA.
Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference
Inference tool for Microsoft's Florence2 Versatile Language Model (VLM), built for vibe coders using AI tools.
A deep learning gateway for Raspberry Pi and other edge devices, enabling AI inference at the edge.
Examples for using ONNX Runtime for machine learning inferencing.
SmoothQuant is an efficient post-training quantization tool for large language models, enabling accurate and fast inference.
Tigramite is a Python library for causal inference and time series analysis with a focus on time series data.
Get weekly updates on trending AI coding tools and projects.