Explore Projects

Discover 321 open source projects

Active filters (1):
Search: inferenceร—
Clear all

Showing 201-220 of 321 projects

webonnx/wonnx

A WebGPU-accelerated ONNX inference run-time written 100% in Rust, ready for native and the web.

1.7K
Archived
Rust
Inference
Frontend Frameworks
Rust
#onnx#webassembly#webgpu

vllm-project/vllm-ascend

A community-maintained hardware plugin for running large language models (LLMs) on Ascend accelerators.

1.7K
Active
C++
LLM Frameworks
Inference
#ascend#llm-serving#llmops

mozilla-ai/any-llm

A Python library that provides a unified interface for communicating with large language models (LLMs).

1.7K
Active
Python
LLM Wrappers & SDKs
API Clients & Testing
Python
#language-models#inference#text-generation

Xilinx/Vitis-AI

Vitis AI is Xilinx's development stack for AI inference on Xilinx hardware platforms, including both edge devices and Alveo cards.

1.7K
Stable
Python
Inference
Embedded
#ai-inference#embedded-systems#hardware-acceleration

laugh12321/TensorRT-YOLO

A toolkit for easier and faster deployment of YOLO (You Only Look Once) object detection models using NVIDIA TensorRT.

1.7K
Active
C++
Computer Vision
API Frameworks
#object-detection#yolo#tensorrt

cvlab-columbia/viper

This repository provides code for a paper on using Python execution for visual reasoning tasks.

1.7K
Archived
Jupyter Notebook
Computer Vision
#computer-vision#visual-reasoning#python-execution

Maratyszcza/NNPACK

Highly optimized library for accelerating neural network inference on multi-core CPUs

1.7K
Archived
C
Inference
API Frameworks
C
#cpu#neural-networks#high-performance

metosin/malli

Malli is a high-performance, data-driven data specification library for Clojure and ClojureScript.

1.7K
Active
Clojure
Backend Frameworks
ORMs & Query Builders
Clojure
#clojure#clojurescript#data-validation

ELS-RD/transformer-deploy

Efficient CPU/GPU inference server for Hugging Face transformer models

1.7K
Archived
Python
React
#inference-server#transformer-models#hugging-face

4paradigm/OpenMLDB

OpenMLDB is an open-source machine learning database that provides a feature platform for training and inference.

1.7K
Active
C++
ML Ops
Databases
#database-for-ai#feature-engineering#feature-extraction

jingsongliujing/OnnxOCR

A lightweight OCR system based on PaddleOCR, with ultra-fast inference speed and decoupled from the PaddlePaddle deep learning framework.

1.7K
Stable
Python
Computer Vision
API Frameworks
Python
#ocr#computer-vision#deep-learning

jquesnelle/yarn

Efficient context window extension for large language models, enabling faster and more accurate inference.

1.7K
Archived
Python
LLM Wrappers & SDKs
CLI Tools
Python
#language-models#inference-optimization#context-windows

aphrodite-engine/aphrodite-engine

A large-scale LLM inference engine built in C++ with support for various AI hardware accelerators.

1.7K
Active
C++
LLM Frameworks
Inference
#machine-learning#inference-engine#cuda

NVIDIA/trt-samples-for-hackathon-cn

Simple samples for TensorRT programming, a powerful GPU-accelerated inference optimization library from NVIDIA.

1.7K
Active
Python
AI SDKs & Wrappers
#gpu-acceleration#inference-optimization#nvidia-tensorrt

timoschick/pet

Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference

1.6K
Archived
Python
NLP
React
#natural-language-inference#few-shot-text-classification#cloze-questions

kijai/ComfyUI-Florence2

Inference tool for Microsoft's Florence2 Versatile Language Model (VLM), built for vibe coders using AI tools.

1.6K
Active
Python
LLM Frameworks
Inference
Python
#llm#language-model#inference

DT42/BerryNet

A deep learning gateway for Raspberry Pi and other edge devices, enabling AI inference at the edge.

1.6K
Archived
Python
Edge AI
Raspberry Pi
TensorFlow
#aiot#edge-ai#edge-computing

microsoft/onnxruntime-inference-examples

Examples for using ONNX Runtime for machine learning inferencing.

1.6K
Active
C++
Inference
#machine-learning#onnx#inference

mit-han-lab/smoothquant

SmoothQuant is an efficient post-training quantization tool for large language models, enabling accurate and fast inference.

1.6K
Archived
Python
LLM Frameworks
Inference
Python
#quantization#large-language-models#performance-optimization

jakobrunge/tigramite

Tigramite is a Python library for causal inference and time series analysis with a focus on time series data.

1.6K
Active
Jupyter Notebook
ML Ops
Databases
#causal-inference#time-series#data-analysis
1...1012...17

Stay in the loop

Get weekly updates on trending AI coding tools and projects.