Explore Projects

Discover 321 open source projects

Active filters (1):
Search: inferenceร—
Clear all

Showing 301-320 of 321 projects

tspeterkim/flash-attention-minimal

A minimal implementation of the Flash Attention algorithm in CUDA for efficient AI model inference.

1.1K
Archived
Cuda
LLM Frameworks
Inference
#cuda#attention-mechanism#deep-learning

PaddlePaddle/Paddle.js

Paddle.js is a web project for the Baidu PaddlePaddle deep learning framework, enabling browser-based inference.

1.1K
Archived
JavaScript
Inference
Frontend Frameworks
React
#deep-learning#inference-engine#paddlepaddle

stan-dev/rstan

RStan is an R interface to the Stan probabilistic programming language, used for Bayesian data analysis and inference.

1.1K
Active
R
Bayesian Inference
Databases
#bayesian-statistics#mcmc#r-package

Ksuriuri/index-tts-vllm

Adds support for very large language models (vLLMs) to IndexTTS, enabling faster AI-powered text-to-speech inference.

1.1K
Stable
Python
LLM Frameworks
Inference
Python
#text-to-speech#llm#inference

grf-labs/grf

A powerful C++ library for building causal models and performing advanced statistical analysis.

1.1K
Stable
C++
ML Ops
API Frameworks
#causal-inference#econometrics#random-forest

TencentCloudADP/youtu-graphrag

A Python library that boosts cost efficiency, inference accuracy, and cross-domain adaptability for complex QA systems.

1.1K
Active
Python
LLM Frameworks
Agents & Orchestration
Python
#llm#graph#rag

warmshao/FasterLivePortrait

A real-time portrait animation library that supports ONNX and TensorRT for fast inference on various platforms.

1.1K
Experimental
Python
Computer Vision
Animation & Motion
Python
#real-time#portrait#animation

alibaba/rtp-llm

RTP-LLM is a high-performance LLM inference engine from Alibaba for diverse AI applications.

1.1K
Active
Cuda
LLM Frameworks
LLM Inference
CUDA
#gpt#llama#llm

LitLeo/TensorRT_Tutorial

A C++ tutorial for the TensorRT deep learning inference engine optimized for NVIDIA GPUs.

1.0K
Archived
C++
Inference
API Frameworks
#machine-learning#inference#gpu-acceleration

bayesgroup/deepbayes-2018

This GitHub repository contains seminars from the DeepBayes Summer School 2018, focused on Bayesian deep learning and variational inference.

1.0K
Archived
Jupyter Notebook
Variational Inference
Bayesian Deep Learning
Jupyter Notebook
#bayesian#deep-learning#variational-inference

replit/ReplitLM

Inference code and configs for the ReplitLM model family, a large language model for AI-powered coding assistants.

1.0K
Archived
Python
LLM Frameworks
Inference
Python
#large-language-model#code-generation#ai-coding-assistant

avehtari/BDA_py_demos

Provides Bayesian data analysis demos in Python for developers interested in probabilistic modeling.

1.0K
Stable
Jupyter Notebook
Databases
LLM Frameworks
Python
#bayesian-inference#mcmc#probabilistic-modeling

sipeed/TinyMaix

TinyMaix is a tiny inference library for microcontrollers, enabling efficient AI/ML on resource-constrained devices.

1.0K
Experimental
C
Inference
Arduino & Embedded
#tinyml#microcontroller#inference

samuel-vitorino/lm.rs

Minimal LLM inference in Rust, a lightweight library for running large language models.

1.0K
Archived
Rust
LLM Frameworks
#llm#rust#inference

huggingface/optimum-nvidia

This repository provides optimized PyTorch models and inference tools for NVIDIA GPUs, aimed at vibe coders building with AI tools.

1.0K
Experimental
Python
LLM Wrappers & SDKs
Inference
PyTorch
#machine-learning#nvidia#optimization

microsoft/Llama-2-Onnx

A Python library for converting the Llama language model to ONNX format for faster inference.

1.0K
Archived
Python
LLM Frameworks
LLM Wrappers & SDKs
Python
#llm#onnx#language-model

blackjax-devs/blackjax

BlackJAX is a Bayesian inference library for Python, focused on ease of use, speed, and modularity.

1.0K
Stable
Python
Inference
Sampling Methods
Python
#bayesian-inference#hamiltonian-monte-carlo#probabilistic-programming

awslabs/multi-model-server

A tool for serving neural network models for inference, built with Java and supporting various AI frameworks.

1.0K
Archived
Java
Inference
API Frameworks
#ai#deep-learning#inference

rstudio/pointblank

Data quality assessment and reporting tool for data frames and database tables in R

1.0K
Active
R
Data Validation
Testing
#data-quality#data-validation#data-testing

aangelopoulos/conformal-prediction

Lightweight implementation of conformal prediction, a method for uncertainty estimation in machine learning.

1.0K
Stable
Jupyter Notebook
Uncertainty Estimation
Machine Learning
Python
#conformal-prediction#uncertainty-estimation#machine-learning
1...1517

Stay in the loop

Get weekly updates on trending AI coding tools and projects.