Showing 301-320 of 321 projects
A minimal implementation of the Flash Attention algorithm in CUDA for efficient AI model inference.
Paddle.js is a web project for the Baidu PaddlePaddle deep learning framework, enabling browser-based inference.
RStan is an R interface to the Stan probabilistic programming language, used for Bayesian data analysis and inference.
Adds support for very large language models (vLLMs) to IndexTTS, enabling faster AI-powered text-to-speech inference.
A powerful C++ library for building causal models and performing advanced statistical analysis.
A Python library that boosts cost efficiency, inference accuracy, and cross-domain adaptability for complex QA systems.
A real-time portrait animation library that supports ONNX and TensorRT for fast inference on various platforms.
RTP-LLM is a high-performance LLM inference engine from Alibaba for diverse AI applications.
A C++ tutorial for the TensorRT deep learning inference engine optimized for NVIDIA GPUs.
This GitHub repository contains seminars from the DeepBayes Summer School 2018, focused on Bayesian deep learning and variational inference.
Inference code and configs for the ReplitLM model family, a large language model for AI-powered coding assistants.
Provides Bayesian data analysis demos in Python for developers interested in probabilistic modeling.
TinyMaix is a tiny inference library for microcontrollers, enabling efficient AI/ML on resource-constrained devices.
Minimal LLM inference in Rust, a lightweight library for running large language models.
This repository provides optimized PyTorch models and inference tools for NVIDIA GPUs, aimed at vibe coders building with AI tools.
A Python library for converting the Llama language model to ONNX format for faster inference.
BlackJAX is a Bayesian inference library for Python, focused on ease of use, speed, and modularity.
A tool for serving neural network models for inference, built with Java and supporting various AI frameworks.
Data quality assessment and reporting tool for data frames and database tables in R
Lightweight implementation of conformal prediction, a method for uncertainty estimation in machine learning.
Get weekly updates on trending AI coding tools and projects.