Explore Projects

Discover 321 open source projects

Active filters (1):
Search: inferenceร—
Clear all

Showing 81-100 of 321 projects

microsoft/LLMLingua

This project aims to speed up large language model (LLM) inference and enhance their understanding of key information through prompt and KV-Cache compression.

5.9K
Stable
Python
LLM Frameworks
Inference
Python
#llm#language-model#inference-optimization

Trusted-AI/adversarial-robustness-toolbox

A Python library for machine learning security, providing tools for adversarial attacks and defenses.

5.9K
Stable
Python
AI SDKs & Wrappers
Security Research
Python
#adversarial-attacks#adversarial-examples#machine-learning-security

katanemo/plano

Delivers infrastructure for agentic apps with AI-native proxy and data plane.

5.9K
Active
Rust
Rust
#proxy#gateway#LLM

vimeo/psalm

A powerful PHP static analysis tool that helps find errors and security vulnerabilities in PHP applications.

5.8K
Active
PHP
Linters & Formatters
Authentication
#php#security-analysis#static-analysis

uber/causalml

Causal inference and uplift modeling library for machine learning applications.

5.8K
Active
Python
Causal Inference
Uplift Modeling
Python
#causal-inference#uplift-modeling#machine-learning

aidlearning/AidLearning-FrameWork

AidLearning is a powerful AIOT development platform that provides a Linux environment with GUI, deep learning, and visual IDE support on Android.

5.7K
Archived
Python
AI SDKs & Wrappers
Cross-Platform
Android
#aiot#android-linux#android-ai

argmaxinc/WhisperKit

An open-source on-device speech recognition library for Apple Silicon devices, built with Swift and Transformers.

5.7K
Active
Swift
AI Voice & Speech
iOS
#speech-recognition#transformers#inference

NVIDIA/DALI

A highly optimized GPU-accelerated library for accelerating deep learning training and inference applications.

5.6K
Active
C++
GPU
Data Processing
PyTorch
#gpu#data-processing#deep-learning

huggingface/parler-tts

Inference and training library for high-quality text-to-speech (TTS) models.

5.5K
Archived
Python
AI Voice & Speech
API Frameworks
Python
#text-to-speech#tts#speech-synthesis

algorithmicsuperintelligence/openevolve

Open-source implementation of AlphaEvolve, a coding agent for iterative code optimization and discovery.

5.5K
Active
Python
Agents & Orchestration
AI Coding Agents
Python
#alpha-evolve#coding-agent#llm-engineering

leejet/stable-diffusion.cpp

A C/C++ implementation of Stable Diffusion and other diffusion models for image generation and processing.

5.5K
Active
C++
Computer Vision
Inference
#ai#image-generation#diffusion

OpenCSGs/csghub

An open-source platform for managing large language models, datasets, and agents with features similar to Hugging Face.

5.5K
Active
Vue
LLM Frameworks
LLM Wrappers & SDKs
Vue
#ai#llm#dataset

open-edge-platform/anomalib

An open-source anomaly detection library with state-of-the-art algorithms and features like experiment management and edge inference.

5.4K
Active
Python
Computer Vision
Inference
Python
#anomaly-detection#anomaly-localization#anomaly-segmentation

winfunc/deepreasoning

A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.

5.4K
Stable
Rust
LLM Frameworks
LLM Wrappers & SDKs
#anthropic#claude#chain-of-thought

NVIDIA/tacotron2

A PyTorch implementation of Tacotron 2, a state-of-the-art text-to-speech model, with faster-than-realtime inference.

5.3K
Archived
Jupyter Notebook
Speech & Audio
Inference
PyTorch
#text-to-speech#audio-generation#machine-learning

superduper-io/superduper

Superduper is an end-to-end framework for building custom AI applications and agents using Python, PyTorch, and Transformers.

5.3K
Stable
Python
LLM Frameworks
Agents & Orchestration
PyTorch
#ai#chatbot#mlops

flashinfer-ai/flashinfer

A Python library for serving large language models (LLMs) with high performance, including GPU acceleration and distributed inference.

5.1K
Active
Python
LLM Frameworks
Inference
PyTorch
#llm#inference#cuda

xlite-dev/Awesome-LLM-Inference

A curated list of awesome papers and code for optimizing LLM/VLM inference performance

5.0K
Active
Python
LLM Frameworks
LLM Wrappers & SDKs
#llm#inference#optimization

FellouAI/eko

Eko is an agentic framework that helps developers build production-ready AI-powered workflows with natural language interactions.

4.9K
Active
TypeScript
Agents & Orchestration
LLM Frameworks
TypeScript
#agent#agentic-ai#natural-language-inference

kvcache-ai/Mooncake

Mooncake is a serving platform for Kimi, a leading LLM service provided by Moonshot AI, focused on disaggregation, inference, and RDMA.

4.9K
Active
C++
LLM Frameworks
API Frameworks
C++
#llm#inference#rdma
1...46...17

Stay in the loop

Get weekly updates on trending AI coding tools and projects.