Showing 41-60 of 321 projects
NVIDIA TensorRT is an SDK for high-performance deep learning inference on NVIDIA GPUs.
A lightweight Chinese OCR library that supports vertical text recognition and NCNN/MNN/TNN inference with a small model size.
Deploy open-source LLMs as OpenAI-compatible API endpoints using BentoML's model serving framework.
A powerful Python library for working with large language models (LLMs) and natural language processing tasks.
Open-source cloud platform with elastic compute, storage, databases, AI, and Kubernetes services.
Spark-TTS is an open-source Python library for high-quality text-to-speech inference.
A collection of Jupyter notebooks showcasing how to build and deploy machine learning models with Amazon SageMaker.
Large Language Model Text Generation Inference library for developers working with AI tools and models.
Official inference library for Mistral models, a platform for building AI-powered applications.
An optimized cloud and edge inference solution for deploying and running machine learning models.
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
Production ready AI toolkit for local AI inference
A distributed system for running large language models (LLMs) on personal devices, enabling faster fine-tuning and inference.
OpenVINO is an open-source toolkit for optimizing and deploying AI inference on a variety of hardware.
A high-performance distributed file system designed for AI workloads like training and inference.
A powerful Bayesian modeling and probabilistic programming library for Python developers.
Unified, production-ready inference API to run open-source, speech, and multimodal models on cloud, on-prem, or your laptop.
A deep universal probabilistic programming library for Python and PyTorch, enabling Bayesian machine learning.
Easily fine-tune, evaluate and deploy open-source large language models like GPT-OSS and Llama.
High-performance C++ library for fast local deployment of large language models (LLMs) like LLaMA.
Get weekly updates on trending AI coding tools and projects.