Showing 1-6 of 6 projects
Official implementation of EAGLE, a framework for developing AI-powered coding tools and language models.
A Python library that provides SOTA compression techniques and efficient LLM inference on Intel platforms to build chatbots quickly.
A Python library for optimizing deep learning models for faster inference on deployment platforms like TensorRT.
A large-scale LLM inference engine built in C++ with support for various AI hardware accelerators.
A Python-based Bitcoin trading bot with a real-time dashboard for the Bitstamp exchange.
A curated collection of must-read papers and blogs on speculative decoding techniques for developers.
Get weekly updates on trending AI coding tools and projects.