turboderp/exllama

A memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Language: Python
Categories: AI & Machine Learning, LLM Wrappers & SDKs
License: MIT

Stars: 2.9K
Forks: 221
Created: May 4, 2023
Last Updated: Sep 30, 2023

Project Analytics

Stars Growth (1 Month): +6 (+0.2% change)
Avg Daily Growth (1 Month): +0.2 stars per day
Fork/Star Ratio (All Time): 7.6% (normal engagement)
Lifetime Growth: 2.8 stars/day over 1.0K days
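The analytics above are simple ratios over the raw repo counts. As a minimal sketch of how they could be computed (the helper names are hypothetical, not from any real API; the numbers are the ones listed above):

```python
# Hypothetical helpers illustrating the analytics math above.
# Names and inputs are illustrative assumptions, not a real API.

def fork_star_ratio(forks: int, stars: int) -> float:
    """Forks as a percentage of stars (the 'Fork/Star Ratio')."""
    return 100.0 * forks / stars

def avg_daily_growth(star_delta: int, days: int) -> float:
    """Average stars gained per day over a window."""
    return star_delta / days

stars, forks = 2900, 221   # 2.9K stars, 221 forks
monthly_delta = 6          # +6 stars over the last ~30 days

print(f"Fork/Star ratio: {fork_star_ratio(forks, stars):.1f}%")          # 7.6%
print(f"Avg daily growth (1 mo): {avg_daily_growth(monthly_delta, 30):.1f}")  # 0.2
```

With these inputs the sketch reproduces the listed figures: 221/2900 ≈ 7.6% and 6 stars over 30 days ≈ 0.2 stars per day.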

Charts (not reproduced here): Stars, Forks, Open Issues, Pull Requests, and Commits over time.

AI-Generated Tags

llama
transformers
quantized-weights
memory-efficient
language-model
