togethercomputer/RedPajama-Data

A repository for preparing large datasets for training large language models (LLMs).

Python
AI & Machine Learning
LLM Frameworks
Apache-2.0

Stars: 4.9K
Forks: 372
Created: Apr 14, 2023
Last Updated: Dec 7, 2024

Project Analytics

Stars Growth (1 Month): +0 (+0.0% change)
Avg Daily Growth (1 Month): +0.0 stars per day
Fork/Star Ratio (All Time): 7.6% (normal engagement)
Lifetime Growth: 4.7 stars/day over 1.1K days

AI-Generated Tags

language-models
dataset-preparation
cli-tool
machine-learning
open-source
