facebookresearch/cc_net

Tools to download and cleanup Common Crawl data, a large web crawl dataset, for further analysis and processing.

Python
Data & Databases
ETL & Pipelines
MIT

1.0K

Stars

153

Forks

Oct 29, 2019

Created

Apr 25, 2023

Last Updated

Project Analytics

Stars Growth (1 Month)

+0

+0.0% change

Avg Daily Growth (1 Month)

+0.0

stars per day

Fork/Star Ratio (All Time)

14.7%

Good engagement

Lifetime Growth

0.4

stars/day over 2.3K days

Stars Over Time

Forks Over Time

Open Issues Over Time

Pull Requests Over Time

Commits Over Time

AI-Generated Tags

data-processing
web-crawling
data-cleanup
common-crawl
etl
cli-tool

Comments (0)

Sign in to leave a comment or vote

Sign In

No comments yet. Be the first to comment!

Stay in the loop

Get weekly updates on trending AI coding tools and projects.