google-research/deduplicate-text-datasets

A Rust library for deduplicating text datasets, aimed at cleaning training data for machine learning projects.

Rust
AI & Machine Learning
Data & Databases
Apache-2.0

Stars: 1.3K
Forks: 128
Created: Jul 16, 2021
Last Updated: Jul 30, 2024

Project Analytics

Stars Growth (1 Month): +6 (+0.5% change)
Avg Daily Growth (1 Month): +0.2 stars per day
Fork/Star Ratio (All Time): 10.2% (Good engagement)
Lifetime Growth: 0.7 stars/day over 1.7K days

AI-Generated Tags

data-deduplication
text-processing
machine-learning
data-cleaning
rust
cli-tool
