A Rust library for deduplicating text datasets, potentially useful for machine learning projects.
Scalable identity resolution, entity resolution, data mastering, and deduplication using ML.
Attic is a deduplicating backup program that can be used to securely back up data to remote or local storage.
A powerful Python library for record linkage and duplicate detection in data-driven applications.