Showing 1-5 of 5 projects
NLTK Data is a collection of datasets, models, and other resources for natural language processing in Python.
A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.
A Python library for easy data augmentation of Chinese text corpora using the EDA (Easy Data Augmentation) technique.
An open-source library for automatic high-quality phrase mining from large text corpora.
A data repository for pre-trained NLP models and corpora to use in language processing projects.
Get weekly updates on trending AI coding tools and projects.