attardi/wikiextractor

A Python tool for extracting plain text from Wikipedia dumps, useful for natural language processing tasks.

Python
Backend & APIs
API Frameworks
AGPL-3.0

4.0K

Stars

1.0K

Forks

Mar 22, 2015

Created

May 23, 2024

Last Updated

Project Analytics

Stars Growth (1 Month)

+2

+0.1% change

Avg Daily Growth (1 Month)

+0.1

stars per day

Fork/Star Ratio (All Time)

25.3%

High engagement

Lifetime Growth

1.0

stars/day over 4.0K days

Stars Over Time

Forks Over Time

Open Issues Over Time

Pull Requests Over Time

Commits Over Time

AI-Generated Tags

wikipedia
text-extraction
nlp
data-pipeline
command-line-tool

Comments (0)

Sign in to leave a comment or vote

Sign In

No comments yet. Be the first to comment!

Stay in the loop

Get weekly updates on trending AI coding tools and projects.