Unstructured-IO/unstructured

Unstructured is an open-source ETL solution for transforming complex documents into structured data for language models.

HTML
AI & Machine Learning
Document Processing
Apache-2.0

14.1K

Stars

1.2K

Forks

Sep 26, 2022

Created

Mar 4, 2026

Last Updated

Project Analytics

Stars Growth (1 Month)

+232

+1.7% change

Avg Daily Growth (1 Month)

+8.3

stars per day

Fork/Star Ratio (All Time)

8.4%

Normal engagement

Lifetime Growth

11.2

stars/day over 1.3K days

Stars Over Time

Forks Over Time

Open Issues Over Time

Pull Requests Over Time

Commits Over Time

AI-Generated Tags

document-processing
data-pipelines
natural-language-processing
pdf-to-text
ocr
machine-learning

Comments (0)

Sign in to leave a comment or vote

Sign In

No comments yet. Be the first to comment!

Stay in the loop

Get weekly updates on trending AI coding tools and projects.