Explore Projects

Discover 10 open source projects

Active filters (1):
Search: data-extraction

firecrawl/firecrawl

Convert websites into LLM-ready data with an API for scraping, crawling, and structured data extraction

88.5K
Active
TypeScript
Web Scraping AI
Agents & Orchestration
TypeScript
#ai-scraping #web-crawler #llm-data

D4Vinci/Scrapling

Powerful, flexible Python library for effortless web scraping with AI-powered features.

23.6K
Active
Python
Web Scraping
Backend Frameworks
Python
#web-scraping #automation #data-extraction

ScrapeGraphAI/Scrapegraph-ai

AI-powered web scraping library for extracting data from websites and documents

22.9K
Active
Python
Web Scraping AI
RAG & Vector
Python
#ai-scraping #llm #rag

getmaxun/maxun

Turn websites into clean data pipelines & structured APIs in minutes with a low-code web scraping tool.

15.2K
Active
TypeScript
API Clients & Testing
React
#web-scraping #automation #no-code

vi3k6i5/flashtext

A powerful Python library for keyword extraction and text processing for natural language tasks.

5.7K
Experimental
Python
NLP
Data Extraction
#keyword-extraction #text-processing #nlp

brightdata/brightdata-mcp

A powerful MCP server that provides an all-in-one solution for public web access and data extraction.

2.2K
Active
JavaScript
MCP Servers
Backend Frameworks
Node.js
#mcp #web-scraping #data-extraction

shcherbak-ai/contextgem

A Python library for extracting data and LLM outputs from various document types with ease.

1.8K
Stable
Python
LLM Frameworks
Data Extraction
#llm #data-extraction #document-intelligence

hi-primus/optimus

Agile data preparation workflows made easy with popular Python data science libraries.

1.5K
Archived
Python
ETL & Pipelines
API Frameworks
#big-data-cleaning #data-analysis #data-cleaning

raznem/parsera

Lightweight Python library for scraping websites using large language models (LLMs) and the Playwright browser automation tool.

1.3K
Stable
Python
LLM Frameworks
Backend Frameworks
Python
#ai #scraping #data-extraction

thinh-vu/vnstock

A beginner-friendly Python toolkit for financial data extraction, analysis, and automation.

1.2K
Active
Python
ETL & Pipelines
Backend Frameworks
Python
#data-extraction #quantitative-analysis #stock-market