Showing 21-40 of 390 projects
A high-performance async web scraping tool for extracting data from Douyin, TikTok, Bilibili and more.
A fast and full-featured command-line argument parser for Rust developers.
A Go library for parsing and querying HTML documents, providing a jQuery-like API.
Unified framework for building enterprise RAG pipelines with small, specialized models
Unstructured is an open-source ETL solution for transforming complex documents into structured data for language models.
A Pythonic HTML parsing library that simplifies web scraping and interaction with HTTP resources.
Nvim Treesitter is a Neovim plugin that provides a high-performance incremental parsing system for various programming languages.
An educational resource for developers to learn how to build a compiler from scratch in C.
A modern, pirate-themed command-line interface for parsing complex options and arguments.
A powerful Rust parser combinator framework for building efficient and extensible parsers.
CoreNLP is a comprehensive NLP toolkit that provides powerful language processing capabilities for Java developers.
A pure-Python library for manipulating PDF documents, including splitting, merging, cropping, and transforming pages.
A Python library that provides a powerful API for extracting text and tables from PDF files.
A PHP library for parsing and manipulating DocBlocks, which are essential for documenting code.
A Haskell library for parsing, analyzing, and comparing source code across many programming languages.
A Python library for parsing and transpiling SQL queries across various databases and engines.
A robust and flexible query string parsing and serializing library for JavaScript projects.
Dolphin is a document image parsing library that uses heterogeneous anchor prompting for OCR and layout analysis.
A high-performance PDF processor written in Go for tasks like parsing, manipulating, and converting PDF files.
A modular and type-safe schema library for validating structural data, focused on developer productivity.
Get weekly updates on trending AI coding tools and projects.