Explore Projects

Discover 390 open source projects

Active filters (1):
Search: parseร—
Clear all

Showing 21-40 of 390 projects

Evil0ctal/Douyin_TikTok_Download_API

A high-performance async web scraping tool for extracting data from Douyin, TikTok, Bilibili and more.

16.5K
Stable
Python
API Frameworks
FastAPI
#api#async#scraper

clap-rs/clap

A fast and full-featured command-line argument parser for Rust developers.

16.2K
Active
Rust
CLI Tools
#argument-parsing#command-line#rust

PuerkitoBio/goquery

A Go library for parsing and querying HTML documents, providing a jQuery-like API.

14.9K
Active
Go
Backend Frameworks
#html-parsing#selector-strings#jquery

llmware-ai/llmware

Unified framework for building enterprise RAG pipelines with small, specialized models

14.9K
Active
Python
Next.js
#LLM Frameworks#RAG Pipelines#Small Specialized Models

Unstructured-IO/unstructured

Unstructured is an open-source ETL solution for transforming complex documents into structured data for language models.

14.1K
Active
HTML
Document Processing
#document-processing#data-pipelines#natural-language-processing

psf/requests-html

A Pythonic HTML parsing library that simplifies web scraping and interaction with HTTP resources.

13.9K
Archived
Python
Backend Frameworks
#web-scraping#http-client#html-parsing

nvim-treesitter/nvim-treesitter

Nvim Treesitter is a Neovim plugin that provides a high-performance incremental parsing system for various programming languages.

13.3K
Active
Tree-sitter Query
IDE Extensions
Neovim
#neovim#nvim-treesitter#tree-sitter

DoctorWkt/acwj

An educational resource for developers to learn how to build a compiler from scratch in C.

12.8K
Stable
C
Tutorials & Courses
#compiler#lexical-analysis#parsing

yargs/yargs

A modern, pirate-themed command-line interface for parsing complex options and arguments.

11.5K
Stable
JavaScript
React
#command-line interface#options parsing#argument handling

rust-bakery/nom

A powerful Rust parser combinator framework for building efficient and extensible parsers.

10.3K
Stable
Rust
Build Tools
#parser#parser-combinators#byte-array

stanfordnlp/CoreNLP

CoreNLP is a comprehensive NLP toolkit that provides powerful language processing capabilities for Java developers.

10.1K
Active
Java
NLP Frameworks
Java
#natural-language-processing#named-entity-recognition#parsing

py-pdf/pypdf

A pure-Python library for manipulating PDF documents, including splitting, merging, cropping, and transforming pages.

9.8K
Active
Python
API Frameworks
#pdf#pdf-manipulation#pdf-parser

jsvine/pdfplumber

A Python library that provides a powerful API for extracting text and tables from PDF files.

9.8K
Active
Python
API Frameworks
Python
#pdf#pdf-parsing#table-extraction

phpDocumentor/ReflectionDocBlock

A PHP library for parsing and manipulating DocBlocks, which are essential for documenting code.

9.4K
Active
PHP
Documentation
#docblocks#documentation#php

github/semantic

A Haskell library for parsing, analyzing, and comparing source code across many programming languages.

9.1K
Experimental
Haskell
CLI Tools
Linters & Formatters
#source-code-analysis#language-agnostic#parsing

tobymao/sqlglot

A Python library for parsing and transpiling SQL queries across various databases and engines.

9.0K
Active
Python
API Frameworks
ORMs & Query Builders
Python
#sql-parser#sql-transpiler#database-abstraction

ljharb/qs

A robust and flexible query string parsing and serializing library for JavaScript projects.

8.9K
Active
JavaScript
API Clients & Testing
Backend Frameworks
Node
#url-parsing#query-strings#encoding

bytedance/Dolphin

Dolphin is a document image parsing library that uses heterogeneous anchor prompting for OCR and layout analysis.

8.9K
Stable
Python
Computer Vision
API Frameworks
Python
#document-analysis#layout-analysis#ocr

pdfcpu/pdfcpu

A high-performance PDF processor written in Go for tasks like parsing, manipulating, and converting PDF files.

8.5K
Active
Go
API Frameworks
Backend Frameworks
#pdf#pdf-processing#pdf-library

open-circle/valibot

A modular and type-safe schema library for validating structural data, focused on developer productivity.

8.5K
Active
TypeScript
API Clients & Testing
CLI Tools
TypeScript
#type-safe#modular#schema
13...20

Stay in the loop

Get weekly updates on trending AI coding tools and projects.