Explore Projects

Discover 390 open source projects

Active filters (1):
Search: parsingร—
Clear all

Showing 41-60 of 390 projects

ericchiang/pup

A command-line tool for parsing HTML, useful for web scraping and data extraction tasks.

8.4K
Archived
HTML
Backend Frameworks
CLI Tools
Node.js
#web-scraping#data-extraction#html-parsing

rednote-hilab/dots.ocr

A multilingual document layout parsing model that can extract text, images, and structure from documents in a single vision-language model.

7.9K
Stable
Python
Computer Vision
Component Libraries (React)
React
#document-parsing#ocr#layout-extraction

mishoo/UglifyJS-old

A comprehensive JavaScript toolchain for parsing, minifying, and beautifying JavaScript code.

7.8K
Archived
JavaScript
Backend Frameworks
Build Tools
Node
#javascript#minifier#compressor

icindy/wxParse

A library for parsing HTML and Markdown content in WeChat mini-programs.

7.8K
Archived
JavaScript
Component Libraries (React)
Frontend Frameworks
React
#html#markdown#weapp

stanfordnlp/stanza

A powerful Python NLP library for tokenization, sentence segmentation, named entity recognition, and parsing of many languages.

7.7K
Active
Python
NLP Frameworks
API Frameworks
PyTorch
#natural-language-processing#machine-learning#deep-learning

malcommac/SwiftDate

SwiftDate is a toolkit for parsing, validating, manipulating, comparing, and displaying dates, time, and timezones in Swift.

7.7K
Archived
Swift
Date & Time
#date#date-formatting#date-time

arktypeio/arktype

A TypeScript library for runtime type validation, optimized for editor-to-runtime performance.

7.6K
Active
TypeScript
API Clients & Testing
Linters & Formatters
TypeScript
#typescript#parsing#runtime-typechecking

mgdm/htmlq

A Rust library for parsing and manipulating HTML, similar to jq.

7.5K
Archived
Rust
Component Libraries (React)
React
#html-parser#rust-library#frontend-tool

tabulapdf/tabula

Tabula is a tool for extracting data from PDF files, allowing developers to easily parse and extract tables.

7.3K
Experimental
CSS
API Frameworks
ETL & Pipelines
#pdf#scraping#data-extraction

QuivrHQ/MegaParse

Optimized file parser for LLM ingestion with no loss, supporting PDFs, Docx, and PPTx.

7.3K
Experimental
Python
React
#LLM#parser#PDF

jquery/esprima

esprima is an ECMAScript parsing infrastructure for multipurpose analysis, focused on JavaScript parsing and AST generation.

7.1K
Archived
TypeScript
API Clients & Testing
Backend Frameworks
#ast#ecmascript#esprima

go-yaml/yaml

A Go library providing YAML parsing and serialization capabilities for developers.

7.0K
Experimental
Go
API Frameworks
CLI Tools
Go
#yaml#configuration#serialization

pdfminer/pdfminer.six

A community-maintained fork of the PDF parsing library pdfminer for Python developers.

6.9K
Active
Python
API Frameworks
Backend Frameworks
Python
#pdf#parser#python

sindresorhus/query-string

A powerful utility for parsing and stringifying URL query strings in JavaScript.

6.9K
Stable
JavaScript
API Development
General Utilities
Node
#query-string#url#parse

adithya-s-k/omniparse

A Python library for ingesting, parsing, and optimizing any data format for enhanced compatibility with GenAI frameworks.

6.8K
Stable
Python
LLM Frameworks
File Storage
Python
#ingestion-api#ocr#parser-library

doctrine/annotations

Annotations Docblock Parser, a PHP library for parsing and working with annotations in docblocks.

6.7K
Stable
PHP
API Frameworks
Documentation
PHP
#annotations#docblock#parser

Yuliang-Liu/MonkeyOCR

A lightweight LMM-based Document Parsing Model for developers working with AI tools.

6.5K
Active
Python
Computer Vision
API Clients & Testing
Python
#document-parsing#ocr#ai-tools

caarlos0/env

A simple, zero-dependencies Go library to parse environment variables into structs.

6.0K
Active
Go
CLI Tools
Authentication
#config#configuration#environment

google/uuid

A Go package for generating and parsing UUIDs, based on RFC 4122 and DCE 1.1 standards.

6.0K
Archived
Go
General Utilities
#uuid#rfc4122#dce

JSQLParser/JSqlParser

JSqlParser is a Java library that parses SQL statements and converts them into a hierarchical Java object representation.

5.9K
Active
Java
API Frameworks
ORMs & Query Builders
Java
#sql#parser#ast
124...20

Stay in the loop

Get weekly updates on trending AI coding tools and projects.