Explore Projects

Discover 386 open source projects

Active filters (1):
Search: extract

Showing 81-100 of 386 projects

jlegewie/zotfile

Zotero plugin to manage PDF attachments, including renaming, moving, syncing, and extracting annotations.

4.3K
Archived
JavaScript
General Utilities
#pdf-management #zotero-plugin #pdf-annotations

run-llama/llama_cloud_services

A set of TypeScript-based cloud services and utilities for processing and extracting structured data from various document formats.

4.2K
Active
TypeScript
File Storage
Caching
TypeScript
#document-parsing #pdf-processing #structured-data

google/mtail

A Go-based instrumentation tool that extracts internal monitoring data from application logs for collection in a timeseries database.

4.0K
Stable
Go
CLI Tools
Monitoring
#monitoring #observability #timeseries

webpack-contrib/extract-text-webpack-plugin

A deprecated webpack plugin that extracts CSS into a separate file for better performance.

4.0K
Archived
JavaScript
Component Libraries (React)
Frontend Frameworks
React
#css #webpack #performance

cozmo/jsQR

A pure JavaScript QR code reading library that can locate, extract, and parse QR codes from raw images.

4.0K
Archived
TypeScript
Component Libraries (React)
React
#qr #qr-code #qr-parsing

attardi/wikiextractor

A Python tool for extracting plain text from Wikipedia dumps, useful for natural language processing tasks.

4.0K
Archived
Python
API Frameworks
ETL & Pipelines
Python
#wikipedia #text-extraction #nlp

snipsco/snips-nlu

Snips NLU is a Python library for extracting meaning from text using natural language processing and machine learning.

4.0K
Archived
Python
NLP
API Frameworks
Python
#natural-language-processing #intent-classification #named-entity-recognition

modelscope/ClearerVoice-Studio

An open-source toolkit for speech processing, supporting enhancement, separation, and target speaker extraction.

4.0K
Stable
Python
AI Voice & Speech
PyTorch
#speech-enhancement #speech-separation #speaker-extraction

symfony/contracts

A set of abstractions for building interoperable PHP components and libraries.

3.9K
Active
PHP
API Frameworks
CLI Tools
Symfony
#abstractions #interoperability #decoupling

mandiant/flare-floss

Automatically extracts obfuscated strings from malware using the FLARE Obfuscated String Solver (FLOSS).

3.9K
Active
Python
Python
#malware-analysis #deobfuscation #strings

chrippa/livestreamer

A command-line utility that extracts streams from various services and pipes them into a video player.

3.9K
Archived
Python
API Frameworks
CLI Tools
Python
#streaming #video #command-line

reactjs/react-docgen

A CLI and library to extract information from React component files for documentation generation.

3.8K
Active
TypeScript
Component Libraries (React)
Documentation
React
#react #documentation #cli

AloneMonkey/frida-ios-dump

This tool allows developers to extract decrypted iOS app binaries from jailbroken devices for reverse engineering and security research.

3.8K
Archived
JavaScript
Security Research
iOS
JavaScript
#decrypted #ipa #reverse-engineering

DedSecInside/TorBot

DedSecInside/TorBot is a dark web OSINT tool written in Python that crawls and extracts information from the Tor network.

3.8K
Active
Python
Security Research
API Frameworks
#osint #dark-web #tor

megadose/toutatis

Toutatis is an open-source tool for extracting information from Instagram accounts, including emails and phone numbers.

3.8K
Archived
Python
Backend & APIs
CLI Tools
Python
#information-gathering #instagram-scraper #open-source-intelligence

kepano/defuddle

A TypeScript library that extracts the main content from web pages, useful for content extraction and parsing tasks.

3.7K
Active
TypeScript
Backend & APIs
API Clients & Testing
TypeScript
#web-scraping #content-extraction #parsing

atlanhq/camelot

Camelot is a Python library for extracting tables from PDF files, making it easier for developers to work with PDF data.

3.7K
Archived
Python
API Frameworks
CLI Tools
Python
#pdf #table-extraction #data-processing

miso-belica/sumy

A Python module for automatic summarization of text documents and HTML pages.

3.7K
Stable
Python
NLP
Backend Frameworks
Python
#html-extraction #text-summarization #nlp

aubio/aubio

An audio and music analysis library that extracts features such as onsets, pitch, and tempo from audio signals.

3.6K
Stable
C
Audio Analysis
Libraries
#audio #music #analysis

camelot-dev/camelot

A Python library for extracting tabular data from PDF files, useful for data processing and analysis.

3.6K
Active
Python
Databases
API Frameworks
#pdf #data-extraction #tabular-data