Explore Projects

Discover 386 open source projects

Active filters (1):
Search: extractionร—
Clear all

Showing 141-160 of 386 projects

luin/readability

A library that turns any web page into a clean and readable view, useful for content extraction and read-later applications.

2.5K
Archived
HTML
Frontend Frameworks
API Frameworks
#readability#web-scraping#content-extraction

kevmo314/magic-copy

A Chrome extension that uses Meta's Segment Anything Model to extract and copy objects from images.

2.5K
Archived
TypeScript
Computer Vision
Component Libraries (React)
React
#computer-vision#image-processing#clipboard

omkarcloud/google-maps-scraper

A Google Maps scraper that extracts data like names, addresses, phone numbers, reviews, websites, and ratings.

2.5K
Active
Data & Databases
API Clients & Testing
#google-maps#scraping#lead-generation

lorien/grab

A powerful web scraping framework for Python that supports asynchronous crawling and flexible data extraction.

2.5K
Stable
Python
Backend Frameworks
CLI Tools
Python
#web-scraping#crawling#asynchronous

Anil-matcha/Open-Higgsfield-AI

A chatbot that allows you to chat with and extract information from PDF documents using language models and AI.

2.5K
Archived
Jupyter Notebook
LLM Frameworks
Agents & Orchestration
Jupyter Notebook
#chatbot#chatgpt#pdf

lgandx/PCredz

A tool that extracts various credentials and sensitive data from network packets or live interfaces.

2.4K
Active
Python
Security Research
CLI Tools
Python
#network-analysis#credential-extraction#pcap-parsing

onekey-sec/unblob

A Python library for extracting files from various container formats, useful for file system operations.

2.4K
Active
Python
CLI Tools
Filesystem
#archive#compression#extraction

dfd-tud/deda

A Python library for analyzing and extracting information from printer forensics data, including tracking dots.

2.4K
Archived
Python
CLI Tools
Security Research
#printer-forensics#tracking-dots#yellow-dots

j4k0xb/webcrack

A TypeScript library for deobfuscating, unminifying, and unpacking bundled JavaScript code.

2.4K
Stable
TypeScript
Reverse Engineering
Frontend Frameworks
TypeScript
#ast#deobfuscation#unminify

CyberZHG/keras-bert

A Keras implementation of BERT for feature extraction and prediction tasks.

2.4K
Archived
Python
LLM Frameworks
API Frameworks
Keras
#bert#language-model#feature-extraction

jameslyons/python_speech_features

This Python library provides common speech feature extraction functions for automatic speech recognition (ASR) tasks.

2.4K
Archived
Python
AI Voice & Speech
#speech-recognition#feature-extraction#mfcc

HarderThenHarder/transformers_tasks

A library of NLP algorithms and utilities for text classification, generation, extraction, and more using the Transformers library.

2.4K
Archived
Jupyter Notebook
LLM Frameworks
ORMs & Query Builders
PyTorch
#nlp#text-classification#text-generation

metarank/metarank

A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations.

2.4K
Stable
Scala
ML Ops
API Frameworks
Scala
#automl#personalization#ranking

Vibrant-Colors/node-vibrant

Vibrant-Colors/node-vibrant is a TypeScript library for extracting prominent colors from images.

2.4K
Active
TypeScript
Component Libraries (React)
React
#color#image-processing#canvas

DrizzleRisk/drizzleDumper

Android reverse engineering tool for unpacking apps via memory analysis and code extraction.

2.4K
Archived
Makefile
Security Research
Android
#android-unpack#memory-forensics#dynamic-analysis

mravanelli/pytorch-kaldi

A PyTorch-based project for developing state-of-the-art speech recognition systems using the Kaldi toolkit.

2.4K
Archived
Python
Speech Recognition
API Frameworks
PyTorch
#speech-recognition#deep-learning#kaldi

fhamborg/news-please

news-please is an integrated web crawler and information extractor for news that works out of the box.

2.4K
Stable
Python
API Frameworks
Web Crawlers
#news#web-crawler#data-extraction

unode/firefox_decrypt

A Python tool to extract passwords from Firefox, Thunderbird, and other Mozilla profiles.

2.4K
Active
Python
CLI Tools
Privacy Tools
#firefox#password-extraction#mozilla

symfony/cache-contracts

A set of cache abstractions extracted out of the Symfony components for PHP developers.

2.4K
Active
PHP
API Frameworks
CLI Tools
Symfony
#cache#contract#symfony

UglyToad/PdfPig

A C# library for reading and extracting text and other content from PDF files, ported from the Java PDFBox library.

2.4K
Stable
C#
API Frameworks
Databases
#pdf#pdf-extraction#document-analysis
1...79...20

Stay in the loop

Get weekly updates on trending AI coding tools and projects.