Explore Projects

Discover 385 open source projects

Active filters (1):
Search: extractionร—
Clear all

Showing 341-360 of 385 projects

oxylabs/how-to-bypass-amazon-captcha

A Python library for bypassing Amazon CAPTCHAs using the Oxylabs Amazon Scraper API.

1.1K
Stable
Python
API Clients & Testing
Backend Frameworks
Python
#amazon#captcha-solving#scraper-api

mateogianolio/ocr

A neural network-based OCR library for JavaScript, useful for building document scanning and text extraction features.

1.1K
Archived
JavaScript
Computer Vision
Frontend Frameworks
React
#ocr#text-extraction#computer-vision

r35tart/RW_Password

This Python project extracts and collects strong and weak passwords from previously leaked password data.

1.1K
Archived
Python
Security Research
#password-extraction#password-analysis#security-research

SciresM/hactool

This is a tool to view, decrypt, and extract common file formats for the Nintendo Switch.

1.1K
Archived
C
API Frameworks
CLI Tools
#nintendo-switch#file-formats#decryption

ozguralp/gmapsapiscanner

This Python library provides a tool to scan and extract data from the Google Maps API.

1.1K
Active
Python
API Clients & Testing
Backend Frameworks
Python
#google-maps-api#scraping#data-extraction

fighting41love/cocoNLP

A Chinese natural language processing library for information extraction tasks.

1.1K
Archived
Python
Natural Language Processing
#chinese#nlp#information-extraction

kohlschutter/boilerpipe

A Java library for extracting the main content from web pages, useful for content extraction tasks.

1.1K
Archived
Java
Backend Frameworks
Caching
#web-scraping#content-extraction#java

mui/pigment-css

Pigment CSS is a zero-runtime CSS-in-JS library that extracts styles to separate CSS files at build time.

1.1K
Stable
TypeScript
React
#CSS-in-JS#Build-time#Zero-runtime

da03/Attention-OCR

Attention-based OCR library for building vision AI apps that extract text from images.

1.1K
Archived
Python
Computer Vision
Backend Frameworks
Python
#ocr#computer-vision#text-extraction

kdzwinel/SnappySnippet

A Chrome extension that allows easy extraction of CSS and HTML from selected elements.

1.1K
Archived
CSS
Component Libraries (React)
Frontend Frameworks
React
#css#html#web-development

anestisb/vdexExtractor

A tool to decompile and extract Android Dex bytecode from Vdex files for Android developers.

1.1K
Archived
C
Android
CLI Tools
#android#bytecode#decompiler

tatuylonen/wiktextract

A Python library for parsing and extracting multilingual data from Wiktionary dump files.

1.1K
Active
Python
CLI Tools
Databases
#wiktionary#multilingual#parser

cypherpunk-symposium/dark-forest-toolkit

A toolkit for understanding and mitigating the impact of blockchain's maximal extractable value (MEV)

1.1K
Stable
Shell
DeFi
API Frameworks
#blockchain#defi#mev

fengsp/color-thief-py

A Python library that extracts the dominant color or color palette from images using the Pillow library.

1.1K
Archived
Python
Component Libraries (React)
React
#image-processing#color-extraction#color-palette

docker/metadata-action

A GitHub Action that extracts metadata (tags, labels) from Git references and GitHub events for Docker.

1.1K
Active
TypeScript
CI/CD
Containerization
Node
#docker#github-actions#metadata

bytebuff/JSpider

JSpider is a JavaScript web scraping tool that automatically decrypts and extracts data from websites.

1.1K
Archived
JavaScript
Backend Frameworks
CLI Tools
Node.js
#web-scraping#javascript#nodejs

TheAiSingularity/graphrag-local-ollama

Local support for Microsoft's graphrag using ollama (llama3, mistral, gemma2 phi3) - LLM & Embedding extraction

1.1K
Archived
Python
LLM Frameworks
API Frameworks
Python
#llm#embedding#vector-database

sibprogrammer/xq

Command-line tool for beautifying and extracting content from XML and HTML files.

1.1K
Active
Go
CLI Tools
Backend Frameworks
#cli#xml#html

SSShooter/ebook-to-mindmap

AI-powered tool for extracting content summaries from EPUB and PDF books.

1.1K
Active
TypeScript
LLM Wrappers & SDKs
Backend Frameworks
TypeScript
#epub#pdf#book-summary

yangheng95/PyABSA

A comprehensive library for sentiment analysis, text classification, and text adversarial defense, tailored for AI-powered developers.

1.1K
Active
Jupyter Notebook
Agents & Orchestration
Inference
PyTorch
#sentiment-analysis#text-classification#text-augmentation
1...171920

Stay in the loop

Get weekly updates on trending AI coding tools and projects.