Explore Projects

Discover 386 open source projects

Active filters (1):
Search: extract

Showing 221-240 of 386 projects

html-to-text/node-html-to-text

An HTML-to-text converter for Node.js with robust, customizable text extraction.

1.7K
Archived
JavaScript
API Frameworks
General Utilities
Node
#html-to-text #text-extraction #email

eyurtsev/kor

A Python library for extracting structured data from text using large language models (LLMs).

1.7K
Experimental
Python
LLM Frameworks
LLM Wrappers & SDKs
Python
#llm #natural-language-processing #information-extraction

4paradigm/OpenMLDB

OpenMLDB is an open-source machine learning database that provides a feature platform for training and inference.

1.7K
Active
C++
ML Ops
Databases
#database-for-ai #feature-engineering #feature-extraction

streamio/streamio-ffmpeg

A simple and powerful Ruby wrapper for the FFmpeg multimedia framework, enabling media transcoding and metadata extraction.

1.7K
Archived
Ruby
API Frameworks
General Utilities
#ffmpeg #media-transcoding #video-processing

activescott/lessmsi

A tool to view and extract the contents of a Windows Installer (.msi) file.

1.7K
Active
C#
CLI Tools
Authentication
#msi #extract #install

HoshinoSuzumi/chronoframe

Self-hosted personal gallery app with online photo management, EXIF parsing, geolocation, and WebGL viewer.

1.7K
Active
Vue
Component Libraries (Vue/Svelte)
Frontend Frameworks
Vue
#photo-gallery #exif-extraction #geocoding

chrismattmann/tika-python

A Python binding to the Apache Tika REST service, enabling text extraction and parsing in Python.

1.6K
Experimental
Python
API Clients & Testing
Data Processing
Python
#text-extraction #text-processing #data-extraction

apurvsinghgautam/dark-web-osint-tools

A collection of OSINT tools for exploring the dark web, including scraping, search, and data extraction capabilities.

1.6K
Experimental
Security Research
Backend Frameworks
#osint #darkweb #scraping

bespokelabsai/curator

A Python library for synthetic data curation and structured data extraction for machine learning models.

1.6K
Active
Python
Synthetic Data
LLM Frameworks
Python
#machine-learning #data-generation #data-curation

Yimeng-Zhang/feature-engineering-and-feature-selection

A comprehensive guide to feature engineering and feature selection techniques in Python, with examples.

1.6K
Archived
Jupyter Notebook
Data Mining
Documentation
Jupyter Notebook
#feature-engineering #feature-selection #machine-learning

whwlsfb/JDumpSpider

A tool for extracting sensitive information from JVM heap dumps.

1.6K
Stable
Java
Spring Boot
#authentication #pentesting #heapdump

mhamilton723/FeatUp

A model-agnostic framework for extracting features from machine learning models at any resolution.

1.6K
Archived
Jupyter Notebook
LLM Frameworks
ML Ops
Jupyter Notebook
#machine-learning #model-interpretation #feature-extraction

meyda/meyda

A JavaScript library for real-time audio feature extraction, useful for music information retrieval and audio analysis.

1.6K
Archived
TypeScript
Audio Features
CLI Tools
JavaScript
#audio-processing #feature-extraction #music-information-retrieval

devnied/EMV-NFC-Paycard-Enrollment

A Java library for reading and extracting data from NFC EMV credit cards on Android and PCSC systems.

1.6K
Archived
Java
API Frameworks
Android
Android
#nfc #emv #credit-card

chriskite/anemone

Anemone is a Ruby web-spider framework for crawling and extracting data from websites.

1.6K
Archived
Ruby
Backend Frameworks
CLI Tools
Ruby
#web-crawler #data-extraction #web-scraping

myhhub/KnowledgeGraph

This Python project helps developers build knowledge graphs from scratch, including named entity recognition, relation extraction, and question answering.

1.6K
Archived
Python
Knowledge Representation
Databases
#knowledge-graph #named-entity-recognition #relation-extraction

boudinfl/pke

A Python module for automatic keyphrase extraction from documents.

1.6K
Archived
Python
Natural Language Processing
CLI Tools
Python
#computational-linguistics #information-retrieval #keyphrase-extraction

LeeeSe/MessAuto

A Rust tool that automatically extracts 2FA codes from iMessage and the Mail app on macOS.

1.6K
Stable
Rust
Authentication
CLI Tools
#2fa #sms #email

microsoft/OpenAPI.NET

The OpenAPI.NET SDK provides a useful object model and serializers to work with OpenAPI documents in .NET.

1.6K
Active
C#
API Documentation
API Clients & Testing
#openapi #http #documentation

echohive42/AI-reads-books-page-by-page

An AI-powered tool that extracts knowledge and generates summaries from PDF books, page by page.

1.6K
Archived
Python
LLM Frameworks
API Frameworks
Python
#pdf-extraction #knowledge-extraction #summarization