Showing 21-31 of 31 projects
A Python library that uses LLMs and embeddings to process datasets with up to 1000x speedups
A Python library that helps developers extract structured data from tricky documents using vision-language models.
A curated list of resources for Document Understanding (DU) related to machine learning and natural language processing.
Superlinked is a Python framework for building high-performance search & recommendation apps with structured and unstructured data.
A visual data preparation tool powered by Python, designed for data analysis and ETL tasks.
Interactively explore unstructured datasets like audio, images, and video using this TypeScript library.
An enterprise-grade, API-first LLM workspace for unstructured document processing, with features like data extraction, redaction, and prompt engineering.
A fast data versioning system for ML datasets, making it easy to version and track changes like code.
Contextualise is a powerful tool for organizing diverse information resources in knowledge-intensive projects.
An open-source Python library that helps curate better data for large language models (LLMs).
A high-performance vector search and full-text search database fork of ClickHouse, focused on use cases for AI and ML developers.
Get weekly updates on trending AI coding tools and projects.