Extracts structured information from text using LLMs, with source grounding.
GraphRAG is a modular system for enhancing LLM outputs using knowledge graphs from unstructured text.
dvc is a data versioning and ML experiments tool that helps developers manage and track data and model changes.
Unstructured is an open-source ETL solution for transforming complex documents into structured data for language models.
Refine high-quality datasets and visual AI models with this Python library for active learning and data curation.
Builds a Neo4j graph from unstructured data using LLMs
A library for extracting knowledge graphs from unstructured text using the GPT-3 language model.
A system for agentic LLM-powered data processing and ETL workflows for unstructured data analysis.
A fast and simple framework for building neural data processing pipelines using Python.
Deep learning model for extracting and analyzing table structures from PDFs and images, with datasets.
A large collection of system log datasets for AI-driven log analytics.
A bootcamp for working with unstructured data, covering tasks such as reverse image search, audio search, and NLP.
syslog-ng is an enhanced log daemon supporting a wide range of input and output methods for logging and monitoring.
Instill Core is an open-source AI infrastructure tool for orchestrating data, models, and pipelines to build AI-powered applications.
Nomic Developer API SDK is a Python library that provides tools for clustering, duplicate detection, embeddings, and topic modeling on unstructured data.
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit.
A Python library for extracting data and LLM outputs from various document types with ease.
A high-performance, MySQL-compatible vector database that supports structured and unstructured data for AI-driven applications.
Powerful, fast, and efficient unstructured data extraction library written in Rust with language bindings.
A Python library for parsing unstructured US addresses into structured address components.