Showing 1-20 of 27 projects
Conversational data analysis with LLMs using natural language queries on databases, CSVs, and data lakes.
QuestDB is a high-performance, open-source, time-series database for real-time analytics and financial applications.
Apache Arrow is a fast columnar data format and toolset for in-memory analytics and data interchange.
An open-source data format for building high-performance multimodal AI applications with fast random access, vector indexing, and data versioning.
High-performance data engine for AI and multimodal workloads, processing images, audio, video, and structured data at scale.
A command-line tool for running SQL queries against various data formats like JSON, CSV, Excel, and Parquet.
Blazing-fast data wrangling toolkit for AI and data engineering workflows
A desktop application for viewing and analyzing tabular data, with support for CSV, Parquet, and DuckDB.
A Rust-based library to create full-fledged APIs for slowly moving datasets without writing code.
Official Rust implementation of the Apache Arrow data format for efficient data processing and storage.
A lightweight Rust-based TUI application to view and query tabular data files like CSV, TSV, and Parquet.
Rill is a tool for transforming data sets into powerful dashboards using SQL, enabling BI-as-code.
Apache Parquet Format, a columnar data storage format used in the Apache Hadoop ecosystem.
A Rust-based library that provides real-time analytics on Postgres tables, supporting features like columnstore, delta-lake, and Iceberg.
Petastorm enables training and evaluation of deep learning models from Apache Parquet datasets.
A large-scale entity and relation database supporting aggregation of properties for big data applications.
A C++20 library for fast serialization, deserialization and validation using reflection, supporting multiple data formats.
Apache Kafka-compatible broker with support for S3, PostgreSQL, SQLite, Apache Iceberg, and Delta Lake.
cryo is a Rust library for extracting blockchain data to parquet, CSV, JSON, or Python dataframes.
A fast, embeddable column database written in Go, optimized for AI/ML workloads.
Get weekly updates on trending AI coding tools and projects.