Showing 651-700 of 897 trending projects
Tegola is an open-source Mapbox Vector Tile server written in Go, enabling efficient geospatial data visualization.
A distributed, scalable Prometheus-compatible time series database written in Scala.
A book that teaches the basics of using Redis, the in-memory data structure store.
TensorBase is a new big data warehousing solution built with Rust, focused on high-performance analytics.
QueryKit is a simple CoreData query language for Swift and Objective-C developers.
Non-native graph database abstraction layer for Node.js and web browsers.
A fast and efficient C++ hash map and hash set implementation using robin hood hashing.
Transporter is a powerful ETL tool that allows developers to sync data between various persistence engines.
CarbonData is a high-performance data store solution for big data analytics on Hadoop and Spark.
A simple Objective-C library that provides a one-line CRUD interface for SQLite databases on iOS.
A collection of simple tools for data cleaning and wrangling in R for data science tasks.
A Python library for performing multivariate exploratory data analysis, including techniques like PCA, CA, MCA, MFA, and FAMD.
A tool for comparing and evaluating databases for time series data.
A C# library for reading and writing metadata in media files, useful for audio and video processing applications.
A collection of portfolio projects by a data analyst.
An R package that provides support for simple features, a standardized way to encode spatial vector data.
A pure Go library for reading and writing Parquet files, a columnar data format.
A fast and versatile ETL tool that can transfer data between RDBMS and NoSQL databases seamlessly.
This repository provides code examples for Oracle's AI-enabled database features and integrations.
A Python tool that generates Entity Relationship Diagrams (ERDs) from SQLAlchemy models.
Firebird is a relational database management system (RDBMS) suitable for a wide range of applications from desktop to client-server to large databases.
PumpkinDB is an immutable, ordered key-value database engine written in Rust.
A powerful 3D visualization library for scientific data in Python.
An R package for Bayesian generalized multivariate non-linear multilevel models using Stan.
An end-to-end data engineering project example showcasing tools and technologies for building data pipelines.
A TypeScript toolkit for data transformation and analysis inspired by Pandas and LINQ.
First open-source data discovery and observability platform for data practitioners.
A book that teaches how to use Apache Spark for lightning-fast data analytics.
A simple, fast and versatile Datalog database written in Clojure for vibe coders.
A Python library for analyzing movement trajectory data using GeoPandas.
A curated list of Python packages for chemistry, including computational chemistry, molecular dynamics, and quantum chemistry.
Quilt is a data mesh for connecting people with actionable data, built with TypeScript.
An educational project to build a disk-based key-value store in Python for learning purposes.
A visual data preparation tool powered by Python, designed for data analysis and ETL tasks.
pandasql is a Python library that allows developers to use SQL syntax to query Pandas DataFrames.
A collection of PySpark examples covering RDD, DataFrame, and Dataset operations in Python.
A tool to easily import CSV and JSON data into PostgreSQL databases.
A curated list of awesome database libraries, resources, and tools for developers.
PDAL is a C++ library for processing point cloud data, similar to GDAL for raster data.
Python code for "Causal Inference: What If", a book by Miguel Hernán and James Robins.
A Python library that implements database internals from scratch, useful for learning database concepts.
A fast, hierarchical key-value storage engine written in C++ for applications that require high performance and scalability.
A fast, in-memory B-tree implementation for sorted collections in Swift.
A Python package for handling messy CSV files with improved dialect detection and a command-line interface.
Dex is a powerful data visualization tool that enables data exploration and publishing of web visualizations.
Modern database IDE for dev & data workflows, supporting MySQL, PostgreSQL & MongoDB.
A comprehensive resource for developers to learn and get started with data engineering using Python.
A Python library for extracting, transforming, and loading tabular data.
A Rust library that enables querying Excel spreadsheets using SQLite, making data extraction and analysis more efficient.
A command-line tool for version controlling database snapshots, allowing developers to save, restore, and archive database state.