A database modeling language (DBML) that helps define and document database structures.
Windows builds of Redis (versions 6.0.20 through 8.0.0), a popular open-source in-memory data structure store.
Blazing-fast data wrangling toolkit for AI and data engineering workflows.
An open-source data modeling tool designed for PostgreSQL, allowing developers to generate DDL commands visually.
A SQL database explorer supporting multiple database engines like SQLite, PostgreSQL, and MySQL.
A comprehensive index of medical imaging datasets for researchers and developers working in the medical imaging field.
Fluent Migrator is a .NET migration framework for managing database schema changes across multiple database providers.
A desktop application for viewing and analyzing tabular data, with support for CSV, Parquet, and DuckDB.
Olric is a distributed, in-memory key/value store and cache for Go applications and services.
Official Rust implementation of the Apache Arrow data format for efficient data processing and storage.
Koalas is a pandas-like API for Apache Spark, enabling data scientists to work with big data using familiar pandas syntax.
A Python library to programmatically access and manage photos and metadata in the Apple Photos library on macOS.
efinance is a Python library for quickly accessing financial data (funds, stocks, bonds, futures) for use in backtesting and quantitative trading.
SQL query builder for C# developers, supporting multiple databases and complex queries.
A Go driver for the ClickHouse analytics database, enabling fast and efficient data processing.
DataSphereStudio is a one-stop data application development and management portal covering data exchange, analysis, and visualization.
A LINQ-to-database provider for .NET, supporting various database engines.
A code repository for a book on practical statistics for data scientists.
Apache Avro is a data serialization system for efficient storage and transmission of structured data.
MongoDB-compatible database engine for cloud-native and open-source workloads with scalability and performance.
SQLite JDBC Driver: a Java library for accessing SQLite databases.
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark.
An acoustic spectrum analyzer library written in C++ for audio analysis and visualization.
A Python tool to convert CAJ (China Academic Journals) files to PDF for developers who work with academic literature.
A free, open-source Python library for fetching real-time stock data from Chinese stock exchanges.
A curated collection of resources for data science and machine learning enthusiasts.
A Python library for extracting data from a wide range of internet sources into a pandas DataFrame.
A curated list of tools and datasets for anomaly detection on time-series data.
A Rust library for interacting with Delta Lake, a data lake storage format, with Python bindings.
An interactive and reactive data science platform powered by Scala and Apache Spark.
Open-source repository for sharing code related to the MIMIC family of critical care databases.
A Java-based database subsetting and relational data browsing tool for popular databases.
A cloud-native PostgreSQL database developed by Alibaba Cloud for high-performance, scalable data storage and management.
An open-source global repository of address, building, and parcel data for developers and geospatial applications.
Python scripts for extracting, transforming and loading Ethereum blockchain data into Google BigQuery.
A high-performance datastore for time series and tick data built on top of MongoDB.
An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks.
A curated "awesome list" of open-source data engineering projects for developers.
Idempotent schema management tool for MySQL, PostgreSQL, SQLite, and SQL Server databases.
OpenMapTiles is an open-source vector tile schema implementation for creating custom map tiles.
Synthea is an open-source synthetic patient population simulator for generating realistic healthcare data.
Fast, cost-effective tool for replicating data from Postgres to data warehouses, queues, and storage.
A Python library for comparing data across databases, supporting various database engines.
Comprehensive dataset of China's administrative divisions (province, city, county, town) in JSON, CSV, and SQL formats.
An open-source project that captures the public GitHub timeline and makes it accessible for analysis.
A highly scalable, high-performance graph database that supports over 100 billion data points.
A versatile app for querying, scripting, and visualizing data from various databases, files, and APIs.
A Python library for downloading, parsing, and analyzing health data from Garmin, FitBit, and MS Health.
An open-source dev data platform to ingest, analyze, and visualize data from DevOps tools for engineering insights.
A Go library for creating high-quality plots and visualizations of data.