Showing 251-300 of 897 trending projects
lakeFS is a Git-like version control system for data lakes, enabling data engineers to manage data versioning and data quality.
A Python library with most common stock market technical indicators, making it easy to implement quantitative finance and algorithmic trading.
A Python library for 3D plotting and mesh analysis using the Visualization Toolkit (VTK).
A modular quantitative trading framework for algorithmic trading, backtesting, and financial analysis.
A comprehensive index of medical imaging datasets for researchers and developers working in the medical imaging field.
Mongoose is a MongoDB object modeling tool for Node.js and Deno, simplifying database interactions with schemas and models.
A beginner-friendly Python toolkit for financial data extraction, analysis, and automation.
MyBatis SQL Mapper for Java simplifies database interactions with object mapping.
A JavaScript library that allows you to run SQLite on the web, enabling local database functionality for web apps.
A Rust library for interacting with Delta Lake, a data lake storage format, with Python bindings.
mage-ai is a Python-based platform for building, running, and managing data pipelines that integrate and transform data.
A collection of study notes, ebooks, and resources on big data, machine learning, Linux, and more for developers.
Draco is a C++ library for compressing and decompressing 3D geometric meshes and point clouds.
Tonbo is an embedded database for serverless and edge runtimes, optimized for offline-first and big data use cases.
Official Rust implementation of the Apache Arrow data format for efficient data processing and storage.
Intake is a lightweight Python package for discovering, investigating, loading and distributing data.
A Python library for financial analysis and data scraping from the Finviz platform.
A comprehensive collection of resources and learning materials for big data technologies like Flink, Spark, Hadoop, and Hive.
Programmable CUDA/C++ GPU Graph Analytics library for high-performance parallel graph processing.
LiteDB is a lightweight, embedded NoSQL document database for .NET applications that stores its data in a single file.
A curated list of resources for time series forecasting, including papers, code, and other materials.
A fast, cost-effective tool for replicating data from Postgres to data warehouses, queues, and storage.
An open-source distributed SQL database with high availability, scalability, and ACID transactions.
A lightweight, document-oriented database optimized for happiness, used as a Python library or CLI.
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark.
Comprehensive collection of city and administrative region data for China, with features like CSV export, JS code generation, and web scraping.
A collection of Jupyter Notebook files focused on data visualization and machine learning concepts.
Presto is an open-source distributed SQL query engine for big data, allowing fast analysis of large datasets.
A Python library for scraping soccer data from various sources for sports analytics and data science.
A fast and flexible R package for reading flat files (CSV, TSV, fixed-width) into R data frames.
A Python toolbox for gaining geometric insights into high-dimensional data, useful for vibe coders working with AI tools.
A high-quality, cross-platform data plotting library for Rust developers, including WebAssembly support.
The LevelDB key-value database in the Go programming language.
Pentaho Data Integration is a Java-based tool for building ETL and data integration pipelines.
Apache Beam is a unified programming model for batch and streaming data processing.
A Python library for portfolio optimization using scikit-learn and convex optimization techniques.
A Python helper library for enhancing Jupyter Notebooks with data visualization and analysis capabilities.
ArcticDB is a high-performance, serverless DataFrame database for the Python data science ecosystem.
A SQL database explorer supporting multiple database engines like SQLite, PostgreSQL, and MySQL.
An ordered map implementation in Go with amortized O(1) performance for common operations.
A Rust library that provides multi-writer and CRDT support for SQLite databases.
A Python library to access historical market data from the Binance cryptocurrency exchange.
A definition and DDLs for the OMOP Common Data Model (CDM), a data model for healthcare data.
A Python library that provides a simple and unified interface for extracting text from any document format.
A collection of data analysis and machine learning projects and resources for developers.
A Python library for creating easy-to-use, visually appealing data tables and summaries.
SQLDelight generates type-safe Kotlin APIs from SQL, enabling easier database management in Kotlin projects.
Sample database for SQL Server, Oracle, MySQL, PostgreSQL, SQLite, and DB2.
WebAssembly version of the DuckDB analytical database, enabling fast in-browser analytics and SQL queries.
An open-source, scalable, and fault-tolerant NoSQL database with a focus on reliability and offline-first design.