Category
Showing 101-150 of 897 trending projects
Python wrapper for the TA-Lib technical analysis library, useful for financial pattern recognition.
A space-efficient C++ implementation of the Cuckoo filter, a probabilistic data structure for set membership testing.
A comprehensive database of countries, states, and cities with data in multiple formats
Open-source, cloud-native, unified observability database for metrics, logs and traces, supporting SQL/PromQL/Streaming.
Apache Flink is a stream processing framework for real-time and batch data processing.
Open-source relational database management system (RDBMS) for building data-driven applications.
Dexie.js is a minimalistic IndexedDB wrapper that simplifies offline storage and database management in web applications.
networkx is a Python library for creating, manipulating, and studying the structure and dynamics of complex networks.
A Python library for crawling historical data of China stocks.
A repository of data science interview questions and answers for developers.
A comprehensive collection of data science cheatsheets for developers and data scientists.
An open-source data catalog platform for building a high-performance, federated metadata lake.
A high-performance NoSQL data store compatible with Apache Cassandra and Amazon DynamoDB.
High-performance time-series database for IoT and IIoT
An open-source framework for change data capture from various databases using Apache Kafka.
A Python library for creating circular data visualizations like Circos plots, chord diagrams, and radar charts.
SnappyData is a memory-optimized analytics database based on Apache Spark and Apache Geode, enabling real-time stream processing, transactions, and predictive analytics.
A lightweight data processing framework built on DuckDB and 3FS for vibe coders working with AI tools.
Apache Fluss is a real-time streaming storage platform built for big data analytics.
A collection of efficient Python tricks and tools for data scientists to improve their productivity.
Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy
An open-source metadata platform for managing your data and AI stack across the enterprise.
A geospatial data library for Ruby that provides a set of tools for working with geographic data.
Idempotent schema management tool for MySQL, PostgreSQL, SQLite, and SQL Server databases.
A curated list of awesome PostgreSQL software, libraries, tools and resources.
A Rust library to work with the Arrow data format, without requiring the Transmute crate.
A Python library for quantitative trading and stock analysis.
SciPy is a Python library for scientific and technical computing, providing a wide range of algorithms and tools.
A curated list of data engineering tools for software developers, not focused on AI coding tools.
Open-source data warehouse learning project with examples and code for building real-time and offline data pipelines.
Data quality assessment and reporting tool for data frames and database tables in R
FoundationDB is an open-source, distributed, transactional key-value store that provides ACID guarantees.
Citus is a distributed PostgreSQL database that enables scaling out your Postgres database across multiple nodes.
A powerful Python library for record linkage and duplicate detection in data-driven applications.
efinance is a Python library for quickly accessing financial data (funds, stocks, bonds, futures) and backtesting/quantitative trading.
A lightweight, fault-tolerant distributed database built on SQLite, designed for high availability.
A comprehensive English word database with translations, parts of speech, and definitions for developers.
Argo Workflows is a powerful open-source workflow engine for Kubernetes, enabling complex data processing and machine learning pipelines.
db.py is a Python library that provides an easier way to interact with your databases.
A library for calling Python functions from the Ruby language, enabling data science and ML workflows.
A command-line tool to generate idiomatic Go code for SQL databases across multiple database engines.
A Python library to programmatically access and manage photos and metadata in the Apple Photos library on macOS.
This is a Python library focused on basketball analytics and data processing.
A comprehensive collection of 150+ Python programs for quantitative finance and stock market data analysis.
Database manager for multiple database engines, runs as desktop or web app.
A fast, lightweight SQLite-based persistence layer with CloudKit synchronization for Swift developers.
This repository contains data on Chinese administrative divisions, including names, pinyin, and codes.
An educational distributed SQL database written in Rust, not focused on AI coding tools.
A library that allows developers to use LINQ to retrieve data from spreadsheets and CSV files.
A collection of notebooks covering quantitative finance and numerical methods in Python.
Get weekly updates on trending AI coding tools and projects.