Category
Showing 101-150 of 897 trending projects
This repository contains data on Chinese administrative divisions, including names, pinyin, and codes.
A comprehensive collection of 150+ Python programs for quantitative finance and stock market data analysis.
Framework for collecting and analyzing prediction market data with comprehensive Polymarket/Kalshi datasets.
ORM for TypeScript and JavaScript with support for multiple databases and platforms.
A fast, lightweight SQLite-based persistence layer with CloudKit synchronization for Swift developers.
Google's Operations Research tools for combinatorial optimization, linear programming, and operations research.
QuestDB is a high-performance, open-source, time-series database for real-time analytics and financial applications.
Portfolio analytics library for quantitative finance, built with Python
dvc is a data versioning and ML experiments tool that helps developers manage and track data and model changes.
A repository of data science interview questions and answers for developers.
A collection of Jupyter Notebook files for data analysis using Python, including a Chinese translation of the popular 'Python for Data Analysis' book.
Compilation of R and Python programming codes for data science and machine learning projects.
A Python library for crawling historical data of China stocks.
Apache Iceberg is an open-source table format for large analytic datasets, providing a versioned and scalable data lake architecture.
An exabyte-scale, multi-region distributed file system for developers building AI-powered applications.
A powerful customer data pipeline for collecting, processing, and analyzing user events and behavior.
A Python package for interactive geospatial analysis and visualization with Google Earth Engine.
Matplot++: A C++ graphics library for creating high-quality data visualizations and scientific plots.
Redis 6.0.20 through 8.0.0 for Windows, a popular open-source in-memory data structure store.
A quantitative research and stock analysis platform for finance professionals.
BuntDB is an embeddable, in-memory key/value database for Go with custom indexing and geospatial support.
An open-source framework for change data capture from various databases using Apache Kafka.
Fast, embedded graph database with vector search and full-text search, compatible with Cypher queries.
A Postgres extension for high-performance vector search, complementing pgvector for scale.
A Kotlin library for structured data processing, suitable for data analysis and data science tasks.
Apache Arrow is a fast columnar data format and toolset for in-memory analytics and data interchange.
GlobalBuildingAtlas is an open global and complete dataset of building polygons, heights and LoD1 3D models.
A repository of open-source data sets created for stories on The Pudding, a digital publication focused on data journalism.
A parallel processing library for Pandas that improves performance on multi-core CPUs.
A Python library for accurate and scalable fuzzy matching, record deduplication, and entity resolution.
A personal data aggregator and analysis tool for self-tracking and quantified self enthusiasts.
Apache Flink is a stream processing framework for real-time and batch data processing.
A free, open-source Python library for fetching real-time stock data from Chinese stock exchanges.
A high-performance GPU DataFrame library for data analysis and machine learning workloads.
A multi-page Streamlit app for geospatial data visualization and analysis, useful for housing and real estate applications.
A data access layer (DAL) and ORM-like library for working with SQL and NoSQL databases in Go.
A curated list of data engineering tools for software developers, not focused on AI coding tools.
A command-line tool to generate idiomatic Go code for SQL databases across multiple database engines.
A Python library for implementing the Louvain community detection algorithm on graphs.
A simple, fast, and embeddable key-value store written in Go that supports transactions and data structures.
Python wrapper for the TA-Lib technical analysis library, useful for financial pattern recognition.
A Python library to programmatically access and manage photos and metadata in the Apple Photos library on macOS.
networkx is a Python library for creating, manipulating, and studying the structure and dynamics of complex networks.
A comprehensive database of countries, states, and cities with data in multiple formats
Apache Cassandra is a distributed, wide-column store database system designed for high availability, scalability, and performance.
MongoDB-compatible database engine for cloud-native and open-source workloads with scalability and performance.
A curated list of awesome PostgreSQL software, libraries, tools and resources.
An open-source metadata platform for managing your data and AI stack across the enterprise.
A high-performance, embeddable key-value storage engine written in Rust for developers building data-intensive applications.
SciPy is a Python library for scientific and technical computing, providing a wide range of algorithms and tools.
Get weekly updates on trending AI coding tools and projects.