Category
Showing 851-897 of 897 trending projects
A distributed SQL database built from scratch, not focused on vibe coders or AI tools.
Crafty statistical graphics library for the Julia programming language
This Python repository contains code examples and notes for data analysis and mining.
A data processing and ETL (Extract, Transform, Load) framework for Ruby developers.
A collection of solutions to Chinese data competitions, primarily using Python.
A functional, type-safe, composable Scala data access library for Postgres databases.
A simple Objective-C library that provides a one-line CRUD interface for SQLite databases on iOS.
A TypeScript toolkit for data transformation and analysis inspired by Pandas and LINQ.
Dex is a powerful data visualization tool that enables data exploration and publishing of web visualizations.
A Python library for building business intelligence (BI) and OLAP solutions.
A data science and machine learning library for Go, providing DataFrame functionality similar to Python's Pandas.
A Python library providing multivariate imputation and matrix completion algorithms.
A portfolio of data science projects covering machine learning, NLP, and more for personal and academic use.
This repository provides the official Apache Spark documentation in Chinese, a popular big data processing framework.
A curated list of resources for the Hadoop ecosystem, not a developer discovery platform focused on vibe coders.
SciRuby/daru is a Ruby library for data analysis and manipulation, useful for data scientists and developers working with data.
A collection of study notes, ebooks, and resources on big data, machine learning, Linux, and more for developers.
The versioned, forkable, syncable database for developers who need a scalable, distributed data solution.
COVID-19 data repository for developers, providing daily updated case, death, and testing information.
esProc SPL is a JVM-based programming language for structured data computation, serving as both a data analysis tool and an embedded computing engine.
An intuitive Python library that adds plotting functionality to scikit-learn machine learning models
A tutorial for performing statistical data analysis using Python, covering topics like regression, hypothesis testing, and more.
A Python library for retrieving administrative division codes for China's GB/T 2260 standard.
A concise guide to the MongoDB NoSQL database for developers.
A simple embedded database library in Rust modeled after SQLite, useful for Rust projects.
Druid is a high-performance database connection pool for Java applications, designed for monitoring and management.
Apache HBase is a distributed, scalable, fault-tolerant database for large datasets built on top of HDFS.
A Python library that generates fake data for custom test databases.
A large-scale entity and relation database supporting aggregation of properties for big data applications.
Prisma1 is a database toolkit with an ORM, migrations, and admin UI for Postgres, MySQL, and MongoDB.
Apache Spark and Python tutorials for big data analysis and machine learning as Jupyter notebooks.
A collection of Python code examples and tutorials for data science, machine learning, and web development.
Azure Data Studio is a data management and development tool with connectivity to popular cloud and on-premises databases.
A collection of data science related questions and answers for developers.
A fast B+ tree indexing structure in C for efficient storage and retrieval of billions of key-value pairs.
Redis GUI client joining forces with Redis to enhance developer experience
A C# library for reading and writing CSV files, with support for a wide range of CSV file formats.
A Java connector for integrating MongoDB with Hadoop ecosystems for big data processing.
Grid Studio is a web-based application for data science with full integration of open source data science frameworks and languages.
Cloud-native, MySQL-compatible, AI-ready database with Git for Data, vector search, and full-text search capabilities.
Real-time global and U.S. data tracking for developers and researchers.
Open source SQL query assistant service for databases and data warehouses
WCDB is a cross-platform database framework developed by WeChat for Android, iOS, Linux, macOS, and Windows.
SSDB is a fast NoSQL database, an alternative to Redis, with support for leveldb and rocksdb backends.
A Java ORM SQL query builder that supports popular databases like ClickHouse, Impala, MySQL, and Presto.
EasyDB is a lightweight desktop app that lets you query local CSV, Excel, and JSON files with SQL, without an external database.
A no-code, visual data integration platform for building big data pipelines and workflows.
Get weekly updates on trending AI coding tools and projects.