Category
Showing 851-897 of 897 trending projects
Fast n-dimensional filtering and grouping of records, a powerful data manipulation library for JavaScript.
SQLite with Branches - a lightweight, embedded database with version control capabilities.
A distributed, Redis-compatible NoSQL database that provides high performance and scalability.
A distributed knowledge graph store built in Go for managing large-scale semantic data.
pandasql is a Python library that allows developers to use SQL syntax to query Pandas DataFrames.
A scalable, SQL-based streaming analytics platform from Uber, built on top of Apache Flink.
An open-source platform for building and sharing datasets, focused on trust, privacy, and decentralization.
An intuitive Python library that adds plotting functionality to scikit-learn machine learning models
A collection of data science related questions and answers for developers.
A collection of Python code, notebooks, and examples for practical business data analysis and visualization.
A large-scale entity and relation database supporting aggregation of properties for big data applications.
CarbonData is a high-performance data store solution for big data analytics on Hadoop and Spark.
A Python library providing multivariate imputation and matrix completion algorithms.
Java client library for connecting to the InfluxDB time series database.
A library for time series analysis on Apache Spark, enabling efficient large-scale time series processing.
A Chinese translation of the book 'Python for Data Analysis' 2nd Edition, covering NumPy, Pandas, and other data analysis tools.
A data processing and ETL (Extract, Transform, Load) framework for Ruby developers.
This R library provides historical investment returns analysis for the overall stock market.
A Python library that summarizes news articles by extracting the most important sentences.
An interactive and reactive data science platform powered by Scala and Apache Spark.
Collaborative offline-first SQLite wrapper for syncing app state across users & devices
A popular Scala library for parsing and manipulating JSON data in Scala applications.
A Python library for building business intelligence (BI) and OLAP solutions.
A simple embedded database library in Rust modeled after SQLite, useful for Rust projects.
A Python client library for interacting with the InfluxDB time-series database.
MongoHub is a native macOS MongoDB client that provides a GUI for managing and interacting with MongoDB databases.
Distributed, massively parallel SQL query engine for big data analytics and timeseries workloads.
A free and easy-to-use .NET library for reading and writing CSV and fixed-length data files.
Kylo is an enterprise-grade data lake management platform built on big data technologies like Spark and Hadoop.
Dex is a powerful data visualization tool that enables data exploration and publishing of web visualizations.
This is a C++ repository for a Kaggle competition in 2014, not a developer discovery platform.
TrailDB is an efficient database for storing and querying series of events.
A lightweight key-value store built with C++ using a skiplist data structure.
A distributed SQL database built from scratch, not focused on vibe coders or AI tools.
A data workflow tool for data engineers and analysts, similar to 'Make for data'.
QueryKit is a simple CoreData query language for Swift and Objective-C developers.
Non-native graph database abstraction layer for Node.js and web browsers.
This GitHub repository provides time series data on COVID-19 cases, useful for data analysis and visualization.
A data science IDE for Python, focused on providing a user-friendly environment for data analysis and visualization.
An ORM for RethinkDB that provides an elegant and intuitive API for interacting with the database.
A simple Objective-C library that provides a one-line CRUD interface for SQLite databases on iOS.
A public dataset of daily COVID-19 cases and deaths per country, useful for data analysis and visualization.
This repository provides the official Apache Spark documentation in Chinese, a popular big data processing framework.
A collection of solutions to Chinese data competitions, primarily using Python.
A no-code, visual data integration platform for building big data pipelines and workflows.
EasyDB is a lightweight desktop app that lets you query local CSV, Excel, and JSON files with SQL, without an external database.
Get weekly updates on trending AI coding tools and projects.