Trending Projects

Discover the fastest growing open source projects

Showing 851-897 of 897 trending projects

#851
crossfilter/crossfilter

Fast n-dimensional filtering and grouping of records, a powerful data manipulation library for JavaScript.

+9
+0.5%
1.8K
total stars
#852
aergoio/litetree

SQLite with Branches - a lightweight, embedded database with version control capabilities.

+9
+0.6%
1.6K
total stars
#853
distributedio/titan

A distributed, Redis-compatible NoSQL database that provides high performance and scalability.

+9
+0.6%
1.4K
total stars
#854
eBay/akutan

A distributed knowledge graph store built in Go for managing large-scale semantic data.

+8
+0.5%
1.7K
total stars
#855
yhat/pandasql

pandasql is a Python library that allows developers to use SQL syntax to query Pandas DataFrames.

+8
+0.6%
1.3K
total stars
#856
uber-archive/AthenaX

A scalable, SQL-based streaming analytics platform from Uber, built on top of Apache Flink.

+8
+0.7%
1.2K
total stars
#857
qri-io/qri

An open-source platform for building and sharing datasets, focused on trust, privacy, and decentralization.

+8
+0.7%
1.1K
total stars
#858
reiinakano/scikit-plot

An intuitive Python library that adds plotting functionality to scikit-learn machine learning models.

+7
+0.3%
2.4K
total stars
#859
jayinai/data-science-question-answer

A collection of data science related questions and answers for developers.

+7
+0.3%
2.4K
total stars
#860
chris1610/pbpython

A collection of Python code, notebooks, and examples for practical business data analysis and visualization.

+7
+0.3%
2.0K
total stars
#861
gchq/Gaffer

A large-scale entity and relation database supporting aggregation of properties for big data applications.

+7
+0.4%
1.8K
total stars
#862
apache/carbondata

CarbonData is a high-performance data store solution for big data analytics on Hadoop and Spark.

+7
+0.5%
1.4K
total stars
#863
iskandr/fancyimpute

A Python library providing multivariate imputation and matrix completion algorithms.

+7
+0.6%
1.3K
total stars
#864
influxdata/influxdb-java

Java client library for connecting to the InfluxDB time series database.

+7
+0.6%
1.2K
total stars
#865
sryza/spark-timeseries

A library for time series analysis on Apache Spark, enabling efficient large-scale time series processing.

+7
+0.6%
1.2K
total stars
#866
apachecn/pyda-2e-zh

A Chinese translation of the book 'Python for Data Analysis' 2nd Edition, covering NumPy, Pandas, and other data analysis tools.

+7
+0.7%
1.1K
total stars
#867
zemirco/json2csv

Converts JSON to CSV with column titles.

+6
+0.2%
2.7K
total stars
#868
thbar/kiba

A data processing and ETL (Extract, Transform, Load) framework for Ruby developers.

+6
+0.3%
1.8K
total stars
#869
zonination/investing

This R library provides historical investment returns analysis for the overall stock market.

+6
+0.3%
1.7K
total stars
#870
xiaoxu193/PyTeaser

A Python library that summarizes news articles by extracting the most important sentences.

+6
+0.5%
1.2K
total stars
#871
spark-notebook/spark-notebook

An interactive and reactive data science platform powered by Scala and Apache Spark.

+5
+0.2%
3.2K
total stars
#872
orbitinghail/sqlsync

A collaborative, offline-first SQLite wrapper for syncing app state across users and devices.

+5
+0.2%
2.9K
total stars
#873
json4s/json4s

A popular Scala library for parsing and manipulating JSON data.

+5
+0.3%
1.5K
total stars
#874
mining/mining

A Python library for building business intelligence (BI) and OLAP solutions.

+5
+0.4%
1.3K
total stars
#875
joaoh82/rust_sqlite

A simple embedded database library written in Rust and modeled after SQLite.

+5
+0.5%
1.1K
total stars
#876
influxdata/influxdb-python

A Python client library for interacting with the InfluxDB time-series database.

+4
+0.2%
1.7K
total stars
#877
bububa/MongoHub-Mac

MongoHub is a native macOS MongoDB client that provides a GUI for managing and interacting with MongoDB databases.

+4
+0.3%
1.2K
total stars
#878
eventql/eventql

Distributed, massively parallel SQL query engine for big data analytics and timeseries workloads.

+4
+0.3%
1.2K
total stars
#879
MarcosMeli/FileHelpers

A free and easy-to-use .NET library for reading and writing CSV and fixed-length data files.

+4
+0.3%
1.2K
total stars
#880
Teradata/kylo

Kylo is an enterprise-grade data lake management platform built on big data technologies like Spark and Hadoop.

+4
+0.4%
1.1K
total stars
#881
PatMartin/Dex

Dex is a powerful data visualization tool that enables data exploration and publishing of web visualizations.

+3
+0.2%
1.3K
total stars
#882
ycjuan/kaggle-2014-criteo

A C++ repository for a 2014 Kaggle competition.

+3
+0.2%
1.3K
total stars
#883
traildb/traildb

TrailDB is an efficient database for storing and querying series of events.

+3
+0.3%
1.1K
total stars
#884
youngyangyang04/Skiplist-CPP

A lightweight key-value store built with C++ using a skiplist data structure.

+2
+0.1%
2.4K
total stars
#885
ngaut/builddatabase

A distributed SQL database built from scratch.

+1
+0.1%
2.1K
total stars
#886
Factual/drake

A data workflow tool for data engineers and analysts, similar to 'Make for data'.

+1
+0.1%
1.5K
total stars
#887
QueryKit/QueryKit

QueryKit is a simple CoreData query language for Swift and Objective-C developers.

+1
+0.1%
1.5K
total stars
#888
fortunejs/fortune

Non-native graph database abstraction layer for Node.js and web browsers.

+1
+0.1%
1.5K
total stars
#889
datasets/covid-19

This GitHub repository provides time series data on COVID-19 cases, useful for data analysis and visualization.

+1
+0.1%
1.2K
total stars
#890
yhat/rodeo

A data science IDE for Python, focused on providing a user-friendly environment for data analysis and visualization.

0
0.0%
3.9K
total stars
#891
neumino/thinky

An ORM for RethinkDB that provides an elegant and intuitive API for interacting with the database.

0
0.0%
1.1K
total stars
#892
huangzhibiao/BGFMDB

A simple Objective-C library that provides a one-line CRUD interface for SQLite databases on iOS.

-1
-0.1%
1.4K
total stars
#893
pomber/covid19

A public dataset of daily COVID-19 cases and deaths per country, useful for data analysis and visualization.

-1
-0.1%
1.2K
total stars
#894
apachecn/spark-doc-zh

A Chinese translation of the official documentation for Apache Spark, a popular big data processing framework.

-1
-0.1%
1.2K
total stars
#895
geekinglcq/CDCS

A collection of solutions to Chinese data competitions, primarily using Python.

-2
-0.1%
1.8K
total stars
#896
shaiwz/data-platform-open

A no-code, visual data integration platform for building big data pipelines and workflows.

-15
-1.4%
1.0K
total stars
#897
shencangsheng/easydb_app

EasyDB is a lightweight desktop app that lets you query local CSV, Excel, and JSON files with SQL, without an external database.

-45
-4.3%
995
total stars