Trending Projects

Discover the fastest growing open source projects

Showing 851-897 of 897 trending projects

#851

crossfilter/crossfilter

Fast n-dimensional filtering and grouping of records, a powerful data manipulation library for JavaScript.

+0.5%

1.8K

total stars

JavaScript

#852

aergoio/litetree

SQLite with Branches - a lightweight, embedded database with version control capabilities.

+0.6%

1.6K

total stars

#853

distributedio/titan

A distributed, Redis-compatible NoSQL database that provides high performance and scalability.

+0.6%

1.4K

total stars

#854

eBay/akutan

A distributed knowledge graph store built in Go for managing large-scale semantic data.

+0.5%

1.7K

total stars

#855

yhat/pandasql

pandasql is a Python library that allows developers to use SQL syntax to query Pandas DataFrames.

+0.6%

1.3K

total stars

Python

#856

uber-archive/AthenaX

A scalable, SQL-based streaming analytics platform from Uber, built on top of Apache Flink.

+0.7%

1.2K

total stars

Java

#857

qri-io/qri

An open-source platform for building and sharing datasets, focused on trust, privacy, and decentralization.

+0.7%

1.1K

total stars

#858

reiinakano/scikit-plot

An intuitive Python library that adds plotting functionality to scikit-learn machine learning models

+0.3%

2.4K

total stars

Python

#859

jayinai/data-science-question-answer

A collection of data science related questions and answers for developers.

+0.3%

2.4K

total stars

Jupyter Notebook

#860

chris1610/pbpython

A collection of Python code, notebooks, and examples for practical business data analysis and visualization.

+0.3%

2.0K

total stars

Jupyter Notebook

#861

gchq/Gaffer

A large-scale entity and relation database supporting aggregation of properties for big data applications.

+0.4%

1.8K

total stars

Java

#862

apache/carbondata

CarbonData is a high-performance data store solution for big data analytics on Hadoop and Spark.

+0.5%

1.4K

total stars

Scala

#863

iskandr/fancyimpute

A Python library providing multivariate imputation and matrix completion algorithms.

+0.6%

1.3K

total stars

Python

#864

influxdata/influxdb-java

Java client library for connecting to the InfluxDB time series database.

+0.6%

1.2K

total stars

Java

#865

sryza/spark-timeseries

A library for time series analysis on Apache Spark, enabling efficient large-scale time series processing.

+0.6%

1.2K

total stars

Scala

#866

apachecn/pyda-2e-zh

A Chinese translation of the book 'Python for Data Analysis' 2nd Edition, covering NumPy, Pandas, and other data analysis tools.

+0.7%

1.1K

total stars

CSS

#867

zemirco/json2csv

Convert JSON to CSV with column titles

+0.2%

2.7K

total stars

JavaScript

#868

thbar/kiba

A data processing and ETL (Extract, Transform, Load) framework for Ruby developers.

+0.3%

1.8K

total stars

Ruby

#869

zonination/investing

This R library provides historical investment returns analysis for the overall stock market.

+0.3%

1.7K

total stars

#870

xiaoxu193/PyTeaser

A Python library that summarizes news articles by extracting the most important sentences.

+0.5%

1.2K

total stars

Python

#871

spark-notebook/spark-notebook

An interactive and reactive data science platform powered by Scala and Apache Spark.

+0.2%

3.2K

total stars

JavaScript

#872

orbitinghail/sqlsync

Collaborative offline-first SQLite wrapper for syncing app state across users & devices

+0.2%

2.9K

total stars

Rust

#873

json4s/json4s

A popular Scala library for parsing and manipulating JSON data in Scala applications.

+0.3%

1.5K

total stars

Scala

#874

mining/mining

A Python library for building business intelligence (BI) and OLAP solutions.

+0.4%

1.3K

total stars

Python

#875

joaoh82/rust_sqlite

A simple embedded database library in Rust modeled after SQLite, useful for Rust projects.

+0.5%

1.1K

total stars

Rust

#876

influxdata/influxdb-python

A Python client library for interacting with the InfluxDB time-series database.

+0.2%

1.7K

total stars

Python

#877

bububa/MongoHub-Mac

MongoHub is a native macOS MongoDB client that provides a GUI for managing and interacting with MongoDB databases.

+0.3%

1.2K

total stars

Objective-C

#878

eventql/eventql

Distributed, massively parallel SQL query engine for big data analytics and timeseries workloads.

+0.3%

1.2K

total stars

C++

#879

MarcosMeli/FileHelpers

A free and easy-to-use .NET library for reading and writing CSV and fixed-length data files.

+0.3%

1.2K

total stars

#880

Teradata/kylo

Kylo is an enterprise-grade data lake management platform built on big data technologies like Spark and Hadoop.

+0.4%

1.1K

total stars

Java

#881

PatMartin/Dex

Dex is a powerful data visualization tool that enables data exploration and publishing of web visualizations.

+0.2%

1.3K

total stars

JavaScript

#882

ycjuan/kaggle-2014-criteo

This is a C++ repository for a Kaggle competition in 2014, not a developer discovery platform.

+0.2%

1.3K

total stars

C++

#883

traildb/traildb

TrailDB is an efficient database for storing and querying series of events.

+0.3%

1.1K

total stars

#884

youngyangyang04/Skiplist-CPP

A lightweight key-value store built with C++ using a skiplist data structure.

+0.1%

2.4K

total stars

C++

#885

ngaut/builddatabase

A distributed SQL database built from scratch, not focused on vibe coders or AI tools.

+0.1%

2.1K

total stars

#886

Factual/drake

A data workflow tool for data engineers and analysts, similar to 'Make for data'.

+0.1%

1.5K

total stars

Clojure

#887

QueryKit/QueryKit

QueryKit is a simple CoreData query language for Swift and Objective-C developers.

+0.1%

1.5K

total stars

Swift

#888

fortunejs/fortune

Non-native graph database abstraction layer for Node.js and web browsers.

+0.1%

1.5K

total stars

JavaScript

#889

datasets/covid-19

This GitHub repository provides time series data on COVID-19 cases, useful for data analysis and visualization.

+0.1%

1.2K

total stars

Python

#890

yhat/rodeo

A data science IDE for Python, focused on providing a user-friendly environment for data analysis and visualization.

0.0%

3.9K

total stars

JavaScript

#891

neumino/thinky

An ORM for RethinkDB that provides an elegant and intuitive API for interacting with the database.

0.0%

1.1K

total stars

JavaScript

#892

huangzhibiao/BGFMDB

A simple Objective-C library that provides a one-line CRUD interface for SQLite databases on iOS.

-1

-0.1%

1.4K

total stars

Objective-C

#893

pomber/covid19

A public dataset of daily COVID-19 cases and deaths per country, useful for data analysis and visualization.

-1

-0.1%

1.2K

total stars

JavaScript

#894

apachecn/spark-doc-zh

This repository provides the official Apache Spark documentation in Chinese, a popular big data processing framework.

-1

-0.1%

1.2K

total stars

JavaScript

#895

geekinglcq/CDCS

A collection of solutions to Chinese data competitions, primarily using Python.

-2

-0.1%

1.8K

total stars

Python

#896

shaiwz/data-platform-open

A no-code, visual data integration platform for building big data pipelines and workflows.

-15

-1.4%

1.0K

total stars

Java

#897

shencangsheng/easydb_app

EasyDB is a lightweight desktop app that lets you query local CSV, Excel, and JSON files with SQL, without an external database.

-45

-4.3%

995

total stars

TypeScript

1...17

Stay in the loop

Get weekly updates on trending AI coding tools and projects.