Trending Projects

Discover the fastest growing open source projects

Showing 851-897 of 897 trending projects

#851
youngyangyang04/Skiplist-CPP

A lightweight key-value store built with C++ using a skiplist data structure.

+2
+0.1%
2.4K
total stars
#852
chris1610/pbpython

A collection of Python code, notebooks, and examples for practical business data analysis and visualization.

+2
+0.1%
2.0K
total stars
#853
crossfilter/crossfilter

Fast n-dimensional filtering and grouping of records, a powerful data manipulation library for JavaScript.

+2
+0.1%
1.8K
total stars
#854
cmu-db/noisepage

Self-Driving Database Management System from Carnegie Mellon University

+2
+0.1%
1.8K
total stars
#855
influxdata/influxdb-python

A Python client library for interacting with the InfluxDB time-series database.

+2
+0.1%
1.7K
total stars
#856
YelpArchive/dataset-examples

Sample datasets for users of the Yelp Academic Dataset, useful for data analysis and machine learning.

+2
+0.2%
1.3K
total stars
#857
attic-labs/noms

The versioned, forkable, syncable database for developers who need a scalable, distributed data solution.

+1
+0.0%
7.4K
total stars
#858
zemirco/json2csv

Convert JSON to CSV with column titles

+1
+0.0%
2.7K
total stars
#859
nicolaspanel/numjs

A JavaScript library that provides a NumPy-like interface for working with multi-dimensional arrays and matrices.

+1
+0.0%
2.5K
total stars
#860
GiovineItalia/Gadfly.jl

Crafty statistical graphics library for the Julia programming language

+1
+0.1%
1.9K
total stars
#861
apachecn/python_data_analysis_and_mining_action

This Python repository contains code examples and notes for data analysis and mining.

+1
+0.1%
1.8K
total stars
#862
zonination/investing

This R library provides historical investment returns analysis for the overall stock market.

+1
+0.1%
1.7K
total stars
#863
eBay/akutan

A distributed knowledge graph store built in Go for managing large-scale semantic data.

+1
+0.1%
1.7K
total stars
#864
re-data/re-data

A data quality and observability tool for monitoring and fixing data issues before they become problems.

+1
+0.1%
1.6K
total stars
#865
json4s/json4s

A popular Scala library for parsing and manipulating JSON data in Scala applications.

+1
+0.1%
1.5K
total stars
#866
PatMartin/Dex

Dex is a powerful data visualization tool that enables data exploration and publishing of web visualizations.

+1
+0.1%
1.3K
total stars
#867
zhihu/kids

A C++ library for processing data streams, potentially useful for vibe coders working with AI-powered tools.

+1
+0.1%
1.2K
total stars
#868
li6185377/LKDBHelper-SQLite-ORM

An automatic database ORM library for Objective-C that provides thread-safe and deadlock-free database operations.

+1
+0.1%
1.2K
total stars
#869
machow/siuba

Python library for using dplyr-like syntax with pandas and SQL databases

+1
+0.1%
1.2K
total stars
#870
pentaho/mondrian

Mondrian is an OLAP server that enables real-time analysis of large data sets for business users.

+1
+0.1%
1.2K
total stars
#871
Teradata/kylo

Kylo is an enterprise-grade data lake management platform built on big data technologies like Spark and Hadoop.

+1
+0.1%
1.1K
total stars
#872
prisma/prisma1

Prisma1 is a database toolkit with an ORM, migrations, and admin UI for Postgres, MySQL, and MongoDB.

0
0.0%
16.4K
total stars
#873
spark-notebook/spark-notebook

An interactive and reactive data science platform powered by Scala and Apache Spark.

0
0.0%
3.2K
total stars
#874
begeekmyfriend/bplustree

A fast B+ tree indexing structure in C for efficient storage and retrieval of billions of key-value pairs.

0
0.0%
1.9K
total stars
#875
cgarciae/pypeln

Concurrent data pipelines in Python for building efficient and scalable data processing workflows.

0
0.0%
1.6K
total stars
#876
yhat/pandasql

pandasql is a Python library that allows developers to use SQL syntax to query Pandas DataFrames.

0
0.0%
1.3K
total stars
#877
influxdata/influxdb-java

Java client library for connecting to the InfluxDB time series database.

0
0.0%
1.2K
total stars
#878
mahmoudparsian/data-algorithms-book

This repository provides a comprehensive guide and implementations for data algorithms using MapReduce, Spark, Java, and Scala.

0
0.0%
1.1K
total stars
#879
yhat/rodeo

A data science IDE for Python, focused on providing a user-friendly environment for data analysis and visualization.

-1
-0.0%
3.9K
total stars
#880
plant99/felicette

A Python library for processing and visualizing satellite imagery data.

-1
-0.1%
1.8K
total stars
#881
gchq/Gaffer

A large-scale entity and relation database supporting aggregation of properties for big data applications.

-1
-0.1%
1.8K
total stars
#882
thbar/kiba

A data processing and ETL (Extract, Transform, Load) framework for Ruby developers.

-1
-0.1%
1.8K
total stars
#883
mining/mining

A Python library for building business intelligence (BI) and OLAP solutions.

-1
-0.1%
1.3K
total stars
#884
iskandr/fancyimpute

A Python library providing multivariate imputation and matrix completion algorithms.

-1
-0.1%
1.3K
total stars
#885
ycjuan/kaggle-2014-criteo

This is a C++ repository for a Kaggle competition in 2014, not a developer discovery platform.

-1
-0.1%
1.3K
total stars
#886
datasets/covid-19

This GitHub repository provides time series data on COVID-19 cases, useful for data analysis and visualization.

-1
-0.1%
1.2K
total stars
#887
neumino/thinky

An ORM for RethinkDB that provides an elegant and intuitive API for interacting with the database.

-1
-0.1%
1.1K
total stars
#888
apachecn/pyda-2e-zh

A Chinese translation of the book 'Python for Data Analysis' 2nd Edition, covering NumPy, Pandas, and other data analysis tools.

-1
-0.1%
1.1K
total stars
#889
joaoh82/rust_sqlite

A simple embedded database library in Rust modeled after SQLite, useful for Rust projects.

-1
-0.1%
1.1K
total stars
#890
fortunejs/fortune

Non-native graph database abstraction layer for Node.js and web browsers.

-2
-0.1%
1.5K
total stars
#891
reiinakano/scikit-plot

An intuitive Python library that adds plotting functionality to scikit-learn machine learning models

-3
-0.1%
2.4K
total stars
#892
matrixorigin/matrixone

Cloud-native, MySQL-compatible, AI-ready database with Git for Data, vector search, and full-text search capabilities.

-5
-0.3%
1.9K
total stars
#893
ricklamers/gridstudio

Grid Studio is a web-based application for data science with full integration of open source data science frameworks and languages.

-6
-0.1%
8.9K
total stars
#894
jayinai/data-science-question-answer

A collection of data science related questions and answers for developers.

-6
-0.3%
2.4K
total stars
#895
owid/covid-19-data

COVID-19 data repository for developers, providing daily updated case, death, and testing information.

-7
-0.1%
5.7K
total stars
#896
shencangsheng/easydb_app

EasyDB is a lightweight desktop app that lets you query local CSV, Excel, and JSON files with SQL, without an external database.

-56
-5.3%
995
total stars
#897
shaiwz/data-platform-open

A no-code, visual data integration platform for building big data pipelines and workflows.

-72
-6.6%
1.0K
total stars
1...17

Stay in the loop

Get weekly updates on trending AI coding tools and projects.