Trending Projects

Discover the fastest growing open source projects

Showing 551-600 of 897 trending projects

#551
JasonKessler/scattertext

A Python library for creating beautiful visualizations of language differences across document types.

0
0.0%
2.3K
total stars
#552
BlankerL/DXY-COVID-19-Data

A data warehouse for COVID-19 time series data, useful for data analysis and visualization.

0
0.0%
2.2K
total stars
#553
IndrajeetPatil/ggstatsplot

ggstatsplot is an R library that enhances ggplot2 visualizations with statistical analysis and hypothesis testing.

0
0.0%
2.2K
total stars
#554
tensorchord/pgvecto.rs

Scalable, low-latency vector search in Postgres.

0
0.0%
2.2K
total stars
#555
ngaut/builddatabase

A distributed SQL database built from scratch.

0
0.0%
2.1K
total stars
#556
fugue-project/fugue

A unified interface for distributed computing on Spark, Dask and Ray without any rewrites.

0
0.0%
2.1K
total stars
#557
chris1610/pbpython

A collection of Python code, notebooks, and examples for practical business data analysis and visualization.

0
0.0%
2.0K
total stars
#558
apache/bookkeeper

Apache BookKeeper is a scalable, fault tolerant and low latency storage service optimized for append-only workloads.

0
0.0%
2.0K
total stars
#559
apache/datafusion-ballista

Apache DataFusion Ballista is a distributed query engine for big data analysis, built with Rust and Arrow.

0
0.0%
2.0K
total stars
#560
mysql2sqlite/mysql2sqlite

Converts MySQL database dumps to SQLite3 compatible formats for easier migration and data portability.

0
0.0%
2.0K
total stars
#561
shancarter/mr-data-converter

A JavaScript library that converts CSV and tab-delimited data to web-friendly formats like JSON and XML.

0
0.0%
2.0K
total stars
#562
JuliaPlots/Plots.jl

Powerful plotting and data visualization library for the Julia programming language.

0
0.0%
1.9K
total stars
#563
openacid/slim

A space-efficient trie data structure in Go with fast lookup performance.

0
0.0%
1.9K
total stars
#564
eveningkid/denodb

A versatile ORM for multiple databases including MySQL, SQLite, MariaDB, PostgreSQL, and MongoDB in Deno.

0
0.0%
1.9K
total stars
#565
GiovineItalia/Gadfly.jl

Crafty statistical graphics library for the Julia programming language.

0
0.0%
1.9K
total stars
#566
AileenNielsen/TimeSeriesAnalysisWithPython

A Jupyter Notebook repository focused on time series analysis using Python.

0
0.0%
1.9K
total stars
#567
ContextLab/hypertools

A Python toolbox for gaining geometric insights into high-dimensional data.

0
0.0%
1.9K
total stars
#568
h2oai/datatable

A high-performance, memory-efficient Python data analysis library for handling large datasets.

0
0.0%
1.9K
total stars
#569
begeekmyfriend/bplustree

A fast B+ tree indexing structure in C for efficient storage and retrieval of billions of key-value pairs.

0
0.0%
1.9K
total stars
#570
raphaelvallat/pingouin

A Python statistical package based on Pandas, providing various statistical methods and tests.

0
0.0%
1.9K
total stars
#571
yougov/mongo-connector

MongoDB data stream pipeline tools for managing real-time data synchronization and replication.

0
0.0%
1.9K
total stars
#572
neil3d/excel2json

A C# library that converts Excel spreadsheets to JSON objects and saves them to a text file.

0
0.0%
1.9K
total stars
#573
HuaRongSAO/talib-document

A Python library for technical analysis indicators, with Chinese translation and documentation.

0
0.0%
1.9K
total stars
#574
dask/dask-tutorial

An interactive tutorial for the Dask distributed computing library, focused on data analysis and manipulation.

0
0.0%
1.9K
total stars
#575
eigenteam/eigen-git-mirror

A high-performance C++ linear algebra library focused on solvers, sparse matrices, and numerical computing.

0
0.0%
1.8K
total stars
#576
edyoda/data-science-complete-tutorial

This repository provides comprehensive tutorials and resources for learning data science and machine learning using Python.

0
0.0%
1.8K
total stars
#577
alibaba/MongoShake

MongoShake is a universal data replication platform based on MongoDB's oplog, enabling redundant replication and active-active replication.

0
0.0%
1.8K
total stars
#578
npgsql/efcore.pg

Entity Framework Core provider for PostgreSQL, enabling .NET developers to easily interact with PostgreSQL databases.

0
0.0%
1.8K
total stars
#579
jstat/jstat

A JavaScript statistical library that provides a wide range of statistical functions for data analysis.

0
0.0%
1.8K
total stars
#580
Cysharp/MasterMemory

A C# in-memory document database with source generator-based embedded typed readonly data.

0
0.0%
1.8K
total stars
#581
apachecn/python_data_analysis_and_mining_action

This Python repository contains code examples and notes for data analysis and mining.

0
0.0%
1.8K
total stars
#582
gchq/Gaffer

A large-scale entity and relation database supporting aggregation of properties for big data applications.

0
0.0%
1.8K
total stars
#583
citusdata/cstore_fdw

A columnar storage extension for Postgres built as a foreign data wrapper.

0
0.0%
1.8K
total stars
#584
schemacrawler/SchemaCrawler

SchemaCrawler is a free database schema discovery and comprehension tool that supports various database management systems.

0
0.0%
1.8K
total stars
#585
geekinglcq/CDCS

A collection of solutions to Chinese data competitions, primarily using Python.

0
0.0%
1.8K
total stars
#586
variety/variety

A MongoDB schema analysis tool that helps developers understand and optimize their NoSQL database.

0
0.0%
1.8K
total stars
#587
crossfilter/crossfilter

Fast n-dimensional filtering and grouping of records, a powerful data manipulation library for JavaScript.

0
0.0%
1.8K
total stars
#588
cmu-db/noisepage

A self-driving database management system from Carnegie Mellon University.

0
0.0%
1.8K
total stars
#589
torodb/stampede

A database solution that provides better analytics on top of MongoDB and makes it easier to migrate from MongoDB to SQL.

0
0.0%
1.8K
total stars
#590
zonination/investing

This R library provides historical investment returns analysis for the overall stock market.

0
0.0%
1.7K
total stars
#591
cnosdb/cnosdb

A high-performance, highly available, and distributed time series database written in Rust.

0
0.0%
1.7K
total stars
#592
Kyubyong/numpy_exercises

A repository of NumPy exercises for developers looking to improve their Python and data manipulation skills.

0
0.0%
1.7K
total stars
#593
rich-iannone/DiagrammeR

Graph and network visualization library for R developers working with tabular data.

0
0.0%
1.7K
total stars
#594
collabH/bigdata-growth

A comprehensive repository covering big data knowledge, including data warehouse modeling, real-time computing, Hadoop, Spark, and more.

0
0.0%
1.7K
total stars
#595
dotnet/EntityFramework.Docs

Documentation for the popular .NET ORM Entity Framework Core and Entity Framework 6.

0
0.0%
1.7K
total stars
#596
apache/auron

The Auron accelerator framework leverages vectorized execution to speed up distributed computing on big data platforms like Spark.

0
0.0%
1.7K
total stars
#597
fonnesbeck/statistical-analysis-python-tutorial

A tutorial for performing statistical data analysis using Python, covering topics like regression, hypothesis testing, and more.

0
0.0%
1.7K
total stars
#598
Tencent/paxosstore

PaxosStore is a high-performance, distributed database solution built for large-scale applications.

0
0.0%
1.7K
total stars
#599
JifuZhao/DS-Take-Home

A collection of data science take-home challenges and solutions implemented in Jupyter Notebooks.

0
0.0%
1.7K
total stars
#600
influxdata/influxdb-python

A Python client library for interacting with the InfluxDB time-series database.

0
0.0%
1.7K
total stars