Trending Projects

Discover the fastest growing open source projects

Showing 501-550 of 897 trending projects

#501
cmu-db/noisepage

Self-Driving Database Management System from Carnegie Mellon University

0
0.0%
1.8K
total stars
#502
torodb/stampede

A database solution that provides better analytics on top of MongoDB and makes it easier to migrate from MongoDB to SQL.

0
0.0%
1.8K
total stars
#503
galaxyproject/galaxy

An open-source, community-driven platform for data-intensive scientific analysis and visualization.

0
0.0%
1.7K
total stars
#504
zonination/investing

This R library provides historical investment returns analysis for the overall stock market.

0
0.0%
1.7K
total stars
#505
cnosdb/cnosdb

A high-performance, highly available, and distributed time series database written in Rust.

0
0.0%
1.7K
total stars
#506
Kyubyong/numpy_exercises

A repository of NumPy exercises for developers looking to improve their Python and data manipulation skills.

0
0.0%
1.7K
total stars
#507
rich-iannone/DiagrammeR

Graph and network visualization library for R developers working with tabular data

0
0.0%
1.7K
total stars
#508
collabH/bigdata-growth

A comprehensive repository covering big data knowledge, including data warehouse modeling, real-time computing, Hadoop, Spark, and more.

0
0.0%
1.7K
total stars
#509
dotnet/EntityFramework.Docs

Documentation for the popular .NET ORM Entity Framework Core and Entity Framework 6.

0
0.0%
1.7K
total stars
#510
Werneror/Poetry

This repository provides a comprehensive dataset of over 850,000 Chinese poems from ancient to modern times, making it a valuable resource for developers working with Chinese poetry.

0
0.0%
1.7K
total stars
#511
apache/auron

The Auron accelerator framework leverages vectorized execution to speed up distributed computing on big data platforms like Spark.

0
0.0%
1.7K
total stars
#512
fonnesbeck/statistical-analysis-python-tutorial

A tutorial for performing statistical data analysis using Python, covering topics like regression, hypothesis testing, and more.

0
0.0%
1.7K
total stars
#513
Tencent/paxosstore

PaxosStore is a high-performance, distributed database solution built for large-scale applications.

0
0.0%
1.7K
total stars
#514
JifuZhao/DS-Take-Home

A collection of data science take-home challenges and solutions implemented in Jupyter Notebooks.

0
0.0%
1.7K
total stars
#515
lh3/bwa

A fast and accurate short-read sequence aligner written in C for genomics applications.

0
0.0%
1.7K
total stars
#516
dbt-labs/dbt-utils

Utility functions for dbt projects, a popular data transformation tool for data engineers.

0
0.0%
1.7K
total stars
#517
TuGraph-family/tugraph-db

TuGraph-DB is a high-performance graph database built for fast and efficient graph data processing.

0
0.0%
1.7K
total stars
#518
Giorgi/EntityFramework.Exceptions

A .NET Standard library that provides strongly typed exceptions for Entity Framework Core across multiple database providers.

0
0.0%
1.7K
total stars
#519
influxdata/influxdb-python

A Python client library for interacting with the InfluxDB time-series database.

0
0.0%
1.7K
total stars
#520
dingodb/dingo

A high-performance, MySQL-compatible vector database that supports structured and unstructured data for AI-driven applications.

0
0.0%
1.7K
total stars
#521
IRkernel/IRkernel

R kernel for the Jupyter notebook environment, enabling interactive R programming in Jupyter.

0
0.0%
1.7K
total stars
#522
vaastav/Fantasy-Premier-League

A Python script that generates a CSV file with data about players in the Fantasy Premier League game.

0
0.0%
1.7K
total stars
#523
faroit/awesome-python-scientific-audio

Curated list of Python software and packages for scientific research in audio

0
0.0%
1.7K
total stars
#524
imageio/imageio

A Python library for reading and writing a wide range of image and video formats, including DICOM, animated GIFs, and webcam capture.

0
0.0%
1.7K
total stars
#525
chaisql/chai

A modern SQL database written in Go for embedded and mobile applications.

0
0.0%
1.7K
total stars
#526
orium/rpds

A Rust library that provides persistent data structures for efficient and immutable data management.

0
0.0%
1.7K
total stars
#527
huandu/go-sqlbuilder

A flexible and powerful SQL string builder library plus a zero-config ORM for Go developers.

0
0.0%
1.7K
total stars
#528
topepo/caret

An R package for training and plotting classification and regression models.

0
0.0%
1.7K
total stars
#529
ptyadana/SQL-Data-Analysis-and-Visualization-Projects

This GitHub repository contains SQL data analysis and visualization projects using various tools and databases.

0
0.0%
1.7K
total stars
#530
hadley/ggplot2-book

Source for the book "ggplot2: Elegant Graphics for Data Analysis", which covers the ggplot2 data visualization library for R.

0
0.0%
1.7K
total stars
#531
awslabs/open-data-registry

A registry of publicly available datasets hosted on AWS for data-driven developers.

0
0.0%
1.7K
total stars
#532
jadianes/spark-py-notebooks

Apache Spark and Python tutorials for big data analysis and machine learning as Jupyter notebooks.

0
0.0%
1.7K
total stars
#533
eBay/akutan

A distributed knowledge graph store built in Go for managing large-scale semantic data.

0
0.0%
1.7K
total stars
#534
uhub/awesome-matlab

A curated list of awesome MATLAB frameworks, libraries, and software for scientific computing and data analysis.

0
0.0%
1.7K
total stars
#535
avhz/RustQuant

A Rust library for quantitative finance, including tools for machine learning, option pricing, and trading.

0
0.0%
1.7K
total stars
#536
mozilla/mentat

A persistent, relational store inspired by Datomic and DataScript, written in Rust.

0
0.0%
1.7K
total stars
#537
Hiflylabs/awesome-dbt

A curated list of awesome resources for the data transformation tool dbt, focused on analytics engineering.

0
0.0%
1.6K
total stars
#538
tylertreat/BoomFilters

Performant probabilistic data structures for processing continuous, unbounded streams in Go.

0
0.0%
1.6K
total stars
#539
cswinter/LocustDB

A blazingly fast analytics database built with Rust, optimized for rapidly devouring large amounts of data.

0
0.0%
1.6K
total stars
#540
typelevel/skunk

A functional, type-safe, composable Scala data access library for Postgres databases.

0
0.0%
1.6K
total stars
#541
Yimeng-Zhang/feature-engineering-and-feature-selection

A comprehensive guide to feature engineering and feature selection techniques in Python, with examples.

0
0.0%
1.6K
total stars
#542
koaning/drawdata

A Python library that allows developers to easily draw datasets within their notebooks.

0
0.0%
1.6K
total stars
#543
aergoio/litetree

SQLite with Branches - a lightweight, embedded database with version control capabilities.

0
0.0%
1.6K
total stars
#544
pointfreeco/sqlite-data

A fast, lightweight SQLite-based persistence layer with CloudKit synchronization for Swift developers.

0
0.0%
1.6K
total stars
#545
osm2pgsql-dev/osm2pgsql

A C++ command-line tool for importing OpenStreetMap data into a PostgreSQL/PostGIS database.

0
0.0%
1.6K
total stars
#546
babyfish-ct/jimmer

An advanced ORM library for Java and Kotlin developers that provides powerful caching and data management features.

0
0.0%
1.6K
total stars
#547
github/covid19-dashboard

An open-source COVID-19 dashboard powered by the fastpages framework, featuring data visualizations.

0
0.0%
1.6K
total stars
#548
roboyoshi/datacurator-filetree

A standard filetree template for data curation and organization, useful for developers interested in data management.

0
0.0%
1.6K
total stars
#549
reata/sqllineage

SQL Lineage Analysis Tool that provides data discovery and governance insights through Python.

0
0.0%
1.6K
total stars
#550
mongodb/mongo-hadoop

A Java connector for integrating MongoDB with Hadoop ecosystems for big data processing.

0
0.0%
1.6K
total stars