Trending Projects

Discover the fastest growing open source projects

Showing 551-600 of 897 trending projects

#551
caserec/Datasets-for-Recommender-Systems

A repository of datasets for building recommender systems, useful for developers working on AI-powered applications.

+52
+5.0%
1.1K
total stars
#552
ravendb/ravendb

A highly scalable, distributed, document-oriented NoSQL database with full-text search, spatial, and time-series support.

+51
+1.3%
3.9K
total stars
#553
apache/avro

Apache Avro is a data serialization system for efficient storage and transmission of structured data.

+51
+1.6%
3.2K
total stars
#554
jldbc/pybaseball

A Python library for pulling current and historical baseball statistics, including Statcast, Baseball Reference, and FanGraphs data.

+51
+3.3%
1.6K
total stars
#555
duneanalytics/spellbook

A dbt project providing SQL models and views for Dune Analytics, a popular blockchain data analysis platform.

+51
+3.6%
1.5K
total stars
#556
DrTimothyAldenDavis/SuiteSparse

A powerful suite of sparse matrix algorithms and libraries for scientific and numerical computing.

+51
+3.6%
1.5K
total stars
#557
AlexTheAnalyst/PortfolioProjects

A collection of portfolio projects for a data analyst.

+51
+3.7%
1.4K
total stars
#558
tdpetrou/Learn-Pandas

This GitHub repository provides tutorials on effectively using the Pandas library for data analysis.

+51
+4.8%
1.1K
total stars
#559
DotNetNext/SqlSugar

A powerful, multi-database ORM for .NET that supports a wide range of SQL databases and provides a seamless data access layer.

+50
+0.9%
5.8K
total stars
#560
awslabs/deequ

Deequ is a Scala library for defining "unit tests for data" to measure data quality in large datasets.

+50
+1.4%
3.6K
total stars
#561
pydata/pandas-datareader

A Python library for extracting data from a wide range of internet sources into a pandas DataFrame.

+50
+1.6%
3.2K
total stars
#562
GreenmaskIO/greenmask

A Go-based tool for database anonymization and synthetic data generation to help with security, QA, and data masking.

+50
+3.2%
1.6K
total stars
#563
PumpkinDB/PumpkinDB

PumpkinDB is an immutable, ordered key-value database engine written in Rust.

+50
+3.7%
1.4K
total stars
#564
substrait-io/substrait

A cross-platform way to express data transformation, relational algebra, and standardized record expression and plans.

+49
+3.4%
1.5K
total stars
#565
nakabonne/tstorage

An embedded time-series database written in Go for storing and querying metrics data.

+49
+4.1%
1.2K
total stars
#566
rethinkdb/rethinkdb

Realtime NoSQL database for web apps

+48
+0.2%
27.0K
total stars
#567
ApsaraDB/PolarDB-for-PostgreSQL

A cloud-native PostgreSQL database developed by Alibaba Cloud for high-performance, scalable data storage and management.

+48
+1.6%
3.1K
total stars
#568
LastAncientOne/Stock_Analysis_For_Quant

A collection of stock analysis tools across various programming languages and platforms.

+48
+2.5%
2.0K
total stars
#569
npgsql/efcore.pg

Entity Framework Core provider for PostgreSQL, enabling .NET developers to easily interact with PostgreSQL databases.

+48
+2.7%
1.8K
total stars
#570
opengeos/Awesome-GEE

A curated list of Google Earth Engine resources for geospatial analysis and remote sensing applications.

+48
+4.3%
1.2K
total stars
#571
qri-io/qri

An open-source platform for building and sharing datasets, focused on trust, privacy, and decentralization.

+48
+4.5%
1.1K
total stars
#572
redis/RedisDesktopManager

Redis GUI client joining forces with Redis to enhance developer experience

+47
+0.2%
23.2K
total stars
#573
Alluxio/alluxio

Alluxio is an open-source data orchestration platform for analytics and machine learning workloads in the cloud.

+47
+0.7%
7.2K
total stars
#574
indradb/indradb

A Rust-based graph database for developers who need to store and query connected data.

+47
+2.0%
2.4K
total stars
#575
hi-primus/optimus

Agile data preparation workflows made easy with popular Python data science libraries.

+47
+3.1%
1.5K
total stars
#576
openbabel/openbabel

Open Babel is a chemical toolbox for working with chemical data and cheminformatics.

+47
+3.8%
1.3K
total stars
#577
inloop/sqlite-viewer

A simple SQLite file viewer that allows you to view and explore SQLite databases online.

+47
+4.8%
1.0K
total stars
#578
openaddresses/openaddresses

An open-source global repository of address, building, and parcel data for developers and geospatial applications.

+46
+1.5%
3.1K
total stars
#579
awslabs/open-data-registry

A registry of publicly available datasets hosted on AWS for data-driven developers.

+46
+2.9%
1.7K
total stars
#580
san089/goodreads_etl_pipeline

An end-to-end data pipeline for building a data lake, data warehouse, and analytics platform from GoodReads data.

+46
+3.2%
1.5K
total stars
#581
wireservice/agate

A Python data analysis library optimized for humans instead of machines.

+46
+4.0%
1.2K
total stars
#582
lijin-THU/notes-python

A comprehensive set of Python notes and resources for developers, covering a wide range of topics including data science, machine learning, and scientific computing.

+45
+0.6%
7.1K
total stars
#583
dbeaver/cloudbeaver

Cloud-based database manager UI for querying, managing, and visualizing databases across multiple platforms.

+45
+1.0%
4.7K
total stars
#584
rich-iannone/DiagrammeR

Graph and network visualization library for R developers working with tabular data

+45
+2.7%
1.7K
total stars
#585
mysql/mysql-server

Open-source relational database engine powering web apps, APIs, and data-driven backends worldwide.

+44
+0.4%
12.2K
total stars
#586
aarondl/sqlboiler

SQLBoiler is a Go ORM that generates code tailored to your database schema, making it easy to interact with databases.

+44
+0.6%
7.0K
total stars
#587
lux-org/lux

Automatically visualize your pandas dataframes with a single print command, enabling quick EDA.

+44
+0.8%
5.4K
total stars
#588
GoogleTrends/data

An open-source index of Google Trends data, useful for developers building data-driven applications.

+44
+0.9%
4.8K
total stars
#589
liam-hq/liam

Automatically generates beautiful and easy-to-read ER diagrams from your database.

+44
+0.9%
4.7K
total stars
#590
jdorfman/awesome-json-datasets

A curated list of awesome JSON datasets that don't require authentication.

+44
+1.3%
3.6K
total stars
#591
igraph/igraph

A powerful C library for analyzing complex networks and graph-based data structures.

+43
+2.3%
1.9K
total stars
#592
zalando/spilo

Highly available PostgreSQL cluster using Docker, focused on data infrastructure for developers.

+43
+2.5%
1.8K
total stars
#593
vaastav/Fantasy-Premier-League

A Python script that generates a CSV file with data about players in Fantasy Premier League, the fantasy game of the English Premier League.

+43
+2.6%
1.7K
total stars
#594
orientechnologies/orientdb

OrientDB is a versatile, multi-model DBMS that supports Graph, Document, Reactive, Full-Text, and Geospatial models.

+42
+0.9%
4.9K
total stars
#595
fluid-cloudnative/fluid

Fluid is a distributed data abstraction and acceleration framework for Big Data and AI applications on the cloud.

+42
+2.3%
1.9K
total stars
#596
mkazhdan/PoissonRecon

Poisson Surface Reconstruction is a C++ library for reconstructing surfaces from point cloud data.

+42
+2.4%
1.8K
total stars
#597
percona/percona-toolkit

Percona Toolkit is a collection of advanced open source database tools for MySQL, MongoDB, and PostgreSQL.

+42
+3.0%
1.5K
total stars
#598
lmmentel/awesome-python-chemistry

A curated list of Python packages for chemistry, including computational chemistry, molecular dynamics, and quantum chemistry.

+42
+3.2%
1.4K
total stars
#599
mymarilyn/clickhouse-driver

A Python driver for the ClickHouse database with native interface support.

+42
+3.4%
1.3K
total stars
#600
rordenlab/dcm2niix

A DICOM to NIfTI converter for medical imaging research and neuroimaging applications.

+42
+3.9%
1.1K
total stars