Trending Projects

Discover the fastest growing open source projects

Showing 451-500 of 897 trending projects

#451
egbertbouman/youtube-comment-downloader

Simple script for downloading YouTube comments without using the YouTube API.

+157
+15.1%
1.2K
total stars
#452
sentinelsat/sentinelsat

A Python library for searching and downloading Copernicus Sentinel satellite images for geographic data analysis.

+156
+18.2%
1.0K
total stars
#453
electricitymaps/electricitymaps-contrib

An open-source repository for parsing electricity data and powering a comprehensive electricity data platform.

+155
+4.1%
4.0K
total stars
#454
LastAncientOne/Stock_Analysis_For_Quant

A collection of stock analysis tools across various programming languages and platforms.

+154
+8.5%
2.0K
total stars
#455
delight-im/FreeGeoDB

A free database of geographic place names and corresponding geospatial data for developers to use.

+153
+10.7%
1.6K
total stars
#456
avehtari/BDA_py_demos

Provides Bayesian data analysis demos in Python for developers interested in probabilistic modeling.

+153
+17.3%
1.0K
total stars
#457
paradigmxyz/cryo

cryo is a Rust library for extracting blockchain data to parquet, CSV, JSON, or Python dataframes.

+152
+11.0%
1.5K
total stars
#458
jtv/libpqxx

The official C++ client API for PostgreSQL, providing a high-level interface for interacting with PostgreSQL databases.

+152
+13.4%
1.3K
total stars
#459
tidyverse/dplyr

dplyr is a powerful R library for data manipulation, providing a grammar of data manipulation.

+151
+3.1%
5.0K
total stars
#460
amundsen-io/amundsen

Amundsen is an open-source data discovery platform for improving productivity of data analysts and engineers.

+151
+3.3%
4.7K
total stars
#461
LibRaw/LibRaw

LibRaw is a C++ library for reading RAW image files from digital cameras.

+151
+11.8%
1.4K
total stars
#462
owid/covid-19-data

COVID-19 data repository for developers, providing daily updated case, death, and testing information.

+150
+2.7%
5.7K
total stars
#463
awslabs/deequ

Deequ is a Scala library for defining "unit tests for data" to measure data quality in large datasets.

+150
+4.4%
3.6K
total stars
#464
RUCAIBox/RecSysDatasets

A repository of public data sources for building and testing recommender systems.

+149
+14.6%
1.2K
total stars
#465
ekzhu/datasketch

A Python library for data sketching techniques like MinHash, LSH, HyperLogLog, and HNSW for approximate similarity search.

+148
+5.4%
2.9K
total stars
#466
neilotoole/sq

sq is a Go-based data wrangling tool that supports a variety of data formats and databases.

+147
+6.4%
2.5K
total stars
#467
rosedblabs/rosedb

Lightweight, fast, and reliable key-value database engine in Go for high-throughput applications.

+146
+3.1%
4.9K
total stars
#468
CLUEbenchmark/CLUEDatasetSearch

A comprehensive search tool for finding Chinese NLP datasets, with support for common English NLP datasets as well.

+146
+3.4%
4.4K
total stars
#469
antontarasenko/smq

A collection of SQL queries to analyze social media datasets.

+146
+10.4%
1.5K
total stars
#470
tidwall/buntdb

BuntDB is an embeddable, in-memory key/value database for Go with custom indexing and geospatial support.

+145
+3.1%
4.8K
total stars
#471
apache/avro

Apache Avro is a data serialization system for efficient storage and transmission of structured data.

+145
+4.7%
3.2K
total stars
#472
rich-iannone/DiagrammeR

Graph and network visualization library for R developers working with tabular data

+145
+9.1%
1.7K
total stars
#473
igrigorik/gharchive.org

An open-source project that captures the public GitHub timeline and makes it accessible for analysis.

+143
+5.0%
3.0K
total stars
#474
PyWavelets/pywt

PyWavelets is a Python library for wavelet transform algorithms and techniques, useful for image and signal processing.

+143
+6.5%
2.3K
total stars
#475
AlexTheAnalyst/PortfolioProjects

This repository contains a collection of portfolio projects for a data analyst, not a developer discovery platform.

+143
+11.2%
1.4K
total stars
#476
objectbox/objectbox-go

Embedded Go Database, a fast open-source NoSQL database solution for Go projects.

+143
+12.8%
1.3K
total stars
#477
linq2db/linq2db

Linq to database provider for .NET, supporting various database engines.

+142
+4.6%
3.2K
total stars
#478
dolthub/go-mysql-server

A MySQL-compatible relational database with a storage agnostic query engine, implemented in Go.

+142
+5.7%
2.6K
total stars
#479
TuGraph-family/tugraph-db

TuGraph-DB is a high-performance graph database built for fast and efficient graph data processing.

+142
+9.1%
1.7K
total stars
#480
spandanb/learndb-py

A Python library that implements database internals from scratch, useful for learning database concepts.

+142
+11.9%
1.3K
total stars
#481
RoaringBitmap/RoaringBitmap

A high-performance compressed bitset library for Java used in Apache Spark, Netflix Atlas, and others.

+141
+3.8%
3.8K
total stars
#482
mymarilyn/clickhouse-driver

A Python driver for the ClickHouse database with native interface support.

+141
+12.2%
1.3K
total stars
#483
SciRuby/sciruby

SciRuby provides a collection of tools for scientific computation in Ruby, catering to developers working with data and scientific applications.

+141
+16.4%
1.0K
total stars
#484
cube2222/octosql

OctoSQL is a powerful SQL query tool that allows you to join, analyze, and transform data from multiple databases and file formats.

+140
+2.7%
5.2K
total stars
#485
dpilger26/NumCpp

A C++ implementation of the Python NumPy library for scientific computing and numerical analysis.

+139
+3.7%
3.9K
total stars
#486
axiomhq/hyperloglog

HyperLogLog data structure library with space-efficient sparse and LogLog-Beta implementations.

+139
+15.6%
1.0K
total stars
#487
psycopg/psycopg2

A Python database adapter for PostgreSQL, allowing developers to interact with their databases.

+138
+4.0%
3.6K
total stars
#488
kayak/pypika

PyPika is a Python SQL query builder that provides a readable, Pythonic syntax for constructing complex SQL queries.

+137
+5.0%
2.9K
total stars
#489
mono/taglib-sharp

A C# library for reading and writing metadata in media files, useful for audio and video processing applications.

+137
+10.6%
1.4K
total stars
#490
erthink/libmdbx

High-performance, transactional key-value database engine for embedded systems and cryptocurrencies.

+137
+11.2%
1.4K
total stars
#491
ApsaraDB/PolarDB-for-PostgreSQL

A cloud-native PostgreSQL database developed by Alibaba Cloud for high-performance, scalable data storage and management.

+136
+4.5%
3.1K
total stars
#492
babyfish-ct/jimmer

An advanced ORM library for Java and Kotlin developers that provides powerful caching and data management features.

+136
+9.1%
1.6K
total stars
#493
raphaelvallat/pingouin

A Python statistical package based on Pandas, providing various statistical methods and tests.

+135
+7.8%
1.9K
total stars
#494
dedupeio/dedupe

A Python library for accurate and scalable fuzzy matching, record deduplication, and entity resolution.

+134
+3.1%
4.4K
total stars
#495
jdorfman/awesome-json-datasets

A curated list of awesome JSON datasets that don't require authentication.

+134
+3.9%
3.6K
total stars
#496
substrait-io/substrait

A cross-platform way to express data transformation, relational algebra, and standardized record expression and plans.

+134
+10.0%
1.5K
total stars
#497
dotnetcore/FreeSql

An ORM (Object-Relational Mapping) library for .NET that supports a wide range of database providers, including SQL Server, MySQL, PostgreSQL, and more.

+133
+3.1%
4.4K
total stars
#498
wainshine/Chinese-Names-Corpus

A Chinese name corpus and generator for natural language processing and entity recognition.

+133
+3.2%
4.3K
total stars
#499
uhub/awesome-matlab

A curated list of awesome MATLAB frameworks, libraries, and software for scientific computing and data analysis.

+133
+8.7%
1.7K
total stars
#500
AlaSQL/alasql

AlaSQL is a JavaScript SQL database for browser and Node.js that handles both relational tables and nested JSON data.

+132
+1.9%
7.3K
total stars
1...911...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.