Trending Projects

Discover the fastest growing open source projects

Showing 651-700 of 897 trending projects

#651
neo4j-contrib/neo4j-apoc-procedures

A collection of procedures for the Neo4j graph database, providing advanced graph algorithms and utilities.

+67
+3.8%
1.9K
total stars
#652
Cyan4973/FiniteStateEntropy

A high-performance compression library written in C for developers working with large data sets.

+67
+4.8%
1.5K
total stars
#653
movingpandas/movingpandas

A Python library for analyzing movement trajectory data using GeoPandas.

+67
+5.1%
1.4K
total stars
#654
BrambleXu/pydata-notebook

A collection of Jupyter Notebook files for data analysis using Python, including a Chinese translation of the popular 'Python for Data Analysis' book.

+66
+1.4%
4.7K
total stars
#655
mourner/flatbush

A fast spatial index library for 2D points and rectangles in JavaScript, useful for geospatial applications.

+66
+4.4%
1.6K
total stars
#656
elixir-explorer/explorer

A fast and elegant data exploration library for Elixir, providing series and dataframes for data science workflows.

+66
+5.5%
1.3K
total stars
#657
osm2pgsql-dev/osm2pgsql

A C++ library for importing OpenStreetMap data into a PostgreSQL/PostGIS database.

+65
+4.2%
1.6K
total stars
#658
opengeos/Awesome-GEE

A curated list of Google Earth Engine resources for geospatial analysis and remote sensing applications.

+65
+5.9%
1.2K
total stars
#659
OvertureMaps/data

Overture Maps Data is a Python library providing access to open-source geographic data.

+65
+6.3%
1.1K
total stars
#660
ploomber/ploomber

Ploomber is a fast and versatile tool for building and deploying data pipelines that can be used with a variety of AI and ML tools.

+64
+1.8%
3.6K
total stars
#661
EntilZha/PyFunctional

A Python library for creating data processing pipelines using functional programming principles.

+64
+2.6%
2.5K
total stars
#662
FirebirdSQL/firebird

Firebird is a relational database management system (RDBMS) suitable for a wide range of applications from desktop to client-server to large databases.

+64
+4.8%
1.4K
total stars
#663
apache/impala

Apache Impala is a high-performance, open-source, SQL query engine that runs on Apache Hadoop and Apache Kudu.

+64
+5.3%
1.3K
total stars
#664
s3ql/s3ql

A full-featured file system for online data storage, built with Python.

+64
+5.5%
1.2K
total stars
#665
nfstream/nfstream

NFStream is a flexible network data analysis framework for network monitoring, security, and traffic classification.

+64
+5.7%
1.2K
total stars
#666
dotnet/EntityFramework.Docs

Documentation for the popular .NET ORM Entity Framework Core and Entity Framework 6.

+63
+3.8%
1.7K
total stars
#667
gopherdata/gophernotes

The Go kernel for Jupyter notebooks and nteract, enabling data science and numerical computing in Go.

+62
+1.6%
4.0K
total stars
#668
tidyverse/tidyverse

A collection of R packages for data science, including tools for data manipulation, visualization, and modeling.

+62
+3.6%
1.8K
total stars
#669
Unidata/MetPy

MetPy is a Python library for reading, visualizing, and performing calculations with weather data.

+62
+4.7%
1.4K
total stars
#670
kelvins/municipios-brasileiros

A Python library with data related to Brazilian municipalities, including IBGE codes, latitude, longitude, and more.

+62
+5.4%
1.2K
total stars
#671
ddsjoberg/gtsummary

An R package that provides customizable and presentation-ready data summary and analytic result tables.

+62
+5.6%
1.2K
total stars
#672
tdpetrou/Learn-Pandas

This GitHub repository provides tutorials on effectively using the Pandas library for data analysis.

+62
+5.9%
1.1K
total stars
#673
pachyderm/pachyderm

Pachyderm is a data-centric pipeline and data versioning platform for building and scaling data-intensive applications.

+61
+1.0%
6.3K
total stars
#674
mysql2sqlite/mysql2sqlite

Converts MySQL database dumps to SQLite3 compatible formats for easier migration and data portability.

+61
+3.2%
2.0K
total stars
#675
JuliaPlots/Plots.jl

Powerful plotting and data visualization library for the Julia programming language.

+61
+3.3%
1.9K
total stars
#676
neil3d/excel2json

A C# library that converts Excel spreadsheets to JSON objects and saves them to a text file.

+61
+3.4%
1.9K
total stars
#677
percona/percona-server

Percona Server is an enhanced, open-source version of the MySQL database management system.

+61
+5.1%
1.3K
total stars
#678
2ndQuadrant/pglogical

A high-performance logical replication extension for PostgreSQL that enables fast, cross-version database replication.

+61
+5.3%
1.2K
total stars
#679
johannfaouzi/pyts

A Python package for time series classification, useful for developers working with time-series data.

+60
+3.3%
1.9K
total stars
#680
pytroll/satpy

A Python package for processing earth-observing satellite data with support for common data formats and tools.

+60
+5.4%
1.2K
total stars
#681
shaypal5/awesome-twitter-data

A curated list of Twitter datasets and resources for data scientists and social network analysts.

+60
+5.8%
1.1K
total stars
#682
yhilpisch/py4fi

This is a Python library for financial applications, not a tool for AI-powered vibe coders.

+59
+3.2%
1.9K
total stars
#683
igraph/python-igraph

Python interface for the igraph library, a powerful tool for network analysis and visualization.

+59
+4.3%
1.4K
total stars
#684
realm/realm-core

Core database component for the Realm Mobile Database SDKs, a popular NoSQL database for mobile apps.

+59
+6.0%
1.0K
total stars
#685
sripathikrishnan/redis-rdb-tools

A Python tool to parse Redis dump.rdb files, analyze memory usage, and export data to JSON.

+58
+1.1%
5.2K
total stars
#686
brimdata/zui

Zui is a powerful desktop app for exploring and working with data, with support for CSV, JSON, and the Zed data format.

+58
+3.1%
1.9K
total stars
#687
twosigma/flint

A time series library for Apache Spark that provides a high-level API for working with time series data.

+58
+6.0%
1.0K
total stars
#688
apache/bookkeeper

Apache BookKeeper is a scalable, fault tolerant and low latency storage service optimized for append-only workloads.

+57
+3.0%
2.0K
total stars
#689
kevwan/go-stash

A high-performance, open-source data processing pipeline for ingesting Kafka data and sending it to Elasticsearch.

+57
+4.9%
1.2K
total stars
#690
fraunhoferportugal/tsfel

An intuitive library to extract features from time series data for data science and machine learning.

+57
+5.5%
1.1K
total stars
#691
hardikkamboj/An-Introduction-to-Statistical-Learning

This repository provides Python implementations of exercises from the book 'An Introduction to Statistical Learning'.

+56
+2.3%
2.5K
total stars
#692
xflr6/graphviz

Simple Python interface for Graphviz, a popular open-source data visualization tool.

+56
+3.2%
1.8K
total stars
#693
LuxCoreRender/LuxCore

LuxCore is a high-performance path-tracing render engine for realistic 3D graphics and visualization.

+56
+4.5%
1.3K
total stars
#694
BlakeRMills/MetBrewer

A color palette package in R inspired by works at the Metropolitan Museum of Art in New York.

+56
+4.8%
1.2K
total stars
#695
jeremyevans/sequel

Sequel is a Ruby library that provides a powerful and flexible object-relational mapping (ORM) for databases.

+55
+1.1%
5.1K
total stars
#696
emirozer/fake2db

A Python library that generates fake data for custom test databases.

+55
+2.4%
2.4K
total stars
#697
IRkernel/IRkernel

R kernel for the Jupyter notebook environment, enabling interactive R programming in Jupyter.

+55
+3.4%
1.7K
total stars
#698
itbdw/ip-database

An offline IP database for developers to look up IP address geolocation information.

+54
+3.8%
1.5K
total stars
#699
avinassh/py-caskdb

An educational project to build a disk-based key-value store in Python for learning purposes.

+54
+4.2%
1.4K
total stars
#700
wannesm/dtaidistance

A fast C-based implementation of Dynamic Time Warping, a popular algorithm for comparing time series data.

+54
+4.6%
1.2K
total stars
1...1315...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.