Trending Projects

Discover the fastest growing open source projects

Showing 701-750 of 897 trending projects

#701
JuliaPlots/Plots.jl

Powerful plotting and data visualization library for the Julia programming language.

+23
+1.2%
1.9K
total stars
#702
brimdata/zui

Zui is a powerful desktop app for exploring and working with data, with support for CSV, JSON, and the Zed data format.

+23
+1.2%
1.9K
total stars
#703
mongodb/mongo-hadoop

A Java connector for integrating MongoDB with Hadoop ecosystems for big data processing.

+23
+1.4%
1.6K
total stars
#704
igraph/python-igraph

Python interface for the igraph library, a powerful tool for network analysis and visualization.

+23
+1.6%
1.4K
total stars
#705
kelvins/municipios-brasileiros

A dataset of Brazilian municipalities, including IBGE codes, latitude, longitude, and more.

+23
+1.9%
1.2K
total stars
#706
OvertureMaps/data

Overture Maps Data provides access to open geographic data from the Overture Maps Foundation.

+23
+2.1%
1.1K
total stars
#707
griddb/griddb

GridDB is a fast and scalable open-source database for time-series IoT and big data applications.

+22
+0.9%
2.5K
total stars
#708
oceanbase/seekdb

AI-native database unifying vector, text, and structured data for hybrid search and in-database AI workflows.

+22
+0.9%
2.4K
total stars
#709
quarylabs/quary

Open-source BI platform for engineers to explore and model large-scale data pipelines.

+22
+0.9%
2.4K
total stars
#710
xflr6/graphviz

Simple Python interface for Graphviz, a popular open-source graph visualization tool.

+22
+1.2%
1.8K
total stars
#711
schemacrawler/SchemaCrawler

SchemaCrawler is a free database schema discovery and comprehension tool that supports various database management systems.

+22
+1.3%
1.8K
total stars
#712
spark-examples/pyspark-examples

A collection of PySpark examples covering RDD, DataFrame, and Dataset operations in Python.

+22
+1.7%
1.3K
total stars
#713
ddsjoberg/gtsummary

An R package that provides customizable and presentation-ready data summary and analytic result tables.

+22
+1.9%
1.2K
total stars
#714
pytroll/satpy

A Python package for processing earth-observing satellite data with support for common data formats and tools.

+22
+1.9%
1.2K
total stars
#715
dotnet/EntityFramework.Docs

Documentation for the popular .NET ORM Entity Framework Core and Entity Framework 6.

+21
+1.2%
1.7K
total stars
#716
mono/taglib-sharp

A C# library for reading and writing metadata in media files, useful for audio and video processing applications.

+21
+1.5%
1.4K
total stars
#717
r-spatial/sf

An R package that provides support for simple features, a standardized way to encode spatial vector data.

+21
+1.5%
1.4K
total stars
#718
percona/percona-server

Percona Server is an enhanced, open-source version of the MySQL database management system.

+21
+1.7%
1.3K
total stars
#719
nfstream/nfstream

NFStream is a flexible network data analysis framework for network monitoring, security, and traffic classification.

+21
+1.8%
1.2K
total stars
#720
neil3d/excel2json

A C# tool that converts Excel spreadsheets to JSON and saves the result to a text file.

+20
+1.1%
1.9K
total stars
#721
timescale/tsbs

A tool for comparing and evaluating databases for time series data.

+20
+1.4%
1.4K
total stars
#722
FirebirdSQL/firebird

Firebird is a relational database management system (RDBMS) suitable for a wide range of applications from desktop to client-server to large databases.

+20
+1.4%
1.4K
total stars
#723
obspy/obspy

A Python toolbox for seismology and seismological observatories, providing tools for data processing and analysis.

+20
+1.6%
1.3K
total stars
#724
pyexcel/pyexcel

A Python library for reading, manipulating, and writing data in various spreadsheet file formats.

+20
+1.6%
1.3K
total stars
#725
microsoft/Trill

Trill is a single-node query processor for temporal or streaming data.

+20
+1.6%
1.3K
total stars
#726
2ndQuadrant/pglogical

A high-performance logical replication extension for PostgreSQL that enables fast, cross-version database replication.

+20
+1.7%
1.2K
total stars
#727
pachyderm/pachyderm

Pachyderm is a data-centric pipeline and data versioning platform for building and scaling data-intensive applications.

+19
+0.3%
6.3K
total stars
#728
ankane/groupdate

A Ruby library that makes it easy to group data by time period, such as day, week, or month.

+19
+0.5%
3.9K
total stars
#729
HuaRongSAO/talib-document

Chinese translation of the documentation for TA-Lib, a technical analysis indicator library with a Python wrapper.

+19
+1.0%
1.9K
total stars
#730
avinassh/py-caskdb

An educational project to build a disk-based key-value store in Python for learning purposes.

+19
+1.4%
1.4K
total stars
#731
apache/impala

Apache Impala is a high-performance, open-source, SQL query engine that runs on Apache Hadoop and Apache Kudu.

+19
+1.5%
1.3K
total stars
#732
johannfaouzi/pyts

A Python package for time series classification.

+18
+1.0%
1.9K
total stars
#733
Data-Learn/data-engineering

A comprehensive resource for developers to learn and get started with data engineering using Python.

+18
+1.4%
1.3K
total stars
#734
SPLWare/esProc

esProc SPL is a JVM-based programming language for structured data computation, serving as both a data analysis tool and an embedded computing engine.

+17
+0.4%
4.7K
total stars
#735
Cyb3rWard0g/HELK

An open-source threat hunting platform built on the ELK stack for security researchers and analysts.

+17
+0.4%
3.9K
total stars
#736
hardikkamboj/An-Introduction-to-Statistical-Learning

This repository provides Python implementations of exercises from the book 'An Introduction to Statistical Learning'.

+17
+0.7%
2.5K
total stars
#737
dingodb/dingo

A high-performance, MySQL-compatible vector database that supports structured and unstructured data for AI-driven applications.

+17
+1.0%
1.7K
total stars
#738
hadley/ggplot2-book

Source for the book 'ggplot2: Elegant Graphics for Data Analysis', which covers the ggplot2 data visualization package for R.

+17
+1.0%
1.7K
total stars
#739
pyjanitor-devs/pyjanitor

A Python library for cleaning and transforming data, inspired by the R package Janitor.

+17
+1.2%
1.5K
total stars
#740
shuttle-hq/synth

Synth is a tool written in Rust for generating realistic, randomized test data for applications and databases.

+17
+1.2%
1.5K
total stars
#741
Unidata/MetPy

MetPy is a Python library for reading, visualizing, and performing calculations with weather data.

+17
+1.2%
1.4K
total stars
#742
Toblerity/Fiona

Fiona is a Python library for reading and writing geographic data files, with support for CLI usage.

+17
+1.4%
1.2K
total stars
#743
syndtr/goleveldb

LevelDB key/value database in Go for building high-performance data-intensive applications.

+16
+0.3%
6.3K
total stars
#744
tidyverse/tidyverse

A collection of R packages for data science, including tools for data manipulation, visualization, and modeling.

+16
+0.9%
1.8K
total stars
#745
LuxCoreRender/LuxCore

LuxCore is a high-performance path-tracing render engine for realistic 3D graphics and visualization.

+16
+1.3%
1.3K
total stars
#746
eleanorlutz/asteroids_atlas_of_space

This is an astronomy visualization project that maps orbits of asteroids in the solar system.

+16
+1.3%
1.3K
total stars
#747
ploomber/ploomber

Ploomber is a fast and versatile tool for building and deploying data pipelines that can be used with a variety of AI and ML tools.

+15
+0.4%
3.6K
total stars
#748
aditya-grover/node2vec

An implementation of the node2vec algorithm for learning node embeddings on graphs, with Python and Spark (Scala) versions.

+15
+0.6%
2.7K
total stars
#749
benedekrozemberczki/awesome-community-detection

A curated list of community detection research papers with implementations for data science and network analysis.

+15
+0.6%
2.4K
total stars
#750
PizzaDeDados/datascience-pizza

A repository collecting study materials and resources on data analysis and adjacent fields.

+15
+0.6%
2.4K
total stars