Trending Projects

Discover the fastest-growing open source projects

Showing 651-700 of 897 trending projects

#651
PoloDB/PoloDB

PoloDB is an embedded document database written in Rust for building cross-platform, local-first applications.

+33
+2.9%
1.2K
total stars
#652
shaypal5/awesome-twitter-data

A curated list of Twitter datasets and resources for data scientists and social network analysts.

+33
+3.1%
1.1K
total stars
#653
lacuna/bifurcan

A library of functional, durable data structures written in Java for developers building robust applications.

+33
+3.4%
1.0K
total stars
#654
vaexio/vaex

A high-performance Python library for working with large tabular datasets, offering efficient data manipulation and visualization.

+32
+0.4%
8.5K
total stars
#655
CLUEbenchmark/CLUEDatasetSearch

A comprehensive search tool for finding Chinese NLP datasets, with support for common English NLP datasets as well.

+32
+0.7%
4.4K
total stars
#656
huandu/go-sqlbuilder

A flexible and powerful SQL string builder library plus a zero-config ORM for Go developers.

+32
+1.9%
1.7K
total stars
#657
polarsignals/frostdb

A fast, embeddable columnar database written in Go.

+32
+2.2%
1.5K
total stars
#658
crazyhottommy/getting-started-with-genomics-tools-and-resources

A collection of Unix, R, and Python tools for bioinformatics and data science projects.

+32
+2.4%
1.4K
total stars
#659
jrfiedler/causal_inference_python_code

Python code for the book Causal Inference by Miguel Hernán and James Robins.

+32
+2.5%
1.3K
total stars
#660
apache/incubator-xtable

Apache XTable is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.

+32
+2.8%
1.2K
total stars
#661
gonum/plot

A Go library for creating high-quality plots and visualizations of data.

+31
+1.1%
2.9K
total stars
#662
lh3/bwa

A fast and accurate short-read sequence aligner written in C for genomics applications.

+31
+1.9%
1.7K
total stars
#663
quantopian/empyrical

A Python library that provides common financial risk and performance metrics used in financial analysis.

+31
+2.2%
1.5K
total stars
#664
jackzhenguo/python-small-examples

A collection of Python code examples and tutorials for data science, machine learning, and web development.

+30
+0.4%
8.1K
total stars
#665
jtablesaw/tablesaw

A high-performance Java library for data analysis, visualization, and machine learning.

+30
+0.8%
3.7K
total stars
#666
IndrajeetPatil/ggstatsplot

ggstatsplot is an R library that enhances ggplot2 visualizations with statistical analysis and hypothesis testing.

+30
+1.4%
2.2K
total stars
#667
hi-primus/optimus

Agile data preparation workflows made easy with popular Python data science libraries.

+30
+2.0%
1.5K
total stars
#668
tidwall/btree

A high-performance B-tree implementation for Go, useful for building database-like applications.

+30
+2.6%
1.2K
total stars
#669
rob-med/awesome-TS-anomaly-detection

A curated list of tools and datasets for anomaly detection on time-series data.

+29
+0.9%
3.2K
total stars
#670
uiwjs/province-city-china

Comprehensive dataset of China's administrative divisions (province, city, county, town) in JSON, CSV, and SQL formats.

+29
+1.0%
3.0K
total stars
#671
paul-buerkner/brms

An R package for Bayesian generalized multivariate non-linear multilevel models using Stan.

+29
+2.1%
1.4K
total stars
#672
graphframes/graphframes

GraphFrames provides DataFrame-based Graphs for Apache Spark, enabling scalable graph analysis and algorithms.

+29
+2.6%
1.1K
total stars
#673
docker-library/mongo

Docker image for the popular MongoDB database, enabling easy deployment and integration with other services.

+29
+2.8%
1.1K
total stars
#674
lukes/ISO-3166-Countries-with-Regional-Codes

A comprehensive dataset of ISO 3166-1 country codes and their corresponding UN Geoscheme regional codes, ready to use in various formats.

+28
+1.2%
2.4K
total stars
#675
osm2pgsql-dev/osm2pgsql

A command-line tool written in C++ for importing OpenStreetMap data into a PostgreSQL/PostGIS database.

+28
+1.8%
1.6K
total stars
#676
SciTools/cartopy

Cartopy is a Python library for creating maps and visualizing spatial data with matplotlib support.

+28
+1.8%
1.6K
total stars
#677
Cyan4973/FiniteStateEntropy

A high-performance entropy coding library written in C, implementing the Finite State Entropy and Huff0 codecs.

+28
+1.9%
1.5K
total stars
#678
Tessil/robin-map

A fast and efficient C++ hash map and hash set implementation using robin hood hashing.

+28
+2.0%
1.4K
total stars
#679
MaxHalford/prince

A Python library for performing multivariate exploratory data analysis, including techniques like PCA, CA, MCA, MFA, and FAMD.

+28
+2.0%
1.4K
total stars
#680
realm/realm-core

Core database component for the Realm Mobile Database SDKs, a popular NoSQL database for mobile apps.

+28
+2.8%
1.0K
total stars
#681
twosigma/flint

A time series library for Apache Spark that provides a high-level API for working with time series data.

+28
+2.8%
1.0K
total stars
#682
enhancedformysql/The-Art-of-Problem-Solving-in-Software-Engineering_How-to-Make-MySQL-Better

This repository provides a comprehensive guide on optimizing MySQL performance and solving common database problems.

+27
+1.4%
1.9K
total stars
#683
raphaelvallat/pingouin

A Python statistical package based on Pandas, providing various statistical methods and tests.

+27
+1.5%
1.9K
total stars
#684
mourner/flatbush

A fast spatial index library for 2D points and rectangles in JavaScript, useful for geospatial applications.

+27
+1.8%
1.6K
total stars
#685
XD-DENG/SQL-exercise

A collection of SQL practice problems for developers to improve their SQL skills.

+27
+1.9%
1.5K
total stars
#686
s3ql/s3ql

A full-featured file system for online data storage, built with Python.

+27
+2.3%
1.2K
total stars
#687
The-Japan-DataScientist-Society/100knocks-preprocess

A repository for the 100 Knocks of Data Science Preprocessing, focused on structured data processing.

+26
+1.1%
2.5K
total stars
#688
Giorgi/EntityFramework.Exceptions

A .NET Standard library that provides strongly typed exceptions for Entity Framework Core across multiple database providers.

+26
+1.6%
1.7K
total stars
#689
imageio/imageio

A Python library for reading and writing a wide range of image and video formats, including DICOM, animated GIFs, and webcam capture.

+26
+1.6%
1.7K
total stars
#690
pachterlab/gget

gget is a Python library that enables efficient querying of genomic reference databases like NCBI, Ensembl, and UniProt.

+26
+2.4%
1.1K
total stars
#691
mpmath/mpmath

A Python library for arbitrary-precision floating-point arithmetic, providing advanced numerical capabilities.

+26
+2.5%
1.1K
total stars
#692
bigdatagenomics/adam

ADAM is a genomics analysis platform with specialized file formats built using Apache Spark and Apache Parquet.

+26
+2.5%
1.0K
total stars
#693
sfikas/medical-imaging-datasets

A collection of medical imaging datasets for researchers and developers in the healthcare industry.

+25
+1.0%
2.5K
total stars
#694
roboyoshi/datacurator-filetree

A standard filetree template for data curation and organization, useful for developers interested in data management.

+24
+1.5%
1.6K
total stars
#695
go-spatial/tegola

Tegola is an open-source Mapbox Vector Tile server written in Go, enabling efficient geospatial data visualization.

+24
+1.7%
1.5K
total stars
#696
wgzhao/Addax

A fast and versatile ETL tool that transfers data between RDBMS and NoSQL databases.

+24
+1.7%
1.4K
total stars
#697
movingpandas/movingpandas

A Python library for analyzing movement trajectory data using GeoPandas.

+24
+1.8%
1.4K
total stars
#698
TablePlus/DBngin

DBngin is a free, open-source, cross-platform database management tool for developers.

+24
+2.0%
1.2K
total stars
#699
ResidentMario/geoplot

A high-level geospatial data visualization library for Python developers working with spatial data.

+24
+2.0%
1.2K
total stars
#700
rosedblabs/rosedb

Lightweight, fast, and reliable key-value database engine in Go for high-throughput applications.

+23
+0.5%
4.9K
total stars