Trending Projects

Discover the fastest-growing open source projects

Showing 601-650 of 897 trending projects

#601

GeostatsGuy/PythonNumericalDemos

Python demos for spatial data analytics, geostatistics, and machine learning to support courses.

+0.2%

1.5K

total stars

Jupyter Notebook

#602

go-spatial/tegola

Tegola is an open-source Mapbox Vector Tile server written in Go, enabling efficient geospatial data visualization.

+0.2%

1.5K

total stars

#603

wgzhao/Addax

A fast and versatile ETL tool that can transfer data between RDBMS and NoSQL databases seamlessly.

+0.2%

1.4K

total stars

Java

#604

PKUJohnson/OpenData

An open-source financial data tool that provides simple API access to data scraped from various websites.

+0.2%

1.4K

total stars

Python

#605

Image-Py/imagepy

A Python-based image processing framework with plugins for common image processing libraries.

+0.2%

1.4K

total stars

Python

#606

attaswift/BTree

A fast, in-memory B-tree implementation for sorted collections in Swift.

+0.2%

1.3K

total stars

Swift

#607

alan-turing-institute/CleverCSV

A Python package for handling messy CSV files with improved dialect detection and a command-line interface.

+0.2%

1.3K

total stars

Python

#608

ifsnop/mysqldump-php

A PHP library that provides MySQL backup functionality, similar to the mysqldump CLI tool.

+0.2%

1.3K

total stars

PHP

#609

JetBrains/xodus

Xodus is a transactional, schema-less embedded database used by JetBrains products like YouTrack and Hub.

+0.2%

1.3K

total stars

Java

#610

datacrypt-project/hitchhiker-tree

A high-performance, persistent, off-heap data structure written in Clojure for data-intensive applications.

+0.3%

1.2K

total stars

Clojure

#611

TablePlus/DBngin

DBngin is a free, open-source, cross-platform database management tool for developers.

+0.3%

1.2K

total stars

#612

bububa/MongoHub-Mac

MongoHub is a native macOS MongoDB client that provides a GUI for managing and interacting with MongoDB databases.

+0.3%

1.2K

total stars

Objective-C

#613

apache/accumulo

Apache Accumulo is a scalable and robust key-value store that provides a sparse, sorted, distributed, and persistent multi-dimensional table.

+0.3%

1.1K

total stars

Java

#614

eduosi/district

This repository contains data on Chinese administrative divisions, including names, pinyin, and codes.

+0.3%

1.1K

total stars

#615

docker-library/mongo

Docker image for the popular MongoDB database, enabling easy deployment and integration with other services.

+0.3%

1.1K

total stars

Shell

#616

crazyhottommy/RNA-seq-analysis

Notes and code for analyzing RNA-seq data using Python and Snakemake.

+0.3%

1.1K

total stars

Python

#617

gunrock/gunrock

Programmable CUDA/C++ GPU Graph Analytics library for high-performance parallel graph processing.

+0.3%

1.1K

total stars

C++

#618

patx/pickledb

An in-memory key-value store that uses the orjson library for persistence, with SQLite support.

+0.3%

1.1K

total stars

Python

#619

apache/celeborn

Apache Celeborn is a high-performance shuffle and spilled data service for big data applications.

+0.3%

1.0K

total stars

Java

#620

CJ-Chen/TBtools-II

A powerful GUI/CLI tool for biologists to work with NGS data.

+0.3%

1.0K

total stars

Shell

#621

rstudio/pointblank

Data quality assessment and reporting tool for data frames and database tables in R.

+0.3%

1.0K

total stars

#622

elliotchance/orderedmap

An ordered map implementation in Go with amortized O(1) performance for common operations.

+0.3%

1.0K

total stars

#623

hazelcast/hazelcast

Hazelcast is a high-performance, distributed in-memory data platform for real-time insights and stream processing.

+0.0%

6.6K

total stars

Java

#624

apache/hive

Apache Hive is data warehouse software built on top of Apache Hadoop for querying and managing large datasets.

+0.0%

6.0K

total stars

Java

#625

niderhoff/nlp-datasets

A curated list of free/public domain text datasets for natural language processing (NLP) tasks.

+0.0%

6.0K

total stars

#626

kakuilan/china_area_mysql

A MySQL dataset containing China's 5-level administrative regions.

+0.0%

5.3K

total stars

#627

sripathikrishnan/redis-rdb-tools

A Python tool to parse Redis dump.rdb files, analyze memory usage, and export data to JSON.

+0.0%

5.2K

total stars

Python

#628

tidwall/buntdb

BuntDB is an embeddable, in-memory key/value database for Go with custom indexing and geospatial support.

+0.0%

4.8K

total stars

#629

crate/crate

CrateDB is a distributed, scalable SQL database for storing and analyzing massive amounts of data in near real-time.

+0.1%

4.4K

total stars

Java

#630

ploomber/ploomber

Ploomber is a fast and versatile tool for building and deploying data pipelines that can be used with a variety of AI and ML tools.

+0.1%

3.6K

total stars

Python

#631

WeBankFinTech/DataSphereStudio

DataSphereStudio is a one-stop data application development and management portal covering data exchange, analysis, and visualization.

+0.1%

3.3K

total stars

Java

#632

wesm/feather

Feather is a fast, interoperable binary data frame storage format for Python, R, and more, powered by Apache Arrow.

+0.1%

2.8K

total stars

JavaScript

#633

schematics/schematics

Python data structures library focused on serialization, deserialization, and validation of complex data schemas.

+0.1%

2.6K

total stars

Python

#634

youngyangyang04/Skiplist-CPP

A lightweight key-value store built with C++ using a skiplist data structure.

+0.1%

2.4K

total stars

C++

#635

chezou/tabula-py

A simple Python wrapper for the Tabula Java library, which extracts tables from PDF files into Pandas DataFrames.

+0.1%

2.3K

total stars

Python

#636

enhancedformysql/The-Art-of-Problem-Solving-in-Software-Engineering_How-to-Make-MySQL-Better

This repository provides a comprehensive guide on optimizing MySQL performance and solving common database problems.

+0.1%

1.9K

total stars

#637

plant99/felicette

A Python library for processing and visualizing satellite imagery data.

+0.1%

1.8K

total stars

Python

#638

Kyubyong/numpy_exercises

A repository of NumPy exercises for developers looking to improve their Python and data manipulation skills.

+0.1%

1.7K

total stars

Python

#639

JifuZhao/DS-Take-Home

A collection of data science take-home challenges and solutions implemented in Jupyter Notebooks.

+0.1%

1.7K

total stars

Jupyter Notebook

#640

dingodb/dingo

A high-performance, MySQL-compatible vector database that supports structured and unstructured data for AI-driven applications.

+0.1%

1.7K

total stars

Java

#641

aws-samples/aws-glue-samples

AWS Glue code samples for building data integration and ETL pipelines on AWS.

+0.1%

1.5K

total stars

Python

#642

locationtech/geomesa

GeoMesa is a suite of tools for working with big geospatial data in a distributed fashion.

+0.1%

1.5K

total stars

Scala

#643

shuttle-hq/synth

Synth is a Rust library for generating realistic, randomized test data for applications and databases.

+0.1%

1.5K

total stars

Rust

#644

CamDavidsonPilon/lifetimes

A Python library for calculating customer lifetime value metrics and cohort analysis.

+0.1%

1.5K

total stars

Python

#645

Tessil/robin-map

A fast and efficient C++ hash map and hash set implementation using robin hood hashing.

+0.1%

1.4K

total stars

C++

#646

sfirke/janitor

A collection of simple tools for data cleaning and wrangling in R for data science tasks.

+0.1%

1.4K

total stars

#647

tidyverse/tidyr

tidyr is an R package that provides a set of functions to tidy messy data into a format suitable for analysis.

+0.1%

1.4K

total stars

#648

r-spatial/sf

An R package that provides support for simple features, a standardized way to encode spatial vector data.

+0.1%

1.4K

total stars

#649

PumpkinDB/PumpkinDB

PumpkinDB is an immutable, ordered key-value database engine written in Rust.

+0.1%

1.4K

total stars

Rust

#650

PyTables/PyTables

A powerful Python package to manage and work with extremely large amounts of data.

+0.1%

1.4K

total stars

Python

