Trending Projects

Discover the fastest growing open source projects

Showing 801-850 of 897 trending projects

#801
typelevel/skunk

A functional, type-safe, composable Scala data access library for Postgres databases.

+9
+0.6%
1.6K
total stars
#802
8080labs/ppscore

A Python library that provides a Predictive Power Score (PPS) to measure the predictive power between variables.

+9
+0.8%
1.2K
total stars
#803
easystats/easystats

An R project focused on providing high-performance statistical models, data analysis, and visualization tools.

+9
+0.8%
1.1K
total stars
#804
emirozer/fake2db

A Python library that generates fake data for custom test databases.

+8
+0.3%
2.4K
total stars
#805
TomAugspurger/effective-pandas

A collection of articles and source code on using the pandas data analysis library.

+8
+0.5%
1.6K
total stars
#806
GeostatsGuy/PythonNumericalDemos

Python demos for spatial data analytics, geostatistics, and machine learning to support courses.

+8
+0.6%
1.5K
total stars
#807
jeremycole/innodb_diagrams

Diagrams and documentation for InnoDB, the storage engine used by MySQL and MariaDB databases.

+8
+0.6%
1.5K
total stars
#808
enthought/mayavi

A powerful 3D visualization library for scientific data in Python.

+8
+0.6%
1.4K
total stars
#809
data-forge/data-forge-ts

A TypeScript toolkit for data transformation and analysis inspired by Pandas and LINQ.

+8
+0.6%
1.4K
total stars
#810
alan-turing-institute/CleverCSV

A Python package for handling messy CSV files with improved dialect detection and a command-line interface.

+8
+0.6%
1.3K
total stars
#811
ifsnop/mysqldump-php

A PHP library that provides a MySQL backup functionality, similar to the mysqldump CLI tool.

+8
+0.6%
1.3K
total stars
#812
Image-Py/imagepy

A Python-based image processing framework with plugins for common image processing libraries.

+7
+0.5%
1.4K
total stars
#813
scijs/ndarray

A JavaScript library for working with multidimensional arrays, useful for data visualization and scientific computing.

+7
+0.6%
1.2K
total stars
#814
spatie/db-dumper

A PHP library for dumping the contents of a database to a file, supporting multiple database engines.

+7
+0.6%
1.2K
total stars
#815
brettkromkamp/contextualise

Contextualise is a powerful tool for organizing diverse information resources in knowledge-intensive projects.

+7
+0.6%
1.1K
total stars
#816
datafold/data-diff

A Python library for comparing data across databases, supporting various database engines.

+6
+0.2%
3.0K
total stars
#817
wesm/feather

Feather is a fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow.

+6
+0.2%
2.8K
total stars
#818
shancarter/mr-data-converter

A JavaScript library that converts CSV and tab-delimited data to web-friendly formats like JSON and XML.

+6
+0.3%
2.0K
total stars
#819
couchbase/forestdb

A fast, hierarchical key-value storage engine written in C++ for applications that require high performance and scalability.

+6
+0.5%
1.3K
total stars
#820
x2bool/xlite

A Rust library that enables querying Excel spreadsheets using SQLite, making data extraction and analysis more efficient.

+6
+0.5%
1.3K
total stars
#821
juliasilge/tidytext

A library for text mining and natural language processing using tidy data principles in R.

+6
+0.5%
1.2K
total stars
#822
tangwz/db-monthly

A collection of monthly reports on the internals of Alibaba Cloud's database products.

+6
+0.6%
1.1K
total stars
#823
liucongg/NLPDataSet

A repository containing various NLP datasets collected and organized by the owner.

+6
+0.6%
1.1K
total stars
#824
orbitinghail/sqlsync

Collaborative offline-first SQLite wrapper for syncing app state across users & devices

+5
+0.2%
2.9K
total stars
#825
orbitjs/orbit

A composable data framework for building ambitious web applications using TypeScript.

+5
+0.2%
2.3K
total stars
#826
RJT1990/pyflux

Open source time series library for Python, useful for statistical analysis and modeling.

+5
+0.2%
2.1K
total stars
#827
eigenteam/eigen-git-mirror

A high-performance C++ linear algebra library focused on solvers, sparse matrices, and numerical computing.

+5
+0.3%
1.8K
total stars
#828
tensorbase/tensorbase

TensorBase is a new big data warehousing solution built with Rust, focused on high-performance analytics.

+5
+0.3%
1.5K
total stars
#829
ucarGroup/DataLink

DataLink is a real-time and offline data exchange platform that supports synchronization between heterogeneous data sources.

+5
+0.5%
1.1K
total stars
#830
BlankerL/DXY-COVID-19-Data

A data warehouse for COVID-19 time series data, useful for data analysis and visualization.

+4
+0.2%
2.2K
total stars
#831
eveningkid/denodb

A versatile ORM for multiple databases including MySQL, SQLite, MariaDB, PostgreSQL, and MongoDB in Deno.

+4
+0.2%
1.9K
total stars
#832
cswinter/LocustDB

A blazingly fast analytics database built with Rust, optimized for rapidly devouring large amounts of data.

+4
+0.2%
1.6K
total stars
#833
Intel-bigdata/HiBench

HiBench is a big data benchmark suite for evaluating the performance of different big data frameworks.

+4
+0.3%
1.5K
total stars
#834
CodeCutTech/Efficient_Python_tricks_and_tools_for_data_scientists

A collection of efficient Python tricks and tools for data scientists to improve their productivity.

+4
+0.3%
1.5K
total stars
#835
Softmotions/ejdb

EJDB2 is an embeddable JSON database engine with a simple XPath-like query language (JQL) for C/C++ applications.

+4
+0.3%
1.5K
total stars
#836
karlseguin/the-little-redis-book

A book that teaches the basics of using the Redis in-memory data structure store.

+4
+0.3%
1.5K
total stars
#837
apache/carbondata

CarbonData is a high-performance data store solution for big data analytics on Hadoop and Spark.

+4
+0.3%
1.4K
total stars
#838
quiltdata/quilt

Quilt is a data mesh for connecting people with actionable data, built with TypeScript.

+4
+0.3%
1.4K
total stars
#839
RxSwiftCommunity/RxRealm

A Swift extension for RealmSwift that provides reactive programming support using RxSwift.

+4
+0.3%
1.2K
total stars
#840
GeospatialPython/pyshp

A pure Python library for reading and writing ESRI Shapefiles, a popular geospatial data format.

+4
+0.3%
1.1K
total stars
#841
red-data-tools/pycall.rb

A library for calling Python functions from the Ruby language, enabling data science and ML workflows.

+4
+0.4%
1.1K
total stars
#842
FeatureBaseDB/featurebase

FeatureBase is a fast analytical database built on bitmaps, perfect for ML and data-intensive applications.

+3
+0.1%
2.5K
total stars
#843
openacid/slim

A space-efficient trie data structure in Go with fast lookup performance.

+3
+0.2%
1.9K
total stars
#844
citusdata/cstore_fdw

A columnar storage extension for Postgres built as a foreign data wrapper.

+3
+0.2%
1.8K
total stars
#845
variety/variety

A MongoDB schema analysis tool that helps developers understand and optimize their NoSQL database.

+3
+0.2%
1.8K
total stars
#846
aergoio/litetree

SQLite with Branches - a lightweight, embedded database with version control capabilities.

+3
+0.2%
1.6K
total stars
#847
filodb/FiloDB

A distributed, scalable Prometheus-compatible time series database written in Scala.

+3
+0.2%
1.5K
total stars
#848
lukasmartinelli/pgfutter

A tool to easily import CSV and JSON data into PostgreSQL databases.

+3
+0.2%
1.3K
total stars
#849
matplotlib/AnatomyOfMatplotlib

Anatomy of Matplotlib tutorial for SciPy conference, focused on data visualization for scientific computing.

+3
+0.2%
1.2K
total stars
#850
golang/leveldb

The LevelDB key-value database in the Go programming language.

+3
+0.3%
1.2K
total stars
1...1618

Stay in the loop

Get weekly updates on trending AI coding tools and projects.