Trending Projects

Discover the fastest growing open source projects

Showing 151-200 of 897 trending projects

#151
youssefHosni/Data-Science-Interview-Questions-Answers

A curated list of data science interview questions and answers for developers.

+314
+6.0%
5.5K
total stars
#152
jeremyevans/sequel

Sequel is a Ruby library that provides a powerful and flexible object-relational mapping (ORM) for databases.

+314
+6.6%
5.1K
total stars
#153
ApsaraDB/PolarDB-for-PostgreSQL

A cloud-native PostgreSQL database developed by Alibaba Cloud for high-performance, scalable data storage and management.

+312
+11.0%
3.1K
total stars
#154
argoproj/argo-workflows

Argo Workflows is a powerful open-source workflow engine for Kubernetes, enabling complex data processing and machine learning pipelines.

+311
+1.9%
16.5K
total stars
#155
mysql/mysql-connector-j

MySQL Connector/J is a JDBC driver that enables Java applications to connect to MySQL databases.

+311
+44.3%
1.0K
total stars
#156
redis/go-redis

Redis client for Go with support for Redis 8.0+

+309
+1.4%
22.0K
total stars
#157
great-expectations/great_expectations

A Python library that helps ensure data quality and reliability through data profiling and testing.

+301
+2.8%
11.2K
total stars
#158
ijl/orjson

Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy

+301
+3.9%
7.9K
total stars
#159
deanmalmgren/textract

A Python library that provides a simple and unified interface for extracting text from any document format.

+301
+7.2%
4.5K
total stars
#160
devrimgunduz/pagila

A PostgreSQL sample database for testing and learning SQL queries.

+297
+40.5%
1.0K
total stars
#161
dexie/Dexie.js

Dexie.js is a minimalistic IndexedDB wrapper that simplifies offline storage and database management in web applications.

+296
+2.1%
14.1K
total stars
#162
dtinit/data-transfer-project

The Data Transfer Project enables direct transfer of user data between online service providers.

+296
+8.9%
3.6K
total stars
#163
groue/GRDB.swift

A toolkit for SQLite databases, focused on application development with a Swift-based API.

+295
+3.7%
8.3K
total stars
#164
quantopian/qgrid

An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks

+294
+10.5%
3.1K
total stars
#165
oceanbase/oceanbase

A fast, scalable, and distributed database for transactional, analytical, and AI workloads.

+292
+3.0%
10.0K
total stars
#166
scylladb/scylladb

A high-performance NoSQL data store compatible with Apache Cassandra and Amazon DynamoDB.

+290
+1.9%
15.4K
total stars
#167
probberechts/soccerdata

A Python library for scraping soccer data from various sources for sports analytics and data science.

+288
+22.1%
1.6K
total stars
#168
citusdata/citus

Citus is a distributed PostgreSQL database that enables scaling out your Postgres database across multiple nodes.

+286
+2.4%
12.3K
total stars
#169
dunwu/db-tutorial

An in-depth tutorial covering mainstream database knowledge for backend developers.

+286
+5.7%
5.3K
total stars
#170
memgraph/memgraph

Open-source graph database optimized for dynamic analytics and streaming data environments.

+283
+8.1%
3.8K
total stars
#171
FavioVazquez/ds-cheatsheets

A comprehensive collection of data science cheatsheets for developers and data scientists.

+277
+1.7%
16.2K
total stars
#172
thinkaurelius/titan

Titan is a distributed graph database that can be used for building large-scale data-intensive applications.

+277
+5.6%
5.2K
total stars
#173
IQSS/dataverse

Open source research data repository software built with Java.

+272
+36.1%
1.0K
total stars
#174
apple/foundationdb

FoundationDB is an open-source, distributed, transactional key-value store that provides ACID guarantees.

+271
+1.7%
16.2K
total stars
#175
grantjenks/python-sortedcontainers

A Python library that provides efficient, Pythonic data structures for sorted lists, dictionaries, and sets.

+271
+7.4%
3.9K
total stars
#176
pawelsalawa/sqlitestudio

A free, open-source SQLite database manager for multiple platforms.

+269
+4.3%
6.4K
total stars
#177
GreptimeTeam/greptimedb

Open-source, cloud-native, unified observability database for metrics, logs and traces, supporting SQL/PromQL/Streaming.

+269
+4.7%
6.0K
total stars
#178
caj2pdf/caj2pdf

A Python tool to convert CAJ (China Academic Journals) files to PDF for developers who work with academic literature.

+269
+9.2%
3.2K
total stars
#179
taosdata/TDengine

High-performance time-series database for IoT and IIoT

+265
+1.1%
24.8K
total stars
#180
JoshClose/CsvHelper

A C# library for reading and writing CSV files, with support for a wide range of CSV file formats.

+265
+5.4%
5.2K
total stars
#181
isar/isar

Extremely fast, easy to use, and fully async NoSQL database for Flutter apps

+265
+7.1%
4.0K
total stars
#182
simonw/datasette

An open-source multi-tool for exploring and publishing data, focused on simplifying data analysis and sharing.

+263
+2.5%
10.8K
total stars
#183
man-group/arctic

A high-performance datastore for time series and tick data built on top of MongoDB.

+263
+9.3%
3.1K
total stars
#184
dbgate/dbgate

Database manager for multiple database engines, runs as desktop or web app.

+262
+4.0%
6.8K
total stars
#185
neozhaoliang/pywonderland

A Python library that provides a tour of the wonderland of math with visualizations and algorithms.

+260
+6.5%
4.2K
total stars
#186
mourner/rbush

RBush is a high-performance JavaScript R-tree-based 2D spatial index for points and rectangles.

+258
+10.5%
2.7K
total stars
#187
pubkey/rxdb

Reactive, local-first database for JavaScript apps with real-time sync and flexible storage

+252
+1.1%
23.1K
total stars
#188
Rockyzsu/stock

A Python library for quantitative trading and stock analysis.

+251
+3.6%
7.2K
total stars
#189
pingcap/awesome-database-learning

A comprehensive list of learning materials to help developers understand database internals.

+247
+2.4%
10.7K
total stars
#190
LAStools/LAStools

This repository contains efficient tools for LiDAR processing, focused on working with point cloud data.

+247
+31.2%
1.0K
total stars
#191
blaze/odo

A Python library for data migration and transformation in the Blaze project.

+246
+32.4%
1.0K
total stars
#192
jorgerojas26/lazysql

A cross-platform TUI database management tool written in Go for developers working with databases.

+244
+7.4%
3.5K
total stars
#193
andkret/Cookbook

A comprehensive cookbook for data engineers, covering best practices, big data, and data engineering concepts.

+242
+1.6%
15.0K
total stars
#194
beamandrew/medical-data

No description provided for this medical data repository.

+242
+4.2%
6.0K
total stars
#195
dathere/qsv

Blazing-fast data wrangling toolkit for AI and data engineering workflows

+242
+7.4%
3.5K
total stars
#196
apache/hugegraph

A highly scalable, high-performance graph database that supports over 100 billion data points.

+242
+8.9%
3.0K
total stars
#197
RedisTimeSeries/RedisTimeSeries

A Redis module that provides a time series data structure for storing and querying time series data.

+242
+29.4%
1.1K
total stars
#198
rqlite/rqlite

A lightweight, fault-tolerant distributed database built on SQLite, designed for high availability.

+239
+1.4%
17.3K
total stars
#199
mathesar-foundation/mathesar

An open-source, self-hosted database management tool with a spreadsheet-like interface for Postgres

+239
+5.2%
4.9K
total stars
#200
duckdb/ducklake

DuckLake is an integrated data lake and catalog format written in C++.

+237
+10.4%
2.5K
total stars
1...35...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.