Trending Projects

Discover the fastest growing open source projects

Showing 1-50 of 897 trending projects

#1🥇
alibaba/zvec

Lightning-fast in-process vector DB for RAG & semantic search in C++

+3.9K
stars (+81.9%)
8.7K total
C++
#2🥈
drivendataorg/cookiecutter-data-science

A flexible and standardized cookiecutter template for doing and sharing data science work in Python.

+3.9K
stars (+66.5%)
9.7K total
Python
#3🥉
doctrine/dbal

A PHP database abstraction layer that provides a simple, consistent API for interacting with different database systems.

+3.4K
stars (+52.8%)
9.7K total
PHP
#4
milvus-io/milvus

High-performance vector database for AI apps

+3.1K
+7.8%
43.1K
total stars
#5
drawdb-io/drawdb

Database diagram editor and SQL generator

+2.5K
+7.3%
36.8K
total stars
#6
tursodatabase/turso

Turso is an in-process SQL database, compatible with SQLite, written in Rust for high performance.

+2.5K
+16.3%
17.7K
total stars
#7
akfamily/akshare

AKShare is a simple and elegant Python library for accessing financial data APIs.

+2.2K
+14.9%
16.8K
total stars
#8
qdrant/qdrant

Vector database for AI applications

+2.1K
+7.8%
29.3K
total stars
#9
duckdb/duckdb

High-performance analytical in-process SQL database for developers

+2.0K
+5.7%
36.5K
total stars
#10
ClickHouse/ClickHouse

Real-time analytics database for generating data reports

+2.0K
+4.4%
46.2K
total stars
#11
drizzle-team/drizzle-orm

TypeScript ORM for Node.js, Bun, Deno, and serverless environments

+1.7K
+5.3%
33.1K
total stars
#12
apache/superset

A modern, enterprise-ready business intelligence web application for data visualization and exploration.

+1.7K
+2.4%
70.8K
total stars
#13
PyPortfolio/PyPortfolioOpt

A Python library for financial portfolio optimization, including classical efficient frontier and advanced techniques.

+1.6K
+41.3%
5.5K
total stars
#14
redis/redis

Redis is a fast, in-memory data structure server used for caching, real-time applications, and more.

+1.6K
+2.2%
73.3K
total stars
#15
metabase/metabase

Open-source BI tool for data analysis and visualization

+1.5K
+3.4%
46.3K
total stars
#16
elastic/elasticsearch

Distributed, RESTful search engine for developers

+1.5K
+2.0%
76.3K
total stars
#17
pgvector/pgvector

Vector similarity search for Postgres

+1.4K
+7.7%
20.1K
total stars
#18
chartdb/chartdb

Web-based database diagramming editor with AI-powered export and schema import

+1.4K
+7.1%
21.4K
total stars
#19
apache/airflow

Apache Airflow for workflow orchestration

+1.4K
+3.2%
44.5K
total stars
#20
pola-rs/polars

Fast DataFrame query engine in Rust with Python/Rust/Node.js/R frontends

+1.4K
+3.7%
37.6K
total stars
#21
JerBouma/FinanceDatabase

A comprehensive financial database with 300,000+ symbols, including equities, currencies, and cryptocurrencies.

+1.2K
+20.9%
7.2K
total stars
#22
valkey-io/valkey

High-performance key-value database for caching and real-time workloads

+1.2K
+5.1%
25.0K
total stars
#23
dolthub/dolt

Dolt is Git for Data, enabling version control for SQL databases with Git-like commands and features.

+1.2K
+6.2%
20.5K
total stars
#24
postgres/postgres

PostgreSQL database source code

+1.1K
+5.8%
20.2K
total stars
#25
timescale/timescaledb

Time-series database for real-time analytics as a PostgreSQL extension

+1.1K
+5.2%
22.0K
total stars
#26
mukunku/ParquetViewer

A simple Windows desktop app for viewing and querying Apache Parquet files, a popular big data format.

+1.1K
+1793.2%
1.1K
total stars
#27
marcboeker/go-duckdb

A Go database/sql driver for the DuckDB database engine, enabling fast and efficient data processing.

+1.0K
+1336.0%
1.1K
total stars
#28
prisma/prisma

Next-gen ORM for Node.js/TypeScript with multiple database support

+999
+2.3%
45.5K
total stars
#29
alibaba/AliSQL

AliSQL is a MySQL branch that originated at Alibaba Group, focused on high performance and scalability.

+994
+20.9%
5.8K
total stars
#30
pandas-dev/pandas

Core data analysis library for Python with labeled data structures and statistical functions

+974
+2.1%
48.1K
total stars
#31
dataease/dataease

Open-source BI tool for data visualization and analysis

+942
+4.2%
23.5K
total stars
#32
efficient/cuckoofilter

A space-efficient C++ implementation of the Cuckoo filter, a probabilistic data structure for set membership testing.

+942
+1427.3%
1.0K
total stars
#33
SciRuby/daru

A Ruby library for data analysis and manipulation, aimed at data scientists and developers.

+940
+776.9%
1.1K
total stars
#34
beekeeper-studio/beekeeper-studio

Modern SQL client for multiple databases

+893
+4.2%
22.1K
total stars
#35
allenai/s2orc

A large-scale open-access corpus of scientific papers and metadata for researchers and developers.

+878
+627.1%
1.0K
total stars
#36
apache/celeborn

Apache Celeborn is a high-performance shuffle and spilled data service for big data applications.

+872
+522.2%
1.0K
total stars
#37
apache/kafka

Distributed event streaming platform for data pipelines and real-time apps

+819
+2.6%
32.1K
total stars
#38
PrefectHQ/prefect

Workflow orchestration for resilient data pipelines in Python

+814
+3.9%
21.8K
total stars
#39
facebookresearch/cc_net

Tools to download and clean up Common Crawl data, a large web crawl dataset, for further analysis and processing.

+801
+338.0%
1.0K
total stars
#40
etcd-io/etcd

Distributed key-value store for critical distributed system data

+770
+1.5%
51.6K
total stars
#41
rxin/db-readings

A collection of readings and resources related to databases.

+766
+10.6%
8.0K
total stars
#42
fluvio-community/fluvio

Fluvio is an event stream processing engine for developers to build responsive data-intensive apps.

+760
+17.2%
5.2K
total stars
#43
databricks/spark-csv

CSV Data Source for Apache Spark 1.x, a Scala library for working with structured data.

+759
+253.8%
1.1K
total stars
#44
CJ-Chen/TBtools-II

A powerful GUI/CLI tool for biologists to work with NGS data.

+759
+279.0%
1.0K
total stars
#45
open-metadata/OpenMetadata

A unified metadata platform for data discovery, data observability, and data governance.

+748
+9.2%
8.8K
total stars
#46
dragonflydb/dragonfly

Modern in-memory key-value store for caching and data management

+737
+2.5%
30.1K
total stars
#47
realm/realm-java

Realm is a mobile database that serves as a replacement for SQLite and ORMs.

+731
+6.8%
11.5K
total stars
#48
numpy/numpy

Fundamental package for scientific computing with Python

+723
+2.3%
31.6K
total stars
#49
airbytehq/airbyte

Data integration platform for ELT pipelines from APIs, databases & files to databases, warehouses & lakes

+708
+3.5%
20.8K
total stars
#50
apache/spark

Unified analytics engine for large-scale data processing

+704
+1.7%
42.9K
total stars