Trending Projects

Discover the fastest growing open source projects

Showing 1-50 of 897 trending projects

#1🥇
alibaba/zvec

Lightning-fast in-process vector DB for RAG & semantic search in C++

+3.9K
stars (+81.9%)
8.7K total
C++
#2🥈
go-gorm/gorm

GORM is a developer-friendly ORM library for Golang, offering features like associations, hooks, and auto migrations.

+2.1K
stars (+5.5%)
39.7K total
Go
#3🥉
rethinkdb/rethinkdb

Realtime NoSQL database for web apps

+2.0K
stars (+8.0%)
27.0K total
C++
#4
SheetJS/sheetjs

SheetJS Spreadsheet Data Toolkit for data extraction and spreadsheet generation.

+1.9K
+5.4%
36.2K
total stars
#5
redis/RedisDesktopManager

Redis GUI client joining forces with Redis to enhance developer experience

+1.9K
+8.7%
23.2K
total stars
#6
tursodatabase/turso

Turso is an in-process SQL database, compatible with SQLite, written in Rust for high performance.

+1.7K
+10.8%
17.7K
total stars
#7
drivendataorg/cookiecutter-data-science

A flexible and standardized cookiecutter template for doing and sharing data science work in Python.

+1.6K
+20.1%
9.7K
total stars
#8
akfamily/akshare

AKShare is a simple and elegant Python library for accessing financial data APIs.

+1.6K
+10.6%
16.8K
total stars
#9
doctrine/dbal

A PHP database abstraction layer that provides a simple, consistent API for interacting with different database systems.

+1.5K
+18.4%
9.7K
total stars
#10
drawdb-io/drawdb

Database diagram editor and SQL generator

+1.5K
+4.1%
36.8K
total stars
#11
typesense/typesense

Fast, typo-tolerant search engine for building delightful search experiences

+1.4K
+5.7%
25.3K
total stars
#12
qdrant/qdrant

Vector database for AI applications

+1.3K
+4.8%
29.3K
total stars
#13
milvus-io/milvus

High-performance vector database for AI apps

+1.2K
+2.8%
43.1K
total stars
#14
ClickHouse/ClickHouse

Real-time analytics database for generating data reports

+1.2K
+2.6%
46.2K
total stars
#15
duckdb/duckdb

High-performance analytical in-process SQL database for developers

+1.1K
+3.2%
36.5K
total stars
#16
drizzle-team/drizzle-orm

TypeScript ORM for Node.js, Bun, Deno, and serverless environments

+1.0K
+3.2%
33.1K
total stars
#17
dolthub/dolt

Dolt is Git for Data, enabling version control for SQL databases with Git-like commands and features.

+1.0K
+5.3%
20.5K
total stars
#18
Tencent/wcdb

WCDB is a cross-platform database framework developed by WeChat for Android, iOS, Linux, macOS, and Windows.

+1.0K
+9.6%
11.7K
total stars
#19
great-expectations/great_expectations

A Python library that helps ensure data quality and reliability through data profiling and testing.

+1.0K
+10.0%
11.2K
total stars
#20
apache/superset

A modern, enterprise-ready business intelligence web application for data visualization and exploration.

+1.0K
+1.4%
70.8K
total stars
#21
metabase/metabase

Open-source BI tool for data analysis and visualization

+996
+2.2%
46.3K
total stars
#22
redis/redis

Redis is a fast, in-memory data structure server used for caching, real-time applications, and more.

+993
+1.4%
73.3K
total stars
#23
pgvector/pgvector

Vector similarity search for Postgres

+895
+4.7%
20.1K
total stars
#24
valeriansaliou/sonic

Fast, lightweight search backend alternative to Elasticsearch

+866
+4.3%
21.2K
total stars
#25
pola-rs/polars

Fast DataFrame query engine in Rust with Python/Rust/Node.js/R frontends

+849
+2.3%
37.6K
total stars
#26
valkey-io/valkey

High-performance key-value database for caching and real-time workloads

+803
+3.3%
25.0K
total stars
#27
vaexio/vaex

A high-performance Python library for working with large tabular datasets, offering efficient data manipulation and visualization.

+795
+10.3%
8.5K
total stars
#28
PRQL/prql

PRQL is a modern, powerful, and pipelined SQL replacement for transforming data.

+793
+8.0%
10.7K
total stars
#29
alibaba/AliSQL

AliSQL is a MySQL branch originated from Alibaba Group, focused on high performance and scalability.

+785
+15.8%
5.8K
total stars
#30
apache/airflow

Apache Airflow for workflow orchestration

+772
+1.8%
44.5K
total stars
#31
mukunku/ParquetViewer

A simple Windows desktop app for viewing and querying Apache Parquet files, a popular big data format.

+750
+204.4%
1.1K
total stars
#32
postgres/postgres

PostgreSQL database source code

+717
+3.7%
20.2K
total stars
#33
timescale/timescaledb

Time-series database for real-time analytics as a PostgreSQL extension

+701
+3.3%
22.0K
total stars
#34
chartdb/chartdb

Web-based database diagramming editor with AI-powered export and schema import

+671
+3.2%
21.4K
total stars
#35
marcboeker/go-duckdb

A Go database/sql driver for the DuckDB database engine, enabling fast and efficient data processing.

+669
+164.0%
1.1K
total stars
#36
sqldef/sqldef

Idempotent schema management tool for MySQL, PostgreSQL, SQLite, and SQL Server databases.

+661
+28.1%
3.0K
total stars
#37
efficient/cuckoofilter

A space-efficient C++ implementation of the Cuckoo filter, a probabilistic data structure for set membership testing.

+644
+176.9%
1.0K
total stars
#38
cantaro86/Financial-Models-Numerical-Methods

A collection of notebooks covering quantitative finance and numerical methods in Python.

+629
+10.3%
6.7K
total stars
#39
pachyderm/pachyderm

Pachyderm is a data-centric pipeline and data versioning platform for building and scaling data-intensive applications.

+615
+10.8%
6.3K
total stars
#40
JerBouma/FinanceDatabase

This is a comprehensive financial database with 300,000+ symbols including equities, currencies, and cryptocurrencies.

+604
+9.2%
7.2K
total stars
#41
pandas-dev/pandas

Core data analysis library for Python with labeled data structures and statistical functions

+603
+1.3%
48.1K
total stars
#42
zhisheng17/flink-learning

This is a comprehensive learning resource for the Flink stream processing framework, covering concepts, principles, and real-world use cases.

+598
+4.1%
15.1K
total stars
#43
opendataloader-project/opendataloader-pdf

Fast local PDF-to-Markdown/JSON converter for RAG pipelines. No GPU needed.

+591
+47.5%
1.8K
total stars
#44
SciRuby/daru

SciRuby/daru is a Ruby library for data analysis and manipulation, useful for data scientists and developers working with data.

+583
+122.0%
1.1K
total stars
#45
dataease/dataease

Open-source BI tool for data visualization and analysis

+571
+2.5%
23.5K
total stars
#46
allenai/s2orc

A large-scale open-access corpus of scientific papers and metadata for researchers and developers.

+530
+108.6%
1.0K
total stars
#47
elastic/elasticsearch

Distributed, RESTful search engine for developers

+529
+0.7%
76.3K
total stars
#48
prisma/prisma

Next-gen ORM for Node.js/TypeScript with multiple database support

+527
+1.2%
45.5K
total stars
#49
lk-geimfari/mimesis

Mimesis is a fast Python library for generating fake data in multiple languages for testing and development purposes.

+521
+12.2%
4.8K
total stars
#50
apache/celeborn

Apache Celeborn is a high-performance shuffle and spilled data service for big data applications.

+519
+99.8%
1.0K
total stars
2...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.