Trending Projects

Discover the fastest growing open source projects

Showing 151-200 of 897 trending projects

#151
jupyter/docker-stacks

Docker images containing Jupyter applications for data science and machine learning workflows.

0
0.0%
8.4K
total stars
#152
igorbarinov/awesome-data-engineering

A curated list of data engineering tools for software developers, not focused on AI coding tools.

0
0.0%
8.3K
total stars
#153
pentaho/pentaho-kettle

Pentaho Data Integration (ETL) is a Java-based tool for building data integration and ETL pipelines.

0
0.0%
8.3K
total stars
#154
groue/GRDB.swift

A toolkit for SQLite databases, focused on application development with a Swift-based API.

0
0.0%
8.3K
total stars
#155
jackzhenguo/python-small-examples

A collection of Python code examples and tutorials for data science, machine learning, and web development.

0
0.0%
8.1K
total stars
#156
allegro/bigcache

Efficient in-memory cache in Go for storing and retrieving large amounts of data.

0
0.0%
8.1K
total stars
#157
rxin/db-readings

This is a collection of readings and resources related to databases, not a vibe coder platform.

0
0.0%
8.0K
total stars
#158
ijl/orjson

Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy

0
0.0%
7.9K
total stars
#159
microsoft/azuredatastudio

Azure Data Studio is a data management and development tool with connectivity to popular cloud and on-premises databases.

0
0.0%
7.7K
total stars
#160
PostgresApp/PostgresApp

An open-source PostgreSQL client application for macOS, providing an easy way to set up and manage a local PostgreSQL database.

0
0.0%
7.7K
total stars
#161
msiemens/tinydb

A lightweight, document-oriented database optimized for happiness, used as a Python library or CLI.

0
0.0%
7.5K
total stars
#162
attic-labs/noms

The versioned, forkable, syncable database for developers who need a scalable, distributed data solution.

0
0.0%
7.4K
total stars
#163
MariaDB/server

Open-source relational database management system (RDBMS) for building data-driven applications.

0
0.0%
7.3K
total stars
#164
AlaSQL/alasql

AlaSQL is a JavaScript SQL database for browser and Node.js that handles both relational tables and nested JSON data.

0
0.0%
7.3K
total stars
#165
kennethreitz/records

Records is a Python SQL library that makes interacting with databases more intuitive and human-friendly.

0
0.0%
7.2K
total stars
#166
erikgrinaker/toydb

An educational distributed SQL database written in Rust, not focused on AI coding tools.

0
0.0%
7.2K
total stars
#167
liuhuanyong/QASystemOnMedicalKG

A tutorial and implementation of a disease-centered medical knowledge graph and QA system.

0
0.0%
7.2K
total stars
#168
Rockyzsu/stock

A Python library for quantitative trading and stock analysis.

0
0.0%
7.2K
total stars
#169
google/draco

Draco is a C++ library for compressing and decompressing 3D geometric meshes and point clouds.

0
0.0%
7.2K
total stars
#170
Alluxio/alluxio

Alluxio is an open-source data orchestration platform for analytics and machine learning workloads in the cloud.

0
0.0%
7.2K
total stars
#171
JerBouma/FinanceDatabase

This is a comprehensive financial database with 300,000+ symbols including equities, currencies, and cryptocurrencies.

0
0.0%
7.2K
total stars
#172
lijin-THU/notes-python

A comprehensive set of Python notes and resources for developers, covering a wide range of topics including data science, machine learning, and scientific computing.

0
0.0%
7.1K
total stars
#173
jvns/pandas-cookbook

Pandas Cookbook is a collection of recipes for using Python's powerful data analysis library, Pandas.

0
0.0%
7.0K
total stars
#174
snowplow/snowplow

A powerful customer data pipeline for collecting, processing, and analyzing user events and behavior.

0
0.0%
7.0K
total stars
#175
aarondl/sqlboiler

SQLBoiler is a Go ORM that generates code tailored to your database schema, making it easy to interact with databases.

0
0.0%
7.0K
total stars
#176
apache/couchdb

An open-source, scalable, and fault-tolerant NoSQL database with a focus on reliability and offline-first design.

0
0.0%
6.8K
total stars
#177
ranaroussi/quantstats

Portfolio analytics library for quantitative finance, built with Python

0
0.0%
6.8K
total stars
#178
dbgate/dbgate

Database manager for multiple database engines, runs as desktop or web app.

0
0.0%
6.8K
total stars
#179
sqldelight/sqldelight

SQLDelight - Generates type-safe Kotlin APIs from SQL, enabling easier database management in Kotlin projects.

0
0.0%
6.8K
total stars
#180
cantaro86/Financial-Models-Numerical-Methods

A collection of notebooks covering quantitative finance and numerical methods in Python.

0
0.0%
6.7K
total stars
#181
rhiever/Data-Analysis-and-Machine-Learning-Projects

A collection of data analysis and machine learning projects and resources for developers.

0
0.0%
6.6K
total stars
#182
apache/zeppelin

Zeppelin is a web-based notebook that enables data-driven, interactive data analytics and collaborative documents.

0
0.0%
6.6K
total stars
#183
hazelcast/hazelcast

Hazelcast is a high-performance, distributed in-memory data platform for real-time insights and stream processing.

0
0.0%
6.6K
total stars
#184
pawelsalawa/sqlitestudio

A free, open-source SQLite database manager for multiple platforms.

0
0.0%
6.4K
total stars
#185
qinwf/awesome-R

A curated list of awesome R packages, frameworks and software for data analysis and data science.

0
0.0%
6.4K
total stars
#186
xiangyuecn/AreaCity-JsSpider-StatsGov

Comprehensive collection of city and administrative region data for China, with features like CSV export, JS code generation, and web scraping.

0
0.0%
6.4K
total stars
#187
apache/flink-cdc

Flink CDC is a streaming data integration tool that enables real-time data pipelines and change data capture.

0
0.0%
6.4K
total stars
#188
wireservice/csvkit

A suite of utilities for converting to and working with CSV, the king of tabular file formats.

0
0.0%
6.4K
total stars
#189
syndtr/goleveldb

LevelDB key/value database in Go for building high-performance data-intensive applications.

0
0.0%
6.3K
total stars
#190
pachyderm/pachyderm

Pachyderm is a data-centric pipeline and data versioning platform for building and scaling data-intensive applications.

0
0.0%
6.3K
total stars
#191
apache/pinot

Apache Pinot is a realtime distributed OLAP datastore for fast querying of large datasets.

0
0.0%
6.0K
total stars
#192
GreptimeTeam/greptimedb

Open-source, cloud-native, unified observability database for metrics, logs and traces, supporting SQL/PromQL/Streaming.

0
0.0%
6.0K
total stars
#193
apache/hive

Apache Hive is a data warehouse software built on top of Apache Hadoop for querying and managing large datasets.

0
0.0%
6.0K
total stars
#194
beamandrew/medical-data

No description provided for this medical data repository.

0
0.0%
6.0K
total stars
#195
niderhoff/nlp-datasets

A curated list of free/public domain text datasets for natural language processing (NLP) tasks.

0
0.0%
6.0K
total stars
#196
OSGeo/gdal

GDAL is an open-source library for working with various geospatial data formats, useful for remote sensing and GIS applications.

0
0.0%
5.8K
total stars
#197
DotNetNext/SqlSugar

A powerful, multi-database ORM for .NET that supports a wide range of SQL databases and provides a seamless data access layer.

0
0.0%
5.8K
total stars
#198
alibaba/AliSQL

AliSQL is a MySQL branch originated from Alibaba Group, focused on high performance and scalability.

0
0.0%
5.8K
total stars
#199
kurrent-io/KurrentDB

KurrentDB is an event-native database designed for modern software and event-driven architectures.

0
0.0%
5.7K
total stars
#200
tonsky/datascript

Immutable database and Datalog query engine for Clojure, ClojureScript and JS

0
0.0%
5.7K
total stars
1...35...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.