Trending Projects

Discover the fastest growing open source projects

Showing 451-500 of 897 trending projects

#451
GiovineItalia/Gadfly.jl

Crafty statistical graphics library for the Julia programming language

0
0.0%
1.9K
total stars
#452
broadinstitute/gatk

Official code repository for the Genome Analysis Toolkit (GATK), a bioinformatics library for working with next-generation DNA sequencing data.

0
0.0%
1.9K
total stars
#453
AileenNielsen/TimeSeriesAnalysisWithPython

A Jupyter Notebook repository focused on time series analysis using Python, likely not targeted at vibe coders.

0
0.0%
1.9K
total stars
#454
baidu/tera

An Internet-scale distributed database system built on C++, inspired by Google's Bigtable.

0
0.0%
1.9K
total stars
#455
yhilpisch/py4fi

This is a Python library for financial applications, not a tool for AI-powered vibe coders.

0
0.0%
1.9K
total stars
#456
data-engineering-community/data-engineering-wiki

A community-driven wiki for learning data engineering, covering topics like data modeling, pipelines, and databases.

0
0.0%
1.9K
total stars
#457
apache/kudu

Apache Kudu is a high-performance, open-source columnar storage engine for large datasets in the Apache Hadoop ecosystem.

0
0.0%
1.9K
total stars
#458
fluid-cloudnative/fluid

Fluid is a distributed data abstraction and acceleration framework for Big Data and AI applications on the cloud.

0
0.0%
1.9K
total stars
#459
skfolio/skfolio

A Python library for portfolio optimization using scikit-learn and convex optimization techniques.

0
0.0%
1.9K
total stars
#460
ContextLab/hypertools

A Python toolbox for gaining geometric insights into high-dimensional data, useful for vibe coders working with AI tools.

0
0.0%
1.9K
total stars
#461
h2oai/datatable

A high-performance, memory-efficient Python data analysis library for handling large datasets.

0
0.0%
1.9K
total stars
#462
begeekmyfriend/bplustree

A fast B+ tree indexing structure in C for efficient storage and retrieval of billions of key-value pairs.

0
0.0%
1.9K
total stars
#463
raphaelvallat/pingouin

A Python statistical package based on Pandas, providing various statistical methods and tests.

0
0.0%
1.9K
total stars
#464
yougov/mongo-connector

MongoDB data stream pipeline tools for managing real-time data synchronization and replication.

0
0.0%
1.9K
total stars
#465
johannfaouzi/pyts

A Python package for time series classification, useful for developers working with time-series data.

0
0.0%
1.9K
total stars
#466
matrixorigin/matrixone

Cloud-native, MySQL-compatible, AI-ready database with Git for Data, vector search, and full-text search capabilities.

0
0.0%
1.9K
total stars
#467
neil3d/excel2json

A C# library that converts Excel spreadsheets to JSON objects and saves them to a text file.

0
0.0%
1.9K
total stars
#468
HuaRongSAO/talib-document

A Python library for technical analysis indicators, with Chinese translation and documentation.

0
0.0%
1.9K
total stars
#469
dask/dask-tutorial

An interactive tutorial for the Dask distributed computing library, focused on data analysis and manipulation.

0
0.0%
1.9K
total stars
#470
neo4j-contrib/neo4j-apoc-procedures

A collection of procedures for the Neo4j graph database, providing advanced graph algorithms and utilities.

0
0.0%
1.9K
total stars
#471
x-ream/sqli

A Java ORM SQL query builder that supports popular databases like ClickHouse, Impala, MySQL, and Presto.

0
0.0%
1.9K
total stars
#472
NateScarlet/holiday-cn

A Python tool for automatically scraping data on China's statutory holidays from government announcements.

0
0.0%
1.8K
total stars
#473
cbailes/awesome-deep-trading

A curated list of resources for machine learning-based algorithmic trading and quantitative finance.

0
0.0%
1.8K
total stars
#474
opendataloader-project/opendataloader-pdf

Fast local PDF-to-Markdown/JSON converter for RAG pipelines. No GPU needed.

0
0.0%
1.8K
total stars
#475
plant99/felicette

A Python library for processing and visualizing satellite imagery data.

0
0.0%
1.8K
total stars
#476
eigenteam/eigen-git-mirror

A high-performance C++ linear algebra library focused on solvers, sparse matrices, and numerical computing.

0
0.0%
1.8K
total stars
#477
materialsproject/pymatgen

A robust Python library for materials analysis and computational materials science.

0
0.0%
1.8K
total stars
#478
mwaskom/seaborn-data

This is a data repository for the Seaborn data visualization library in Python.

0
0.0%
1.8K
total stars
#479
edyoda/data-science-complete-tutorial

This repository provides comprehensive tutorials and resources for learning data science and machine learning using Python.

0
0.0%
1.8K
total stars
#480
risinglightdb/risinglight

An educational OLAP database system built in Rust for learning and experimentation.

0
0.0%
1.8K
total stars
#481
feldera/feldera

The Feldera Incremental Computation Engine is a Rust-based library for building real-time data pipelines and materialized views.

0
0.0%
1.8K
total stars
#482
apache/fluss

Apache Fluss is a real-time streaming storage platform built for big data analytics.

0
0.0%
1.8K
total stars
#483
alibaba/MongoShake

MongoShake is a universal data replication platform based on MongoDB's oplog, enabling redundant replication and active-active replication.

0
0.0%
1.8K
total stars
#484
npgsql/efcore.pg

Entity Framework Core provider for PostgreSQL, enabling .NET developers to easily interact with PostgreSQL databases.

0
0.0%
1.8K
total stars
#485
mkazhdan/PoissonRecon

Poisson Surface Reconstruction is a C++ library for reconstructing surfaces from point cloud data.

0
0.0%
1.8K
total stars
#486
jstat/jstat

A JavaScript statistical library that provides a wide range of statistical functions for data analysis.

0
0.0%
1.8K
total stars
#487
Cysharp/MasterMemory

A C# in-memory document database with source generator-based embedded typed readonly data.

0
0.0%
1.8K
total stars
#488
zalando/spilo

Highly available PostgreSQL cluster using Docker, focused on data infrastructure for developers.

0
0.0%
1.8K
total stars
#489
apachecn/python_data_analysis_and_mining_action

This Python repository contains code examples and notes for data analysis and mining.

0
0.0%
1.8K
total stars
#490
xflr6/graphviz

Simple Python interface for Graphviz, a popular open-source data visualization tool.

0
0.0%
1.8K
total stars
#491
gchq/Gaffer

A large-scale entity and relation database supporting aggregation of properties for big data applications.

0
0.0%
1.8K
total stars
#492
RoaringBitmap/CRoaring

Optimized Roaring bitmaps in C and C++ with SIMD (AVX2, AVX-512, NEON) for high-performance data processing.

0
0.0%
1.8K
total stars
#493
citusdata/cstore_fdw

A columnar storage extension for Postgres built as a foreign data wrapper.

0
0.0%
1.8K
total stars
#494
schemacrawler/SchemaCrawler

SchemaCrawler is a free database schema discovery and comprehension tool that supports various database management systems.

0
0.0%
1.8K
total stars
#495
DQinYuan/chinese_province_city_area_mapper

A Python module for extracting and mapping Chinese province, city, and district data.

0
0.0%
1.8K
total stars
#496
tidyverse/tidyverse

A collection of R packages for data science, including tools for data manipulation, visualization, and modeling.

0
0.0%
1.8K
total stars
#497
thbar/kiba

A data processing and ETL (Extract, Transform, Load) framework for Ruby developers.

0
0.0%
1.8K
total stars
#498
geekinglcq/CDCS

A collection of solutions to Chinese data competitions, primarily using Python.

0
0.0%
1.8K
total stars
#499
variety/variety

A MongoDB schema analysis tool that helps developers understand and optimize their NoSQL database.

0
0.0%
1.8K
total stars
#500
crossfilter/crossfilter

Fast n-dimensional filtering and grouping of records, a powerful data manipulation library for JavaScript.

0
0.0%
1.8K
total stars
1...911...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.