Trending Projects

Discover the fastest growing open source projects

Showing 701-750 of 897 trending projects

#701

TeoMeWhy/teomerefs

A comprehensive guide to technical references for data careers, including Python, machine learning, and data science.

0.0%

1.3K

total stars

#702

CliMA/Oceananigans.jl

A fast, flexible, ocean-flavored fluid dynamics library for climate and ocean modeling on CPUs and GPUs.

0.0%

1.3K

total stars

Julia

#703

wesm/msgvault

Archive, search, and analyze your entire email/chat history offline with DuckDB-powered analytics and AI queries.

0.0%

1.3K

total stars

#704

iskandr/fancyimpute

A Python library providing multivariate imputation and matrix completion algorithms.

0.0%

1.3K

total stars

Python

#705

mahmoudparsian/pyspark-tutorial

PySpark-Tutorial provides basic algorithms using PySpark for big data analytics and data processing.

0.0%

1.3K

total stars

Jupyter Notebook

#706

microsoft/Trill

Trill is a single-node query processor for temporal or streaming data.

0.0%

1.3K

total stars

#707

apache/impala

Apache Impala is a high-performance, open-source, SQL query engine that runs on Apache Hadoop and Apache Kudu.

0.0%

1.3K

total stars

C++

#708

YelpArchive/dataset-examples

Sample datasets for users of the Yelp Academic Dataset, useful for data analysis and machine learning.

0.0%

1.3K

total stars

Python

#709

objectbox/objectbox-go

Embedded Go Database, a fast open-source NoSQL database solution for Go projects.

0.0%

1.3K

total stars

#710

elixir-explorer/explorer

A fast and elegant data exploration library for Elixir, providing series and dataframes for data science workflows.

0.0%

1.3K

total stars

Elixir

#711

ycjuan/kaggle-2014-criteo

This is a C++ repository for a Kaggle competition in 2014, not a developer discovery platform.

0.0%

1.3K

total stars

C++

#712

percona/percona-server

Percona Server is an enhanced, open-source version of the MySQL database management system.

0.0%

1.3K

total stars

C++

#713

JetBrains/xodus

Xodus is a transactional, schema-less embedded database used by JetBrains products like YouTrack and Hub.

0.0%

1.3K

total stars

Java

#714

uwdata/mosaic

An extensible framework for linking databases and interactive views, focused on scalability and visualization.

0.0%

1.3K

total stars

TypeScript

#715

rsvp/fecon235

Notebooks for financial economics, including analyses of Federal Reserve, GDP, inflation, and more.

0.0%

1.3K

total stars

Jupyter Notebook

#716

submato/xhscrawl

A web scraping tool for collecting data from Xiaohongshu, Bilibili, and other Chinese social platforms.

0.0%

1.3K

total stars

#717

meta-pytorch/data

A PyTorch library for data loading and utility functions shared across PyTorch domain libraries.

0.0%

1.2K

total stars

Python

#718

scijs/ndarray

A JavaScript library for working with multidimensional arrays, useful for data visualization and scientific computing.

0.0%

1.2K

total stars

JavaScript

#719

jbmusso/awesome-graph

A curated list of resources for graph databases and graph computing tools, useful for developers working with graph-based data.

0.0%

1.2K

total stars

#720

duckdb/dbt-duckdb

A dbt adapter for the DuckDB database, enabling developers to build data pipelines and models with dbt.

0.0%

1.2K

total stars

Python

#721

nakabonne/tstorage

An embedded time-series database written in Go for storing and querying metrics data.

0.0%

1.2K

total stars

#722

JoinQuant/jqdatasdk

A Python package for easy access to financial market data in China for quantitative finance and FinTech applications.

0.0%

1.2K

total stars

Python

#723

matplotlib/AnatomyOfMatplotlib

Anatomy of Matplotlib tutorial for SciPy conference, focused on data visualization for scientific computing.

0.0%

1.2K

total stars

Jupyter Notebook

#724

manami-project/anime-offline-database

This repository provides a comprehensive JSON dataset containing metadata on anime series, movies, and cross-references to various anime sites.

0.0%

1.2K

total stars

Makefile

#725

BlakeRMills/MetBrewer

A color palette package in R inspired by works at the Metropolitan Museum of Art in New York.

0.0%

1.2K

total stars

#726

cmu-db/ottertune

An automatic DBMS configuration tool for optimizing database performance.

0.0%

1.2K

total stars

Python

#727

Toblerity/Fiona

Fiona is a Python library for reading and writing geographic data files, with support for CLI usage.

0.0%

1.2K

total stars

Python

#728

s3ql/s3ql

A full-featured file system for online data storage, built with Python.

0.0%

1.2K

total stars

Python

#729

uber-archive/AthenaX

A scalable, SQL-based streaming analytics platform from Uber, built on top of Apache Flink.

0.0%

1.2K

total stars

Java

#730

lit26/finvizfinance

A Python library for financial analysis and data scraping from the Finviz platform.

0.0%

1.2K

total stars

Jupyter Notebook

#731

zhihu/kids

A C++ library for processing data streams, potentially useful for vibe coders working with AI-powered tools.

0.0%

1.2K

total stars

C++

#732

sajal2692/data-science-portfolio

A portfolio of data science projects covering machine learning, NLP, and more for personal and academic use.

0.0%

1.2K

total stars

Jupyter Notebook

#733

marcboeker/gmail-to-sqlite

Index your Gmail account to a SQLite DB and perform custom data analysis on your email.

0.0%

1.2K

total stars

Python

#734

pomber/covid19

A public dataset of daily COVID-19 cases and deaths per country, useful for data analysis and visualization.

0.0%

1.2K

total stars

JavaScript

#735

wannesm/dtaidistance

A fast C-based implementation of Dynamic Time Warping, a popular algorithm for comparing time series data.

0.0%

1.2K

total stars

Python

#736

yhat/db.py

db.py is a Python library that provides an easier way to interact with your databases.

0.0%

1.2K

total stars

Python

#737

datacrypt-project/hitchhiker-tree

A high-performance, persistent, off-heap data structure written in Clojure for data-intensive applications.

0.0%

1.2K

total stars

Clojure

#738

li6185377/LKDBHelper-SQLite-ORM

An automatic database ORM library for Objective-C that provides thread-safe and deadlock-free database operations.

0.0%

1.2K

total stars

Objective-C

#739

kevwan/go-stash

A high-performance, open-source data processing pipeline for ingesting Kafka data and sending it to Elasticsearch.

0.0%

1.2K

total stars

#740

andrewgbruce/statistics-for-data-scientists

This repository provides code and data for a book on statistics for data scientists.

0.0%

1.2K

total stars

#741

citusdata/postgresql-hll

A PostgreSQL extension that adds HyperLogLog data structures as a native data type.

0.0%

1.2K

total stars

#742

kelvins/municipios-brasileiros

A Python library with data related to Brazilian municipalities, including IBGE codes, latitude, longitude, and more.

0.0%

1.2K

total stars

Python

#743

calogica/dbt-expectations

A port of Great Expectations to dbt test macros for data testing and validation in data engineering workflows.

0.0%

1.2K

total stars

Shell

#744

TablePlus/DBngin

DBngin is a free, open-source, cross-platform database management tool for developers.

0.0%

1.2K

total stars

#745

lakekeeper/lakekeeper

Lakekeeper is an open-source, secure, and fast Apache Iceberg REST Catalog written in Rust for data lakehouse governance.

0.0%

1.2K

total stars

Rust

#746

ResidentMario/geoplot

A high-level geospatial data visualization library for Python developers working with spatial data.

0.0%

1.2K

total stars

Python

#747

2ndQuadrant/pglogical

A high-performance logical replication extension for PostgreSQL that enables fast, cross-version database replication.

0.0%

1.2K

total stars

#748

juliasilge/tidytext

A library for text mining and natural language processing using tidy data principles in R.

0.0%

1.2K

total stars

#749

influxdata/influxdb-java

Java client library for connecting to the InfluxDB time series database.

0.0%

1.2K

total stars

Java

#750

egbertbouman/youtube-comment-downloader

Simple script for downloading YouTube comments without using the YouTube API.

0.0%

1.2K

total stars

Python

1...1416...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.