Trending Projects

Discover the fastest growing open source projects

Showing 701-750 of 897 trending projects

#701
TeoMeWhy/teomerefs

A comprehensive guide to technical references for data careers, including Python, machine learning, and data science.

0
0.0%
1.3K
total stars
#702
CliMA/Oceananigans.jl

A fast, flexible, ocean-flavored fluid dynamics library for climate and ocean modeling on CPUs and GPUs.

0
0.0%
1.3K
total stars
#703
wesm/msgvault

Archive, search, and analyze your entire email/chat history offline with DuckDB-powered analytics and AI queries.

0
0.0%
1.3K
total stars
#704
iskandr/fancyimpute

A Python library providing multivariate imputation and matrix completion algorithms.

0
0.0%
1.3K
total stars
#705
mahmoudparsian/pyspark-tutorial

PySpark-Tutorial provides basic algorithms using PySpark for big data analytics and data processing.

0
0.0%
1.3K
total stars
#706
microsoft/Trill

Trill is a single-node query processor for temporal or streaming data.

0
0.0%
1.3K
total stars
#707
apache/impala

Apache Impala is a high-performance, open-source, SQL query engine that runs on Apache Hadoop and Apache Kudu.

0
0.0%
1.3K
total stars
#708
YelpArchive/dataset-examples

Sample datasets for users of the Yelp Academic Dataset, useful for data analysis and machine learning.

0
0.0%
1.3K
total stars
#709
objectbox/objectbox-go

Embedded Go Database, a fast open-source NoSQL database solution for Go projects.

0
0.0%
1.3K
total stars
#710
elixir-explorer/explorer

A fast and elegant data exploration library for Elixir, providing series and dataframes for data science workflows.

0
0.0%
1.3K
total stars
#711
ycjuan/kaggle-2014-criteo

This is a C++ repository for a Kaggle competition in 2014, not a developer discovery platform.

0
0.0%
1.3K
total stars
#712
percona/percona-server

Percona Server is an enhanced, open-source version of the MySQL database management system.

0
0.0%
1.3K
total stars
#713
JetBrains/xodus

Xodus is a transactional, schema-less embedded database used by JetBrains products like YouTrack and Hub.

0
0.0%
1.3K
total stars
#714
uwdata/mosaic

An extensible framework for linking databases and interactive views, focused on scalability and visualization.

0
0.0%
1.3K
total stars
#715
rsvp/fecon235

Notebooks for financial economics, including analyses of Federal Reserve, GDP, inflation, and more.

0
0.0%
1.3K
total stars
#716
submato/xhscrawl

A web scraping tool for collecting data from Xiaohongshu, Bilibili, and other Chinese social platforms.

0
0.0%
1.3K
total stars
#717
meta-pytorch/data

A PyTorch library for data loading and utility functions shared across PyTorch domain libraries.

0
0.0%
1.2K
total stars
#718
scijs/ndarray

A JavaScript library for working with multidimensional arrays, useful for data visualization and scientific computing.

0
0.0%
1.2K
total stars
#719
jbmusso/awesome-graph

A curated list of resources for graph databases and graph computing tools, useful for developers working with graph-based data.

0
0.0%
1.2K
total stars
#720
duckdb/dbt-duckdb

A dbt adapter for the DuckDB database, enabling developers to build data pipelines and models with dbt.

0
0.0%
1.2K
total stars
#721
nakabonne/tstorage

An embedded time-series database written in Go for storing and querying metrics data.

0
0.0%
1.2K
total stars
#722
JoinQuant/jqdatasdk

A Python package for easy access to financial market data in China for quantitative finance and FinTech applications.

0
0.0%
1.2K
total stars
#723
matplotlib/AnatomyOfMatplotlib

Anatomy of Matplotlib tutorial for SciPy conference, focused on data visualization for scientific computing.

0
0.0%
1.2K
total stars
#724
manami-project/anime-offline-database

This repository provides a comprehensive JSON dataset containing metadata on anime series, movies, and cross-references to various anime sites.

0
0.0%
1.2K
total stars
#725
BlakeRMills/MetBrewer

A color palette package in R inspired by works at the Metropolitan Museum of Art in New York.

0
0.0%
1.2K
total stars
#726
cmu-db/ottertune

An automatic DBMS configuration tool for optimizing database performance.

0
0.0%
1.2K
total stars
#727
Toblerity/Fiona

Fiona is a Python library for reading and writing geographic data files, with support for CLI usage.

0
0.0%
1.2K
total stars
#728
s3ql/s3ql

A full-featured file system for online data storage, built with Python.

0
0.0%
1.2K
total stars
#729
uber-archive/AthenaX

A scalable, SQL-based streaming analytics platform from Uber, built on top of Apache Flink.

0
0.0%
1.2K
total stars
#730
lit26/finvizfinance

A Python library for financial analysis and data scraping from the Finviz platform.

0
0.0%
1.2K
total stars
#731
zhihu/kids

A C++ library for processing data streams, potentially useful for vibe coders working with AI-powered tools.

0
0.0%
1.2K
total stars
#732
sajal2692/data-science-portfolio

A portfolio of data science projects covering machine learning, NLP, and more for personal and academic use.

0
0.0%
1.2K
total stars
#733
marcboeker/gmail-to-sqlite

Index your Gmail account to a SQLite DB and perform custom data analysis on your email.

0
0.0%
1.2K
total stars
#734
pomber/covid19

A public dataset of daily COVID-19 cases and deaths per country, useful for data analysis and visualization.

0
0.0%
1.2K
total stars
#735
wannesm/dtaidistance

A fast C-based implementation of Dynamic Time Warping, a popular algorithm for comparing time series data.

0
0.0%
1.2K
total stars
#736
yhat/db.py

db.py is a Python library that provides an easier way to interact with your databases.

0
0.0%
1.2K
total stars
#737
datacrypt-project/hitchhiker-tree

A high-performance, persistent, off-heap data structure written in Clojure for data-intensive applications.

0
0.0%
1.2K
total stars
#738
li6185377/LKDBHelper-SQLite-ORM

An automatic database ORM library for Objective-C that provides thread-safe and deadlock-free database operations.

0
0.0%
1.2K
total stars
#739
kevwan/go-stash

A high-performance, open-source data processing pipeline for ingesting Kafka data and sending it to Elasticsearch.

0
0.0%
1.2K
total stars
#740
andrewgbruce/statistics-for-data-scientists

This repository provides code and data for a book on statistics for data scientists.

0
0.0%
1.2K
total stars
#741
citusdata/postgresql-hll

A PostgreSQL extension that adds HyperLogLog data structures as a native data type.

0
0.0%
1.2K
total stars
#742
kelvins/municipios-brasileiros

A Python library with data related to Brazilian municipalities, including IBGE codes, latitude, longitude, and more.

0
0.0%
1.2K
total stars
#743
calogica/dbt-expectations

A port of Great Expectations to dbt test macros for data testing and validation in data engineering workflows.

0
0.0%
1.2K
total stars
#744
TablePlus/DBngin

DBngin is a free, open-source, cross-platform database management tool for developers.

0
0.0%
1.2K
total stars
#745
lakekeeper/lakekeeper

Lakekeeper is an open-source, secure, and fast Apache Iceberg REST Catalog written in Rust for data lakehouse governance.

0
0.0%
1.2K
total stars
#746
ResidentMario/geoplot

A high-level geospatial data visualization library for Python developers working with spatial data.

0
0.0%
1.2K
total stars
#747
2ndQuadrant/pglogical

A high-performance logical replication extension for PostgreSQL that enables fast, cross-version database replication.

0
0.0%
1.2K
total stars
#748
juliasilge/tidytext

A library for text mining and natural language processing using tidy data principles in R.

0
0.0%
1.2K
total stars
#749
influxdata/influxdb-java

Java client library for connecting to the InfluxDB time series database.

0
0.0%
1.2K
total stars
#750
egbertbouman/youtube-comment-downloader

Simple script for downloading YouTube comments without using the YouTube API.

0
0.0%
1.2K
total stars
1...1416...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.