Trending Projects

Discover the fastest growing open source projects

Showing 701-750 of 897 trending projects

#701
mymarilyn/clickhouse-driver

A Python driver for the ClickHouse database with native interface support.

0
0.0%
1.3K
total stars
#702
LuxCoreRender/LuxCore

LuxCore is a high-performance path-tracing render engine for realistic 3D graphics and visualization.

0
0.0%
1.3K
total stars
#703
eleanorlutz/asteroids_atlas_of_space

This is an astronomy visualization project that maps orbits of asteroids in the solar system.

0
0.0%
1.3K
total stars
#704
orlp/slotmap

A Rust data structure for efficiently storing and accessing data in a sparse set.

0
0.0%
1.3K
total stars
#705
supermarin/ObjectiveRecord

ActiveRecord-like API for CoreData, a powerful object-relational mapping (ORM) for iOS development.

0
0.0%
1.3K
total stars
#706
ifsnop/mysqldump-php

A PHP library that provides a MySQL backup functionality, similar to the mysqldump CLI tool.

0
0.0%
1.3K
total stars
#707
XTXMarkets/ternfs

An exabyte-scale, multi-region distributed file system for developers building AI-powered applications.

0
0.0%
1.3K
total stars
#708
mining/mining

A Python library for building business intelligence (BI) and OLAP solutions.

0
0.0%
1.3K
total stars
#709
pyexcel/pyexcel

A Python library for reading, manipulating, and writing data in various spreadsheet file formats.

0
0.0%
1.3K
total stars
#710
datavane/tis

A Java-based framework for building agile DataOps pipelines using tools like Flink, DataX, and Chunjun with a web UI.

0
0.0%
1.3K
total stars
#711
GoogleCloudPlatform/bigquery-utils

Useful scripts, UDFs, views, and other utilities for migration and data warehouse operations in BigQuery.

0
0.0%
1.3K
total stars
#712
TeoMeWhy/teomerefs

A comprehensive guide to technical references for data careers, including Python, machine learning, and data science.

0
0.0%
1.3K
total stars
#713
mahmoudparsian/pyspark-tutorial

PySpark-Tutorial provides basic algorithms using PySpark for big data analytics and data processing.

0
0.0%
1.3K
total stars
#714
microsoft/Trill

Trill is a single-node query processor for temporal or streaming data.

0
0.0%
1.3K
total stars
#715
apache/impala

Apache Impala is a high-performance, open-source, SQL query engine that runs on Apache Hadoop and Apache Kudu.

0
0.0%
1.3K
total stars
#716
YelpArchive/dataset-examples

Sample datasets for users of the Yelp Academic Dataset, useful for data analysis and machine learning.

0
0.0%
1.3K
total stars
#717
ycjuan/kaggle-2014-criteo

This is a C++ repository for a Kaggle competition in 2014, not a developer discovery platform.

0
0.0%
1.3K
total stars
#718
objectbox/objectbox-go

Embedded Go Database, a fast open-source NoSQL database solution for Go projects.

0
0.0%
1.3K
total stars
#719
percona/percona-server

Percona Server is an enhanced, open-source version of the MySQL database management system.

0
0.0%
1.3K
total stars
#720
JetBrains/xodus

Xodus is a transactional, schema-less embedded database used by JetBrains products like YouTrack and Hub.

0
0.0%
1.3K
total stars
#721
scijs/ndarray

A JavaScript library for working with multidimensional arrays, useful for data visualization and scientific computing.

0
0.0%
1.2K
total stars
#722
jbmusso/awesome-graph

A curated list of resources for graph databases and graph computing tools, useful for developers working with graph-based data.

0
0.0%
1.2K
total stars
#723
nakabonne/tstorage

An embedded time-series database written in Go for storing and querying metrics data.

0
0.0%
1.2K
total stars
#724
matplotlib/AnatomyOfMatplotlib

Anatomy of Matplotlib tutorial for SciPy conference, focused on data visualization for scientific computing.

0
0.0%
1.2K
total stars
#725
BlakeRMills/MetBrewer

A color palette package in R inspired by works at the Metropolitan Museum of Art in New York.

0
0.0%
1.2K
total stars
#726
manami-project/anime-offline-database

This repository provides a comprehensive JSON dataset containing metadata on anime series, movies, and cross-references to various anime sites.

0
0.0%
1.2K
total stars
#727
cmu-db/ottertune

An automatic DBMS configuration tool for optimizing database performance.

0
0.0%
1.2K
total stars
#728
Toblerity/Fiona

Fiona is a Python library for reading and writing geographic data files, with support for CLI usage.

0
0.0%
1.2K
total stars
#729
uber-archive/AthenaX

A scalable, SQL-based streaming analytics platform from Uber, built on top of Apache Flink.

0
0.0%
1.2K
total stars
#730
zhihu/kids

A C++ library for processing data streams, potentially useful for vibe coders working with AI-powered tools.

0
0.0%
1.2K
total stars
#731
sajal2692/data-science-portfolio

A portfolio of data science projects covering machine learning, NLP, and more for personal and academic use.

0
0.0%
1.2K
total stars
#732
marcboeker/gmail-to-sqlite

Index your Gmail account to a SQLite DB and perform custom data analysis on your email.

0
0.0%
1.2K
total stars
#733
pomber/covid19

A public dataset of daily COVID-19 cases and deaths per country, useful for data analysis and visualization.

0
0.0%
1.2K
total stars
#734
wannesm/dtaidistance

A fast C-based implementation of Dynamic Time Warping, a popular algorithm for comparing time series data.

0
0.0%
1.2K
total stars
#735
yhat/db.py

db.py is a Python library that provides an easier way to interact with your databases.

0
0.0%
1.2K
total stars
#736
datacrypt-project/hitchhiker-tree

A high-performance, persistent, off-heap data structure written in Clojure for data-intensive applications.

0
0.0%
1.2K
total stars
#737
li6185377/LKDBHelper-SQLite-ORM

An automatic database ORM library for Objective-C that provides thread-safe and deadlock-free database operations.

0
0.0%
1.2K
total stars
#738
kevwan/go-stash

A high-performance, open-source data processing pipeline for ingesting Kafka data and sending it to Elasticsearch.

0
0.0%
1.2K
total stars
#739
citusdata/postgresql-hll

A PostgreSQL extension that adds HyperLogLog data structures as a native data type.

0
0.0%
1.2K
total stars
#740
calogica/dbt-expectations

A port of Great Expectations to dbt test macros for data testing and validation in data engineering workflows.

0
0.0%
1.2K
total stars
#741
ResidentMario/geoplot

A high-level geospatial data visualization library for Python developers working with spatial data.

0
0.0%
1.2K
total stars
#742
2ndQuadrant/pglogical

A high-performance logical replication extension for PostgreSQL that enables fast, cross-version database replication.

0
0.0%
1.2K
total stars
#743
juliasilge/tidytext

A library for text mining and natural language processing using tidy data principles in R.

0
0.0%
1.2K
total stars
#744
influxdata/influxdb-java

Java client library for connecting to the InfluxDB time series database.

0
0.0%
1.2K
total stars
#745
egbertbouman/youtube-comment-downloader

Simple script for downloading YouTube comments without using the YouTube API.

0
0.0%
1.2K
total stars
#746
tidwall/btree

A high-performance B-tree implementation for Go, useful for building database-like applications.

0
0.0%
1.2K
total stars
#747
nfstream/nfstream

NFStream is a flexible network data analysis framework for network monitoring, security, and traffic classification.

0
0.0%
1.2K
total stars
#748
sryza/spark-timeseries

A library for time series analysis on Apache Spark, enabling efficient large-scale time series processing.

0
0.0%
1.2K
total stars
#749
wireservice/agate

A Python data analysis library optimized for humans instead of machines.

0
0.0%
1.2K
total stars
#750
marsupialtail/quokka

A scalable, distributed ETL framework for building data lake analytics pipelines.

0
0.0%
1.2K
total stars
1...1416...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.