Trending Projects

Discover the fastest growing open source projects

Showing 751-800 of 897 trending projects

#751
tidwall/btree

A high-performance B-tree implementation for Go, useful for building database-like applications.

0
0.0%
1.2K
total stars
#752
nfstream/nfstream

NFStream is a flexible network data analysis framework for network monitoring, security, and traffic classification.

0
0.0%
1.2K
total stars
#753
sryza/spark-timeseries

A library for time series analysis on Apache Spark, enabling efficient large-scale time series processing.

0
0.0%
1.2K
total stars
#754
wireservice/agate

A Python data analysis library optimized for humans instead of machines.

0
0.0%
1.2K
total stars
#755
marsupialtail/quokka

A scalable, distributed ETL framework for building data lake analytics pipelines.

0
0.0%
1.2K
total stars
#756
apache/cloudberry

Open-source massively parallel processing (MPP) database, an alternative to Greenplum.

0
0.0%
1.2K
total stars
#757
JuliaStats/Distributions.jl

A comprehensive Julia library for probability distributions and related statistical functions.

0
0.0%
1.2K
total stars
#758
DaveSkender/Stock.Indicators

A C# NuGet package that provides technical indicators and trading insights for financial market data analysis.

0
0.0%
1.2K
total stars
#759
bububa/MongoHub-Mac

MongoHub is a native macOS MongoDB client that provides a GUI for managing and interacting with MongoDB databases.

0
0.0%
1.2K
total stars
#760
apachecn/spark-doc-zh

This repository provides the official Apache Spark documentation in Chinese, a popular big data processing framework.

0
0.0%
1.2K
total stars
#761
machow/siuba

Python library for using dplyr-like syntax with pandas and SQL databases

0
0.0%
1.2K
total stars
#762
eventql/eventql

Distributed, massively parallel SQL query engine for big data analytics and timeseries workloads.

0
0.0%
1.2K
total stars
#763
PoloDB/PoloDB

PoloDB is an embedded document database written in Rust for building cross-platform, local-first applications.

0
0.0%
1.2K
total stars
#764
ddsjoberg/gtsummary

An R package that provides customizable and presentation-ready data summary and analytic result tables.

0
0.0%
1.2K
total stars
#765
apache/ozone

Scalable, reliable, distributed storage system optimized for data analytics and object store workloads.

0
0.0%
1.2K
total stars
#766
opengeos/Awesome-GEE

A curated list of Google Earth Engine resources for geospatial analysis and remote sensing applications.

0
0.0%
1.2K
total stars
#767
YuLab-SMU/clusterProfiler

A comprehensive enrichment analysis tool for interpreting omics data, with support for GO, KEGG, and more.

0
0.0%
1.2K
total stars
#768
cvxgrp/cvxportfolio

A Python library for portfolio optimization and back-testing in finance.

0
0.0%
1.2K
total stars
#769
8080labs/ppscore

A Python library that provides a Predictive Power Score (PPS) to measure the predictive power between variables.

0
0.0%
1.2K
total stars
#770
xiaoxu193/PyTeaser

A Python library that summarizes news articles by extracting the most important sentences.

0
0.0%
1.2K
total stars
#771
pytroll/satpy

A Python package for processing earth-observing satellite data with support for common data formats and tools.

0
0.0%
1.2K
total stars
#772
RUCAIBox/RecSysDatasets

A repository of public data sources for building and testing recommender systems.

0
0.0%
1.2K
total stars
#773
scikit-bio/scikit-bio

A versatile Python library for bioinformatics, providing data structures, algorithms, and educational resources.

0
0.0%
1.2K
total stars
#774
pydata/bottleneck

A fast, efficient C extension for NumPy that provides optimized array functions.

0
0.0%
1.2K
total stars
#775
RxSwiftCommunity/RxRealm

A Swift extension for RealmSwift that provides reactive programming support using RxSwift.

0
0.0%
1.2K
total stars
#776
spatie/db-dumper

A PHP library for dumping the contents of a database to a file, supporting multiple database engines.

0
0.0%
1.2K
total stars
#777
apache/incubator-xtable

Apache XTable is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.

0
0.0%
1.2K
total stars
#778
pentaho/mondrian

Mondrian is an OLAP server that enables real-time analysis of large data sets for business users.

0
0.0%
1.2K
total stars
#779
robjhyndman/forecast

A time series forecasting library for R, providing a wide range of models and tools for accurate predictions.

0
0.0%
1.2K
total stars
#780
ChawlaAvi/Daily-Dose-of-Data-Science

A collection of code snippets and tutorials for data science and data analysis in Python.

0
0.0%
1.2K
total stars
#781
zinggAI/zingg

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

0
0.0%
1.2K
total stars
#782
datasets/covid-19

This GitHub repository provides time series data on COVID-19 cases, useful for data analysis and visualization.

0
0.0%
1.2K
total stars
#783
farzaa/gemini-bball

This is a Python library focused on basketball analytics and data processing.

0
0.0%
1.2K
total stars
#784
thinh-vu/vnstock

A beginner-friendly Python toolkit for financial data extraction, analysis, and automation.

0
0.0%
1.2K
total stars
#785
golang/leveldb

The LevelDB key-value database in the Go programming language.

0
0.0%
1.2K
total stars
#786
MarcosMeli/FileHelpers

A free and easy-to-use .NET library for reading and writing CSV and fixed-length data files.

0
0.0%
1.2K
total stars
#787
easystats/easystats

An R project focused on providing high-performance statistical models, data analysis, and visualization tools.

0
0.0%
1.1K
total stars
#788
GeospatialPython/pyshp

A pure Python library for reading and writing ESRI Shapefiles, a popular geospatial data format.

0
0.0%
1.1K
total stars
#789
samayo/country-json

A simple JSON data set of country information, useful for building apps that need country data.

0
0.0%
1.1K
total stars
#790
petewarden/dstk

A collection of open data sets and tools for data science and machine learning tasks.

0
0.0%
1.1K
total stars
#791
abhishek-ch/around-dataengineering

A comprehensive knowledge hub for data engineering, machine learning, and MLOps tools and practices.

0
0.0%
1.1K
total stars
#792
graphframes/graphframes

GraphFrames provides DataFrame-based Graphs for Apache Spark, enabling scalable graph analysis and algorithms.

0
0.0%
1.1K
total stars
#793
apache/accumulo

Apache Accumulo is a scalable and robust key-value store that provides a sparse, sorted, distributed, and persistent multi-dimensional table.

0
0.0%
1.1K
total stars
#794
lvgalvao/data-engineering-roadmap

Comprehensive roadmap for data engineering and AI development in Python

0
0.0%
1.1K
total stars
#795
rordenlab/dcm2niix

A DICOM to NIfTI converter for medical imaging research and neuroimaging applications.

0
0.0%
1.1K
total stars
#796
ucarGroup/DataLink

DataLink is a real-time and offline data exchange platform that supports synchronization between heterogeneous data sources.

0
0.0%
1.1K
total stars
#797
neumino/thinky

An ORM for RethinkDB that provides an elegant and intuitive API for interacting with the database.

0
0.0%
1.1K
total stars
#798
alecthw/mmdb_china_ip_list

A library for generating MaxMind GeoIP2 databases for China IP addresses.

0
0.0%
1.1K
total stars
#799
tdpetrou/Learn-Pandas

This GitHub repository provides tutorials on effectively using the Pandas library for data analysis.

0
0.0%
1.1K
total stars
#800
scratchdata/scratchdata

A Swiss army knife for big data, enabling seamless integration with popular data warehousing solutions.

0
0.0%
1.1K
total stars
1...151718

Stay in the loop

Get weekly updates on trending AI coding tools and projects.