Trending Projects

Discover the fastest growing open source projects

Showing 551-600 of 897 trending projects

#551
realm/realm-java

Realm is a mobile database that serves as a replacement for SQLite and ORMs.

+4
+0.0%
11.5K
total stars
#552
doctrine/dbal

A PHP database abstraction layer that provides a simple, consistent API for interacting with different database systems.

+4
+0.0%
9.7K
total stars
#553
datawhalechina/competition-baseline

A collection of code examples and baselines for common data science and machine learning competitions.

+4
+0.1%
4.7K
total stars
#554
gopherdata/gophernotes

The Go kernel for Jupyter notebooks and nteract, enabling data science and numerical computing in Go.

+4
+0.1%
4.0K
total stars
#555
dtinit/data-transfer-project

The Data Transfer Project enables direct transfer of user data between online service providers.

+4
+0.1%
3.6K
total stars
#556
databricks/koalas

Koalas is a pandas-like API for Apache Spark, enabling data scientists to work with big data using familiar pandas syntax.

+4
+0.1%
3.4K
total stars
#557
facebook/mysql-5.6

This is Facebook's branch of the Oracle MySQL database, including the MyRocks storage engine.

+4
+0.1%
2.6K
total stars
#558
hardikkamboj/An-Introduction-to-Statistical-Learning

This repository provides Python implementations of exercises from the book 'An Introduction to Statistical Learning'.

+4
+0.2%
2.5K
total stars
#559
RJT1990/pyflux

Open source time series library for Python, useful for statistical analysis and modeling.

+4
+0.2%
2.1K
total stars
#560
apache/bookkeeper

Apache BookKeeper is a scalable, fault tolerant and low latency storage service optimized for append-only workloads.

+4
+0.2%
2.0K
total stars
#561
johannfaouzi/pyts

A Python package for time series classification, useful for developers working with time-series data.

+4
+0.2%
1.9K
total stars
#562
schemacrawler/SchemaCrawler

SchemaCrawler is a free database schema discovery and comprehension tool that supports various database management systems.

+4
+0.2%
1.8K
total stars
#563
cnosdb/cnosdb

A high-performance, highly available, and distributed time series database written in Rust.

+4
+0.2%
1.7K
total stars
#564
dotnet/EntityFramework.Docs

Documentation for the popular .NET ORM Entity Framework Core and Entity Framework 6.

+4
+0.2%
1.7K
total stars
#565
faroit/awesome-python-scientific-audio

Curated list of Python software and packages for scientific research in audio

+4
+0.2%
1.7K
total stars
#566
tylertreat/BoomFilters

Performant probabilistic data structures for processing continuous, unbounded streams in Go.

+4
+0.2%
1.6K
total stars
#567
SciTools/cartopy

Cartopy is a Python library for creating maps and visualizing spatial data with matplotlib support.

+4
+0.3%
1.6K
total stars
#568
uwdata/arquero

A JavaScript library for efficient querying and transformation of array-backed data tables.

+4
+0.3%
1.5K
total stars
#569
Awesome-Image-Registration-Organization/awesome-image-registration

A curated collection of resources related to image registration, including books, papers, videos, and toolboxes.

+4
+0.3%
1.5K
total stars
#570
mono/taglib-sharp

A C# library for reading and writing metadata in media files, useful for audio and video processing applications.

+4
+0.3%
1.4K
total stars
#571
movingpandas/movingpandas

A Python library for analyzing movement trajectory data using GeoPandas.

+4
+0.3%
1.4K
total stars
#572
petl-developers/petl

A Python library for extracting, transforming, and loading tabular data.

+4
+0.3%
1.3K
total stars
#573
marcboeker/gmail-to-sqlite

Index your Gmail account to a SQLite DB and perform custom data analysis on your email.

+4
+0.3%
1.2K
total stars
#574
nfstream/nfstream

NFStream is a flexible network data analysis framework for network monitoring, security, and traffic classification.

+4
+0.3%
1.2K
total stars
#575
apache/cloudberry

Open-source massively parallel processing (MPP) database, an alternative to Greenplum.

+4
+0.3%
1.2K
total stars
#576
robjhyndman/forecast

A time series forecasting library for R, providing a wide range of models and tools for accurate predictions.

+4
+0.3%
1.2K
total stars
#577
farzaa/gemini-bball

This is a Python library focused on basketball analytics and data processing.

+4
+0.3%
1.2K
total stars
#578
brandon-rhodes/pycon-pandas-tutorial

A tutorial for using the popular Python data analysis library Pandas, presented at PyCon 2015.

+4
+0.4%
1.1K
total stars
#579
intake/intake

Intake is a lightweight Python package for discovering, investigating, loading and distributing data.

+4
+0.4%
1.1K
total stars
#580
hail-is/hail

Cloud-native genomic dataframes and batch computing for bioinformatics and genetics research.

+4
+0.4%
1.1K
total stars
#581
LAStools/LAStools

This repository contains efficient tools for LiDAR processing, focused on working with point cloud data.

+4
+0.4%
1.0K
total stars
#582
taynaud/python-louvain

A Python library for implementing the Louvain community detection algorithm on graphs.

+4
+0.4%
1.0K
total stars
#583
bashtage/linearmodels

This Python library provides additional linear models for statistical modeling and analysis.

+4
+0.4%
1.0K
total stars
#584
airbnb/knowledge-repo

A next-generation curated knowledge sharing platform for data scientists and other technical professionals.

+3
+0.1%
5.5K
total stars
#585
pudo/dataset

Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions.

+3
+0.1%
4.9K
total stars
#586
nalepae/pandarallel

A parallel processing library for Pandas that improves performance on multi-core CPUs.

+3
+0.1%
3.8K
total stars
#587
rob-med/awesome-TS-anomaly-detection

A curated list of tools and datasets for anomaly detection on time-series data.

+3
+0.1%
3.2K
total stars
#588
dblalock/bolt

A fast C++ library for high-performance matrix and vector operations.

+3
+0.1%
2.5K
total stars
#589
The-Japan-DataScientist-Society/100knocks-preprocess

A repository for the 100 Knocks of Data Science Preprocessing, focused on structured data processing.

+3
+0.1%
2.5K
total stars
#590
JasonKessler/scattertext

A Python library for creating beautiful visualizations of language differences across document types.

+3
+0.1%
2.3K
total stars
#591
fugue-project/fugue

A unified interface for distributed computing on Spark, Dask and Ray without any rewrites.

+3
+0.1%
2.1K
total stars
#592
yhilpisch/py4fi

This is a Python library for financial applications, not a tool for AI-powered vibe coders.

+3
+0.2%
1.9K
total stars
#593
edyoda/data-science-complete-tutorial

This repository provides comprehensive tutorials and resources for learning data science and machine learning using Python.

+3
+0.2%
1.8K
total stars
#594
Cysharp/MasterMemory

A C# in-memory document database with source generator-based embedded typed readonly data.

+3
+0.2%
1.8K
total stars
#595
DQinYuan/chinese_province_city_area_mapper

A Python module for extracting and mapping Chinese province, city, and district data.

+3
+0.2%
1.8K
total stars
#596
rich-iannone/DiagrammeR

Graph and network visualization library for R developers working with tabular data

+3
+0.2%
1.7K
total stars
#597
Giorgi/EntityFramework.Exceptions

A .NET Standard library that provides strongly typed exceptions for Entity Framework Core across multiple database providers.

+3
+0.2%
1.7K
total stars
#598
getdozer/dozer

Dozer is a real-time data movement tool that leverages CDC to move data between various sources and sinks.

+3
+0.2%
1.6K
total stars
#599
itbdw/ip-database

An offline IP database for developers to look up IP address geolocation information.

+3
+0.2%
1.5K
total stars
#600
pyjanitor-devs/pyjanitor

A Python library for cleaning and transforming data, inspired by the R package Janitor.

+3
+0.2%
1.5K
total stars
1...1113...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.