Trending Projects

Discover the fastest growing open source projects

Showing 751-800 of 897 trending projects

#751
topepo/caret

An R package for training and plotting classification and regression models.

0
0.0%
1.7K
total stars
#752
cswinter/LocustDB

A blazingly fast analytics database built with Rust, optimized for rapidly devouring large amounts of data.

0
0.0%
1.6K
total stars
#753
github/covid19-dashboard

An open-source COVID-19 dashboard powered by the fastpages framework, featuring data visualizations.

0
0.0%
1.6K
total stars
#754
js-data/js-data

A framework-agnostic, datastore-agnostic JavaScript ORM built for ease of use and peace of mind.

0
0.0%
1.6K
total stars
#755
cgarciae/pypeln

Concurrent data pipelines in Python for building efficient and scalable data processing workflows.

0
0.0%
1.6K
total stars
#756
re-data/re-data

A data quality and observability tool for monitoring and fixing data issues before they become problems.

0
0.0%
1.6K
total stars
#757
gobuffalo/pop

A Go ORM and query builder for interacting with databases in Go applications.

0
0.0%
1.5K
total stars
#758
Intel-bigdata/HiBench

HiBench is a big data benchmark suite for evaluating the performance of different big data frameworks.

0
0.0%
1.5K
total stars
#759
skaiworldwide-oss/agensgraph

AgensGraph is a transactional graph database based on PostgreSQL for enterprise-level applications.

0
0.0%
1.5K
total stars
#760
tensorbase/tensorbase

TensorBase is a new big data warehousing solution built with Rust, focused on high-performance analytics.

0
0.0%
1.5K
total stars
#761
QueryKit/QueryKit

QueryKit is a simple CoreData query language for Swift and Objective-C developers.

0
0.0%
1.5K
total stars
#762
apache/carbondata

CarbonData is a high-performance data store solution for big data analytics on Hadoop and Spark.

0
0.0%
1.4K
total stars
#763
distributedio/titan

A distributed, Redis-compatible NoSQL database that provides high performance and scalability.

0
0.0%
1.4K
total stars
#764
oracle-samples/oracle-db-examples

This repository provides code examples for Oracle's AI-enabled database features and integrations.

0
0.0%
1.4K
total stars
#765
quiltdata/quilt

Quilt is a data mesh for connecting people with actionable data, built with TypeScript.

0
0.0%
1.4K
total stars
#766
slashbase/slashbaseide

Modern database IDE for dev & data workflows, supporting MySQL, PostgreSQL & MongoDB.

0
0.0%
1.3K
total stars
#767
Data-Learn/data-engineering

A comprehensive resource for developers to learn and get started with data engineering using Python.

0
0.0%
1.3K
total stars
#768
wainshine/Company-Names-Corpus

A corpus of company names, abbreviations, and brands that can be used for Chinese text segmentation and entity recognition.

0
0.0%
1.3K
total stars
#769
supermarin/ObjectiveRecord

An ActiveRecord-like API for CoreData that provides object-relational mapping (ORM) for iOS development.

0
0.0%
1.3K
total stars
#770
objectbox/objectbox-go

Embedded Go Database, a fast open-source NoSQL database solution for Go projects.

0
0.0%
1.3K
total stars
#771
uber-archive/AthenaX

A scalable, SQL-based streaming analytics platform from Uber, built on top of Apache Flink.

0
0.0%
1.2K
total stars
#772
zhihu/kids

A C++ library for processing data streams.

0
0.0%
1.2K
total stars
#773
yhat/db.py

db.py is a Python library that provides an easier way to interact with your databases.

0
0.0%
1.2K
total stars
#774
li6185377/LKDBHelper-SQLite-ORM

An automatic database ORM library for Objective-C that provides thread-safe and deadlock-free database operations.

0
0.0%
1.2K
total stars
#775
influxdata/influxdb-java

Java client library for connecting to the InfluxDB time series database.

0
0.0%
1.2K
total stars
#776
wireservice/agate

A Python data analysis library optimized for humans instead of machines.

0
0.0%
1.2K
total stars
#777
machow/siuba

A Python library for using dplyr-like syntax with pandas and SQL databases.

0
0.0%
1.2K
total stars
#778
eventql/eventql

Distributed, massively parallel SQL query engine for big data analytics and timeseries workloads.

0
0.0%
1.2K
total stars
#779
xiaoxu193/PyTeaser

A Python library that summarizes news articles by extracting the most important sentences.

0
0.0%
1.2K
total stars
#780
datasets/covid-19

This GitHub repository provides time series data on COVID-19 cases, useful for data analysis and visualization.

0
0.0%
1.2K
total stars
#781
petewarden/dstk

A collection of open data sets and tools for data science and machine learning tasks.

0
0.0%
1.1K
total stars
#782
scratchdata/scratchdata

A Swiss army knife for big data, enabling seamless integration with popular data warehousing solutions.

0
0.0%
1.1K
total stars
#783
qri-io/qri

An open-source platform for building and sharing datasets, focused on trust, privacy, and decentralization.

0
0.0%
1.1K
total stars
#784
moby/datakit

Connect processes into powerful data pipelines with a simple git-like filesystem interface

0
0.0%
1.1K
total stars
#785
traildb/traildb

TrailDB is an efficient database for storing and querying series of events.

0
0.0%
1.1K
total stars
#786
gaarason/database-all

Eloquent ORM for Java 8, 11, 17, 21, 23 and Spring Boot 2.x, 3.x

0
0.0%
1.1K
total stars
#787
mahmoudparsian/data-algorithms-book

This repository provides a comprehensive guide and implementations for data algorithms using MapReduce, Spark, Java, and Scala.

0
0.0%
1.1K
total stars
#788
oetiker/rrdtool-1.x

RRDtool is a time-series database system for efficiently storing and graphing data.

0
0.0%
1.1K
total stars
#789
liucongg/NLPDataSet

A repository containing various NLP datasets collected and organized by the owner.

0
0.0%
1.1K
total stars
#790
realm/realm-core

Core database component for the Realm Mobile Database SDKs, a popular NoSQL database for mobile apps.

0
0.0%
1.0K
total stars
#791
rgeo/rgeo

A geospatial data library for Ruby that provides a set of tools for working with geographic data.

0
0.0%
1.0K
total stars
#792
pixiedust/pixiedust

A Python helper library for enhancing Jupyter Notebooks with data visualization and analysis capabilities.

0
0.0%
1.0K
total stars
#793
facebookresearch/cc_net

Tools to download and clean up Common Crawl data, a large web crawl dataset, for further analysis and processing.

0
0.0%
1.0K
total stars
#794
lacuna/bifurcan

A library of functional, durable data structures written in Java for developers building robust applications.

0
0.0%
1.0K
total stars
#795
sentinelsat/sentinelsat

A Python library for searching and downloading Copernicus Sentinel satellite images for geographic data analysis.

0
0.0%
1.0K
total stars
#796
efficient/cuckoofilter

A space-efficient C++ implementation of the Cuckoo filter, a probabilistic data structure for set membership testing.

0
0.0%
1.0K
total stars
#797
SciRuby/sciruby

SciRuby provides a collection of tools for scientific computation in Ruby, catering to developers working with data and scientific applications.

0
0.0%
1.0K
total stars
#798
rethinkdb/rethinkdb

Realtime NoSQL database for web apps

-1
0.0%
27.0K
total stars
#799
pachyderm/pachyderm

Pachyderm is a data-centric pipeline and data versioning platform for building and scaling data-intensive applications.

-1
-0.0%
6.3K
total stars
#800
lux-org/lux

Automatically visualize your pandas dataframes with a single print command, enabling quick EDA.

-1
-0.0%
5.4K
total stars