Explore Projects

Discover 3,485 open source projects

Active filters (1):
Search: dataร—
Clear all

Showing 1961-1980 of 3,485 projects

Cysharp/MasterMemory

A C# in-memory document database with source generator-based embedded typed readonly data.

1.8K
Stable
C#
Databases
CLI Tools
Unity
#memory-database#in-memory#source-generator

zalando/spilo

Highly available PostgreSQL cluster using Docker, focused on data infrastructure for developers.

1.8K
Active
Python
Databases
Containerization
#postgresql#high-availability#docker

apachecn/python_data_analysis_and_mining_action

This Python repository contains code examples and notes for data analysis and mining.

1.8K
Archived
Python
Data Analysis
Tutorials & Courses
#data-analysis#data-science#python3

nltk/nltk_data

NLTK Data is a collection of datasets, models, and other resources for natural language processing in Python.

1.8K
Active
Python
Natural Language Processing
Python
#nlp#linguistics#corpora

camelot-dev/excalibur

A Python library for extracting tabular data from PDF documents, with a web interface for human-in-the-loop extraction.

1.8K
Archived
Python
Backend Frameworks
ETL & Pipelines
Flask
#pdf#table-extraction#data-processing

safe-graph/graph-fraud-detection-papers

A curated list of Graph/Transformer-based papers and resources for fraud, anomaly, and outlier detection.

1.8K
Active
Machine Learning
Data Mining
#academic-publications#anomaly-detection#data-science

xflr6/graphviz

Simple Python interface for Graphviz, a popular open-source data visualization tool.

1.8K
Stable
Python
Data Visualization
CLI Tools
Python
#data-visualization#graphviz#network-graph

scholarly-python-package/scholarly

A Python library that allows developers to easily retrieve author and publication data from Google Scholar.

1.8K
Experimental
Python
CLI Tools
Databases
Python
#citation-analysis#citation-network#googlescholar

gchq/Gaffer

A large-scale entity and relation database supporting aggregation of properties for big data applications.

1.8K
Experimental
Java
Databases
API Frameworks
#big-data#graph-database#hadoop

tradingview/charting-library-examples

A collection of examples showcasing integrations of the TradingView Charting Library with various frameworks and libraries.

1.8K
Stable
TypeScript
Charts & Visualization
Frontend Frameworks
React
#charting#examples#tradingview

xavier-zy/Awesome-pytorch-list-CNVersion

A curated list of awesome PyTorch libraries, models, and tutorials in Chinese translation.

1.8K
Archived
Jupyter Notebook
Computer Vision
Natural Language Processing
PyTorch
#awesome-list#computer-vision#deep-learning

RoaringBitmap/CRoaring

Optimized Roaring bitmaps in C and C++ with SIMD (AVX2, AVX-512, NEON) for high-performance data processing.

1.8K
Active
C
Databases
CLI Tools
C
#bitset#simd#avx2

KevinVandy/material-react-table

A fully-featured Material UI V5 implementation of TanStack React Table V8, written in TypeScript

1.8K
Stable
TypeScript
Component Libraries (React)
Frontend Frameworks
React
#material-ui#react-table#typescript

citusdata/cstore_fdw

A columnar storage extension for Postgres built as a foreign data wrapper.

1.8K
Archived
C
Databases
API Frameworks
#columnar-storage#columnar-store#compression

AppFlowy-IO/AppFlowy-Cloud

A collaborative workspace for developers to build with AI tools

1.8K
Stable
Rust
MCP Servers
React
#authentication#streaming#real-time

embulk/embulk

Embulk is a pluggable bulk data loader that helps developers load data from various sources into databases.

1.8K
Stable
Java
API Frameworks
ETL & Pipelines
#bulk-data#etl#data-pipeline

musana/CF-Hero

A reconnaissance tool that uses multiple data sources to discover the origin IP addresses of Cloudflare-protected web applications.

1.8K
Experimental
Go
Security Research
CLI Tools
#security#reconnaissance#cloudflare

benedekrozemberczki/awesome-fraud-detection-papers

A curated list of data mining papers about fraud detection.

1.8K
Active
Python
ML Ops
Data Mining
#fraud-detection#data-mining#classification

keptn/keptn

Keptn is a cloud-native application life-cycle orchestration tool that automates SLO-driven delivery and operations.

1.8K
Archived
Go
API Frameworks
CI/CD
Kubernetes
#continuous-delivery#data-driven#event-based

tower-archive/tower

Small components for building apps, manipulating data, and automating a distributed infrastructure.

1.8K
Archived
CoffeeScript
CoffeeScript
#authentication#streaming#real-time
1...98100...175

Stay in the loop

Get weekly updates on trending AI coding tools and projects.