Data & Databases

ORMs, query builders, databases, and data pipelines

Showing 881-900 of 5,250 projects

locuslab/TCN

A library for sequence modeling benchmarks and temporal convolutional networks (TCN) in Python.

4.5K
Archived
Python
ML Ops
API Frameworks
Python
#sequence-modeling#temporal-convolutional-networks#benchmarking

facebookresearch/DrQA

A Python library for reading Wikipedia to answer open-domain questions, focused on natural language processing and machine learning.

4.5K
Archived
Python
LLM Frameworks
API Frameworks
None
#natural-language-processing#machine-learning#open-domain-qa

salesforce/Merlion

A Python framework for building advanced time series forecasting and anomaly detection models.

4.5K
Archived
Python
ML Ops
Caching
#time-series#forecasting#anomaly-detection

ultrajson/ultrajson

Ultra fast JSON decoder and encoder written in C with Python bindings

4.5K
Active
C++
API Clients & Testing
API Frameworks
#json#decoder#encoder

water8394/flink-recommandSystem-demo

A real-time product recommendation system built with Flink, Redis, HBase, and Kafka for vibe coders.

4.5K
Archived
Java
Realtime
Caching
Flink
#flink#recommender-system#real-time

deckerst/aves

Aves is a gallery and metadata explorer app built for Android with Flutter, supporting a variety of media formats.

4.5K
Active
Dart
Component Libraries (Flutter)
File Storage
Flutter
#android#gallery#metadata

deanmalmgren/textract

A Python library that provides a simple and unified interface for extracting text from any document format.

4.5K
Archived
HTML
ETL & Pipelines
CLI Tools
Python
#text-extraction#pdf#docx

Kyubyong/transformer

A TensorFlow implementation of the Transformer, a popular deep learning model for natural language processing tasks.

4.5K
Archived
Python
LLM Frameworks
API Frameworks
TensorFlow
#transformer#attention-mechanism#natural-language-processing

thunlp/OpenNRE

An open-source Python library for neural relation extraction, a key task in natural language processing.

4.4K
Archived
Python
LLM Frameworks
API Frameworks
Python
#relation-extraction#natural-language-processing#information-extraction

dedupeio/dedupe

A Python library for accurate and scalable fuzzy matching, record deduplication, and entity resolution.

4.4K
Experimental
Python
ORMs & Query Builders
API Frameworks
Python
#data-cleaning#entity-resolution#fuzzy-matching

Kozea/Radicale

A simple CalDAV (calendar) and CardDAV (contact) server written in Python.

4.4K
Active
Python
API Frameworks
Databases
#caldav#carddav#icalendar

varunshenoy/GraphGPT

A library for extracting knowledge graphs from unstructured text using the GPT-3 language model.

4.4K
Archived
JavaScript
LLM Frameworks
GraphQL
Node
#gpt-3#knowledge-graph#natural-language-processing

NVlabs/tiny-cuda-nn

A fast C++/CUDA neural network framework for high-performance deep learning and rendering.

4.4K
Stable
C++
Frameworks
API Frameworks
PyTorch
#cuda#deep-learning#gpu

googleapis/google-cloud-go

Google Cloud Client Libraries for Go, providing easy access to Google Cloud services.

4.4K
Active
Go
API Clients & Testing
Databases
Go
#google-cloud#bigquery#datastore

CLUEbenchmark/CLUEDatasetSearch

A comprehensive search tool for finding Chinese NLP datasets, with support for common English NLP datasets as well.

4.4K
Archived
Python
Datasets
Tutorials & Courses
Python
#chinese#nlp#datasets

infiniflow/infinity

A high-performance, AI-native database for LLM applications with hybrid search capabilities.

4.4K
Active
C++
Vector Databases
Vector Databases
#ai-native#hybrid-search#vector-search

ceres-solver/ceres-solver

A large-scale non-linear optimization library for computer vision and other scientific applications.

4.4K
Active
C++
Computer Vision
API Frameworks
#nonlinear-optimization#computer-vision#bundle-adjustment

dhall-lang/dhall-lang

Dhall is a maintainable and human-readable configuration language for developers.

4.4K
Active
Dhall
CLI Tools
API Frameworks
#configuration-language#dhall#maintainable

tensorflow/probability

Probabilistic reasoning and statistical analysis tools built on TensorFlow for data scientists and ML researchers.

4.4K
Active
Jupyter Notebook
ML Ops
Databases
TensorFlow
#bayesian-methods#deep-learning#machine-learning

hanc00l/wooyun_public

This is an archived repository that provides a web crawler and search engine for the now-defunct Wooyun security vulnerability database.

4.4K
Archived
PHP
Security Research
Backend Frameworks
PHP
#security#vulnerability#database
1...4446...263

Stay in the loop

Get weekly updates on trending AI coding tools and projects.