Explore Projects

Discover 3,485 open source projects

Active filters (1):
Search: data

Showing 2161-2180 of 3,485 projects

mlrun/mlrun

MLRun is an open-source MLOps platform for building and managing continuous ML applications.

1.7K
Active
Python
MLOps
API Frameworks
Python
#machine-learning#data-engineering#workflow

ClimbsRocks/auto_ml

Automated machine learning for analytics & production use cases powered by popular ML libraries.

1.7K
Archived
Python
MLOps
ETL & Pipelines
scikit-learn
#automated-machine-learning#data-science#production-ready

digoarthur/github-automated-repos

A TypeScript library that enables developers to manage their GitHub projects and portfolios from a single place.

1.7K
Stable
TypeScript
Component Libraries (React)
CLI Tools
React
#github-api#github-automation#github-portfolio

greyblake/nutype

A Rust library that provides newtype-based data validation and sanitization with strong type guarantees.

1.7K
Active
Rust
Validation
Authentication
#rust#data-validation#data-sanitization

j-easy/easy-random

A simple Java library for generating random data and test data for Java beans and records.

1.7K
Active
Java
Validation
CLI Tools
#java#random#random-data-generation

JackySoft/marsview

Marsview is a low-code visualization platform for building middle and backend applications with event interaction, API integration, and data orchestration.

1.7K
Stable
TypeScript
MCP Servers
Component Libraries (React)
React
#lowcode#visualization#middleware

TryCatchHCF/Cloakify

A Python tool for data exfiltration and infiltration using text-based steganography to evade detection.

1.7K
Archived
Python
Penetration Testing
Privacy Tools
#av-evasion#data-exfiltration#dlp-evasion

discord/sorted_set_nif

An Elixir SortedSet backed by a Rust-based NIF, useful for building high-performance data structures.

1.7K
Active
Elixir
API Frameworks
Databases
#elixir#rust#nif

jasonwei20/eda_nlp

Data augmentation for NLP text classification, presented at EMNLP 2019.

1.6K
Archived
Python
Python
#data-augmentation#nlp#text-classification

rethinkdb/rethinkdb-go

A Go language driver for the RethinkDB real-time database, enabling developers to build data-driven applications.

1.6K
Stable
Go
API Frameworks
Databases
#go#rethinkdb#database

Hiflylabs/awesome-dbt

A curated list of awesome resources for the data transformation tool dbt, focused on analytics engineering.

1.6K
Active
ETL & Pipelines
#analytics-engineering#data-engineering#dbt

iamtodor/data-science-interview-questions-and-answers

A collection of data science interview questions and answers for developers to improve their skills.

1.6K
Archived
AI & Machine Learning
React
#data-science#interview-preparation#machine-learning

mostafa-saad/ArabicCompetitiveProgramming

This repository provides training materials and a library for competitive programming in Arabic.

1.6K
Archived
C++
Tutorials & Courses
API Frameworks
#competitive-programming#algorithms#data-structures

matplotlib/ipympl

A Jupyter notebook integration for the popular data visualization library Matplotlib.

1.6K
Active
Jupyter Notebook
Charts & Visualization
IDE Extensions
Jupyter
#data-visualization#jupyter-notebook#matplotlib

KilledByAPixel/JSONCrush

A JavaScript library that compresses JSON into URL-friendly strings for efficient data transmission.

1.6K
Experimental
JavaScript
API Clients & Testing
Frontend Frameworks
JavaScript
#json#compression#url-shortener

Toyhom/Chinese-medical-dialogue-data

This repository contains a dataset of Chinese medical dialogues for NLP and conversational AI research.

1.6K
Archived
Python
LLM Frameworks
Datasets
#medical-data#chinese-language#natural-language-processing

tylertreat/BoomFilters

Performant probabilistic data structures for processing continuous, unbounded streams in Go.

1.6K
Stable
Go
Caching
CLI Tools
#bloom-filter#count-min-sketch#cuckoo-filter

neomatrix369/awesome-ai-ml-dl

A curated list of awesome AI, ML, and DL resources, including code, tutorials, and study notes.

1.6K
Stable
Jupyter Notebook
LLM Frameworks
Agents & Orchestration
Jupyter Notebook
#ai#machine-learning#deep-learning

threatexpress/domainhunter

A Python tool that checks expired domains for categorization, reputation, and historical data to identify phishing and C2 domain candidates.

1.6K
Archived
Python
Security Research
CLI Tools
#phishing#security-research#domain-analysis

cswinter/LocustDB

A blazingly fast analytics database built with Rust, optimized for rapidly devouring large amounts of data.

1.6K
Experimental
Rust
Databases
API Frameworks
#analytics#database#rust