Explore Projects

Discover 3,485 open source projects

Active filters (1):
Search: data×
Clear all

Showing 2341-2360 of 3,485 projects

zed-0xff/zsteg

A Ruby library for detecting hidden data in PNG and BMP images using stegano techniques.

1.5K
Active
Ruby
CLI Tools
Security Research
#steganography#image-analysis#data-extraction

hi-primus/optimus

Agile data preparation workflows made easy with popular Python data science libraries.

1.5K
Archived
Python
ETL & Pipelines
API Frameworks
#big-data-cleaning#data-analysis#data-cleaning

NVlabs/noise2noise

Official TensorFlow implementation of the Noise2Noise: Learning Image Restoration without Clean Data paper.

1.5K
Archived
Python
Computer Vision
TensorFlow
#image-restoration#computer-vision#machine-learning

paradigmxyz/cryo

cryo is a Rust library for extracting blockchain data to parquet, CSV, JSON, or Python dataframes.

1.5K
Archived
Rust
ETL & Pipelines
API Frameworks
#blockchain#ethereum#parquet

yongzhuo/nlp_xiaojiang

A comprehensive NLP toolkit for Chinese language processing, including chatbots, text similarity, classification, and more.

1.5K
Archived
Python
LLM Frameworks
API Frameworks
PyTorch
#nlp#chatbot#text-classification

nubank/fklearn

fklearn: A functional machine learning library for Python.

1.5K
Experimental
Jupyter Notebook
React
#machine learning#python#data analysis

ecomfe/awesome-echarts

Awesome list of the popular data visualization library Apache ECharts

1.5K
Archived
Charts & Visualization
#data-visualization#charts#open-source

aws-samples/aws-glue-samples

AWS Glue code samples for building data integration and ETL pipelines on AWS.

1.5K
Stable
Python
ETL & Pipelines
#aws#glue#etl

AgentEra/Agently

A Python framework that makes it easy to build AI applications using large language models like GPT and LLMs.

1.5K
Active
Python
LLM Frameworks
Agents & Orchestration
Python
#llm#agent-based-framework#ai-application-development

combust/mleap

MLeap is a library for deploying machine learning pipelines to production using Scala, Python, and Spark.

1.5K
Active
Scala
ML Ops
API Frameworks
Scala
#machine-learning#pipeline#production

OBenner/data-engineering-interview-questions

This GitHub repository contains over 2,000 data engineering interview questions to help developers prepare.

1.5K
Active
Python
Interview Prep
ETL & Pipelines
#data-engineering#interview-questions#interview-prep

graphite-project/carbon

Carbon is a component of the Graphite monitoring system that receives and writes metrics to disk for time-series data.

1.5K
Stable
Python
API Frameworks
Databases
#metrics#time-series#monitoring

shuyu-labs/AntSK

An offline AI knowledge base and agent built with .NET, Blazor, and Semantic Kernel, supporting local AI models.

1.5K
Stable
CSS
LLM Frameworks
API Frameworks
Blazor
#ai#agent#dotnet

crownpku/Rasa_NLU_Chi

A Python library for turning Chinese natural language into structured data for chatbots and NLP applications.

1.5K
Archived
Python
LLM Frameworks
API Frameworks
Python
#chatbot#natural-language-processing#chinese

ilyakatz/data-migrate

A Ruby library for migrating and updating data alongside your database structure.

1.5K
Stable
Ruby
API Frameworks
ORMs & Query Builders
Rails
#data-schema#schema-migrations#rails

msgpack/msgpack-javascript

msgpack-javascript is a TypeScript library for serializing and deserializing MessagePack data in JavaScript.

1.5K
Active
TypeScript
API Clients & Testing
Serialization
TypeScript
#messagepack#serialization#typescript

KruxAI/ragbuilder

A Python toolkit to create optimal Production-ready Retrieval Augmented Generation (RAG) setups for AI/ML projects.

1.5K
Experimental
Python
RAG & Vector
API Frameworks
Python
#rag#retrieval-augmented-generation#ai-tools

andris9/jStorage

jStorage is a simple key-value database to store data on the browser side.

1.5K
Archived
JavaScript
Frontend Frameworks
Component Libraries (React)
React
#storage#client-side#react

CovenantSQL/CovenantSQL

A decentralized, high-performance SQL database with blockchain features for developers needing a trusted data storage solution.

1.5K
Archived
Go
Databases
#blockchain#decentralized#sql

ecomfe/echarts-liquidfill

A JavaScript library that provides liquid fill charts for Apache ECharts, a popular data visualization framework.

1.5K
Archived
JavaScript
Charts & Visualization
React
#data-visualization#charts#echarts
1...117119...175

Stay in the loop

Get weekly updates on trending AI coding tools and projects.