Explore Projects

Discover 3,485 open source projects

Active filters (1):
Search: dataร—
Clear all

Showing 681-700 of 3,485 projects

aimhubio/aim

Aim is an open-source experiment tracker that makes it easy to track and visualize machine learning experiments.

6.0K
Active
Python
ML Ops
CLI Tools
PyTorch
#experiment-tracking#metadata-tracking#visualization

apache/hive

Apache Hive is a data warehouse software built on top of Apache Hadoop for querying and managing large datasets.

6.0K
Active
Java
Databases
API Frameworks
#apache#big-data#database

datajuicer/data-juicer

A Python library for processing and analyzing data with foundation models and large language models.

6.0K
Active
Python
LLM Frameworks
ETL & Pipelines
Python
#data-processing#data-analysis#foundation-models

cue-lang/cue

CUE is a data validation and definition language for text-based and dynamic configuration.

6.0K
Active
Go
API Frameworks
Databases
Go
#configuration#data-validation#kubernetes

apache/nifi

Apache NiFi is a powerful data flow management system that enables developers to build complex data pipelines.

6.0K
Active
Java
API Frameworks
ETL & Pipelines
#data-pipeline#etl#streaming

WeiYe-Jing/datax-web

DataX-Web is a visual data integration platform that supports RDBMS, Hive, HBase, ClickHouse, MongoDB and other data sources.

6.0K
Archived
Java
BaaS Platforms
ETL & Pipelines
Java
#data-integration#etl#rdbms

naver/billboard.js

A reusable, easy-to-use JavaScript chart library based on D3.js for building data visualizations.

6.0K
Active
TypeScript
Charts & Visualization
Frontend Frameworks
React
#data-visualization#charts#d3

beamandrew/medical-data

No description provided for this medical data repository.

6.0K
Archived
Databases
Backend Frameworks
#medical-data#dataset

evidence-dev/evidence

A business intelligence platform that allows developers to build interactive data visualizations in SQL and Markdown.

6.0K
Stable
JavaScript
Charts & Visualization
Databases
Svelte
#analytics#business-intelligence#dashboard

schemaorg/schemaorg

Schema.org provides a shared vocabulary for structured data on the web, enabling interoperable applications.

6.0K
Active
HTML
API Clients & Testing
Backend Frameworks
#structured-data#json-ld#rdf

niderhoff/nlp-datasets

A curated list of free/public domain text datasets for natural language processing (NLP) tasks.

6.0K
Archived
Datasets
#nlp#text-data#public-datasets

airweave-ai/airweave

Open-source context retrieval layer for AI agents

6.0K
Active
Python
React
#context-retrieval#ai-agents#open-source

dragonflyoss/dragonfly-archived

Dragonfly is a P2P-based CDN solution for accelerating Docker image distribution across data centers and cloud providers.

6.0K
Archived
Go
Containerization
CI/CD
Docker
#p2p#docker#registry

youtube/api-samples

Code samples for YouTube APIs, including Data API, Analytics API, and Live Streaming API.

5.9K
Archived
Java
API Clients & Testing
API Documentation
Java
#youtube-api#data-api#analytics-api

springfox/springfox

Springfox is an open-source Java library that automatically generates Swagger documentation for APIs built with Spring.

5.9K
Archived
Java
API Documentation
API Frameworks
Spring
#swagger#openapi#documentation

snorkel-team/snorkel

A powerful system for quickly generating high-quality training data with weak supervision for AI/ML projects.

5.9K
Archived
Python
LLM Frameworks
Data Pipelines
Python
#data-augmentation#weak-supervision#machine-learning

MontFerret/ferret

Declarative web scraping library written in Go, providing a powerful DSL for extracting data from websites.

5.9K
Stable
Go
Backend Frameworks
CLI Tools
#web-scraping#crawler#data-mining

itsgoingd/clockwork

Clockwork is a PHP debugging and profiling tool that provides a browser-based interface for inspecting server-side data.

5.9K
Stable
PHP
Laravel
#debugging#profiling#php

Respect/Validation

A powerful and flexible validation library for PHP developers that provides a fluent interface for defining and validating data.

5.9K
Active
PHP
API Frameworks
Validation
#validation#fluent-interface#standalone

matthewmueller/x-ray

A versatile and powerful web scraping library for JavaScript, designed to help developers extract data from the web with ease.

5.9K
Active
JavaScript
Frontend Frameworks
API Frameworks
Node.js
#web-scraping#data-extraction#crawling
1...3436...175

Stay in the loop

Get weekly updates on trending AI coding tools and projects.