Explore Projects

Discover 7 open source projects

Active filters (1):
Search: dataengineering
DataExpert-io/data-engineer-handbook

Comprehensive data engineering resource hub with learning paths, books, communities, and tools

40.4K
Stable
Jupyter Notebook
Tutorials & Courses
Awesome Lists
Apache Airflow
#dataengineering #bigdata #apachespark

open-metadata/OpenMetadata

A unified metadata platform for data discovery, data observability, and data governance.

8.8K
Active
TypeScript
Data Catalog
Data Governance
#data-discovery #data-lineage #data-quality

datafold/data-diff

A Python library for comparing data across databases, supporting various database engines.

3.0K
Archived
Python
Databases
ETL & Pipelines
#data-diffing #data-quality #data-engineering

TobikoData/sqlmesh

Scalable and efficient data transformation framework with backwards compatibility for dbt.

2.9K
Active
Python
ETL & Pipelines
Databases
#data-engineering #dataops #dbt

Marktechpost/AI-Tutorial-Codes-Included

A collection of Jupyter notebooks and tutorials for a variety of AI projects and data science tasks.

2.1K
Active
Jupyter Notebook
Tutorials & Courses
#ai #machine-learning #data-science

zinggAI/zingg

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

1.2K
Active
Java
ETL & Pipelines
ML Ops
#identity-resolution #entity-resolution #data-deduplication

abhishek-ch/around-dataengineering

A comprehensive knowledge hub for data engineering, machine learning, and MLOps tools and practices.

1.1K
Archived
Python
ETL & Pipelines
ML Ops
#data-engineering #machine-learning #mlops
