Explore Projects

Discover 13 open source projects

Active filters (1):
Search: databricksร—
Clear all

Showing 1-13 of 13 projects

getredash/redash

Redash enables data-driven decisions by connecting to data sources and creating visualizations and dashboards.

28.3K
Active
Python
Analytics & Tracking
Search
Python
#analytics#dashboard#data-visualization

cube-js/cube

Open-source semantic layer for AI, BI, and embedded analytics

19.6K
Active
Rust
Agents & Orchestration
ETL & Pipelines
Rust
#semantic-layer#ai-analytics#bi-tool

Tencent/APIJSON

APIJSON is a secure, coding-free ORM library that provides APIs and documentation without backend coding.

18.4K
Active
Java
BaaS Platforms
Java
#baas#api#orm

databrickslabs/dolly

Databricks' Dolly, a large language model trained on the Databricks Machine Learning Platform

10.8K
Archived
Python
LLM Frameworks
Python
#chatbot#databricks#llm

tobymao/sqlglot

A Python library for parsing and transpiling SQL queries across various databases and engines.

9.0K
Active
Python
API Frameworks
ORMs & Query Builders
Python
#sql-parser#sql-transpiler#database-abstraction

microsoft/SynapseML

SynapseML is a simple and distributed machine learning library for building and deploying AI models at scale.

5.2K
Active
Scala
ML Ops
Big Data
Apache Spark
#machine-learning#distributed-computing#big-data

mosaicml/llm-foundry

LLM training code for Databricks foundation models, a tool for vibe coders working with AI and language models.

4.4K
Stable
Python
LLM Frameworks
API Frameworks
PyTorch
#deep-learning#llm#neural-networks

delta-io/delta-rs

A Rust library for interacting with Delta Lake, a data lake storage format, with Python bindings.

3.2K
Active
Rust
ETL & Pipelines
API Frameworks
#delta-lake#etl#data-engineering

datafold/data-diff

A Python library for comparing data across databases, supporting various database engines.

3.0K
Archived
Python
Databases
ETL & Pipelines
#data-diffing#data-quality#data-engineering

databricks/scala-style-guide

Databricks Scala Coding Style Guide - a reference for writing idiomatic Scala code

2.8K
Archived
Linters & Formatters
API Frameworks
#scala#style-guide#coding-standards

Multiwoven/multiwoven

Open-source reverse ETL tool for data activation and customer data platform integration.

1.6K
Active
Ruby
API Frameworks
ETL & Pipelines
React
#data-activation#customer-data-platform#reverse-etl

databricks/megablocks

A Python library for developers to create and manage Databricks Megablock resources.

1.5K
Experimental
Python
API Frameworks
CLI Tools
Python
#databricks#megablock#api

zinggAI/zingg

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

1.2K
Active
Java
ETL & Pipelines
ML Ops
#identity-resolution#entity-resolution#data-deduplication

Stay in the loop

Get weekly updates on trending AI coding tools and projects.