Build a data warehouse from scratch, including full load, daily incremental load, schema design, and SCD Type 1 and 2.
A data warehousing project for retail sales using dimensional modelling best practices with SCD Type 2 on AWS Redshift. Uses AWS Lambda, Glue Workflows, and Python Shell jobs to create and automate an ELT pipeline in which batch data landing in S3 is loaded into Redshift and the necessary transformations are performed to meet requirements.
Production-grade ELT pipeline classifying VIX volatility regimes. Features incremental models, SCD-2 snapshots, recursive CTEs, and Slim CI/CD. Stack: dbt, DuckDB/MotherDuck, Python.
Enterprise Data Warehouse with Star Schema, SCD Type 2, ETL pipelines, and multi-currency analytics (SQL Server + Python)
Working with SCD Type 2 and Change Data Capture, and need a Data Vault model to test Azure Data Factory v2? This code will help!
In this project we build a real-time healthcare patient data pipeline using services and tools such as Azure Event Hubs, Azure Databricks, Delta Lake, and Synapse Analytics. We also implement a medallion architecture with schema evolution, create fact and dimension tables, and connect the cleaned and transformed data to Power BI.
A complete data integration solution that migrates data from multiple source systems into a modern data warehouse, with full documentation, quality validation, and analytical reporting.
End-to-End Retail Lakehouse on Microsoft Fabric | Medallion Architecture | PySpark | SCD Type-2 | DirectLake
Design and implement a full ELT data pipeline using Snowflake and S3, featuring star schema modelling, SCD Type 1 & 2 handling, and incremental load automation
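Several of the projects above automate daily incremental loads. A minimal sketch of the underlying high-watermark pattern (the row shape and `updated_at` field are hypothetical, not from any specific repo): each run extracts only rows modified since the last persisted watermark, then advances that watermark for the next run.

```python
from datetime import datetime

def incremental_extract(source_rows, last_watermark):
    """High-watermark incremental load: pick up only rows changed since
    the previous run, and return the watermark to persist for the next one."""
    new_rows = [r for r in source_rows if r["updated_at"] > last_watermark]
    next_watermark = max(
        (r["updated_at"] for r in new_rows), default=last_watermark
    )
    return new_rows, next_watermark
```

A full load is the same call with the watermark set to the minimum possible timestamp.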
A production-grade Modern Data Stack (MDS) implementation featuring automated ELT, SCD Type 2 history tracking, and CI/CD quality guardrails using Dagster, dbt Core, DuckDB, and Soda.
Production-style Slowly Changing Dimension (SCD Type 2) pipeline built with Snowflake, dbt, and AWS S3. Demonstrates secure S3 ingestion, layered bronze/silver/gold modeling, dbt snapshots for historical tracking, and analytics-ready views identifying active vs historical records.
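The SCD Type 2 pattern these projects implement, whether via dbt snapshots or warehouse MERGE statements, boils down to expiring the current row and appending a new version. A minimal in-memory sketch (the customer/city attributes and field names are illustrative, not taken from any repo above):

```python
from dataclasses import dataclass, replace
from datetime import date
from typing import Optional

@dataclass
class DimRow:
    """One version of a customer record in the dimension table."""
    customer_id: int
    city: str                 # tracked attribute (hypothetical)
    valid_from: date
    valid_to: Optional[date]  # None marks the current version
    is_current: bool

def apply_scd2(dim, customer_id, new_city, as_of):
    """SCD Type 2: close the current row and append a new version."""
    out = []
    for row in dim:
        if row.customer_id == customer_id and row.is_current:
            if row.city == new_city:
                out.append(row)  # no change -> keep the row as-is
                continue
            # expire the old version, then insert the new current one
            out.append(replace(row, valid_to=as_of, is_current=False))
            out.append(DimRow(customer_id, new_city, as_of, None, True))
        else:
            out.append(row)
    return out
```

Filtering on `is_current` yields the active view; the full table yields history, matching the active-vs-historical views described above.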
End-to-end data pipeline: CSV → Bronze → Silver → Gold on AWS S3 with PySpark, Airflow, Great Expectations, SCD Type 2, and Streamlit dashboard
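The Bronze → Silver → Gold flow in projects like this one can be sketched as plain transformations (the order schema and revenue metric are invented for illustration): Bronze holds raw records as landed, Silver applies quality gates, deduplication, and type enforcement, and Gold aggregates to analytics-ready output.

```python
def to_silver(bronze_rows):
    """Silver layer: drop malformed records, deduplicate on the business key,
    and enforce types (hypothetical order schema)."""
    seen, silver = set(), []
    for r in bronze_rows:
        if r.get("order_id") is None or r.get("amount") is None:
            continue  # quality gate: reject incomplete rows
        if r["order_id"] in seen:
            continue  # deduplicate replayed records
        seen.add(r["order_id"])
        silver.append({"order_id": r["order_id"], "amount": float(r["amount"])})
    return silver

def to_gold(silver_rows):
    """Gold layer: aggregate cleaned rows into an analytics-ready metric."""
    return {"order_count": len(silver_rows),
            "total_revenue": sum(r["amount"] for r in silver_rows)}
```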
End-to-end Azure Databricks retail data engineering project using Medallion Architecture (Bronze, Silver, Gold). Implements Auto Loader, Unity Catalog, Delta Lake, SCD Type 1 & 2 dimensions, and Fact Orders for analytics-ready star schema modeling.
Enterprise-grade Microsoft Fabric Lakehouse project implementing Medallion architecture (Bronze, Silver, Gold) for financial transactions.
End-to-end cloud retail intelligence platform integrating data engineering, predictive analytics, and executive BI dashboards.