Sales ETL & Data Quality Pipeline

An end-to-end ETL workflow in Power BI.

The Project

This project focuses on building an end-to-end ETL workflow in Power BI to clean, validate, and consolidate fragmented sales datasets into reliable, analysis-ready reporting structures.

Using Adventure Works sales data distributed across multiple Excel sources, the workflow involved data cleaning, anomaly detection, append and merge operations, and relational integrity validation using Power Query.

A major focus of the project was identifying how incomplete relational coverage and incorrect join strategies can distort business metrics, highlighting the importance of validating data integrity before performing analytical reporting.
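
The effect can be illustrated outside Power BI with a minimal Python sketch. The table contents, column names, and helper functions below are hypothetical and are not taken from the Adventure Works files or from the project's actual queries. The sketch assumes a sales table in which one order has no matching row in the details table, and compares the revenue total after an inner join with the total after a left outer join.

```python
# Hypothetical sample data, invented for illustration only.
sales = [
    {"order_id": 1, "revenue": 100.0},
    {"order_id": 2, "revenue": 250.0},
    {"order_id": 3, "revenue": 75.0},
]

# Detail rows exist only for orders 1 and 2 (incomplete relational coverage).
details = {1: "Bikes", 2: "Accessories"}


def inner_join(sales_rows, detail_lookup):
    """Keep only sales rows that have a matching detail row."""
    return [
        dict(row, category=detail_lookup[row["order_id"]])
        for row in sales_rows
        if row["order_id"] in detail_lookup
    ]


def left_join(sales_rows, detail_lookup):
    """Keep every sales row; unmatched rows get category=None."""
    return [
        dict(row, category=detail_lookup.get(row["order_id"]))
        for row in sales_rows
    ]


def total_revenue(rows):
    return sum(row["revenue"] for row in rows)


source_total = total_revenue(sales)
inner_total = total_revenue(inner_join(sales, details))
left_total = total_revenue(left_join(sales, details))

# Orders that the inner join silently dropped.
unmatched = [
    row["order_id"]
    for row in left_join(sales, details)
    if row["category"] is None
]

print(source_total, inner_total, left_total, unmatched)
```

In this toy case the inner join drops order 3, so the reported revenue falls below the source total, while the left outer join preserves the total and exposes the unmatched order for review. Comparing totals before and after a merge is one simple way to catch this kind of loss.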

Key points

  • Imported and consolidated multi-source Excel datasets
  • Cleaned transactional sales data using Power Query
  • Profiled columns to detect anomalies and outliers
  • Validated data integrity after merge operations
  • Combined yearly sales records using append queries
  • Identified incomplete relational coverage in transactional details
  • Investigated revenue inconsistencies caused by Inner Join behavior
  • Evaluated merge strategies and analytical reliability
  • Built analysis-ready datasets for reporting workflows
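
The append and profiling steps listed above can be sketched in Python as well. This is an illustration under stated assumptions, not the project's Power Query logic: the yearly tables, the `amount` column, and the outlier rule (a value above three times the median of valid amounts) are all hypothetical choices made for the example.

```python
from statistics import median

# Hypothetical yearly extracts, invented for illustration only.
sales_2019 = [
    {"order_id": 1, "amount": 120.0},
    {"order_id": 2, "amount": 80.0},
]
sales_2020 = [
    {"order_id": 3, "amount": 95.0},
    {"order_id": 4, "amount": None},    # missing value
    {"order_id": 5, "amount": -10.0},   # non-positive value
    {"order_id": 6, "amount": 5000.0},  # suspiciously large value
]


def append_years(*tables):
    """Stack yearly tables into one list, tagging each row with its year."""
    combined = []
    for year, rows in tables:
        combined.extend(dict(row, year=year) for row in rows)
    return combined


def profile(rows, outlier_factor=3.0):
    """Flag missing, non-positive, and unusually large amounts.

    The outlier rule is an assumption made for this sketch:
    amount > outlier_factor * median of the valid amounts.
    """
    valid = [r["amount"] for r in rows
             if r["amount"] is not None and r["amount"] > 0]
    threshold = outlier_factor * median(valid)
    report = {"missing": [], "non_positive": [], "outlier": []}
    for r in rows:
        amount = r["amount"]
        if amount is None:
            report["missing"].append(r["order_id"])
        elif amount <= 0:
            report["non_positive"].append(r["order_id"])
        elif amount > threshold:
            report["outlier"].append(r["order_id"])
    return report


combined = append_years((2019, sales_2019), (2020, sales_2020))

# An append should neither drop nor duplicate rows.
assert len(combined) == len(sales_2019) + len(sales_2020)

report = profile(combined)
print(report)
```

Checking the row count after the append and reviewing the flagged rows before any merge keeps data quality problems from propagating into the reporting tables.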