Sales ETL & Data Quality Pipeline
An end-to-end ETL workflow in Power BI.
The Project
This project focuses on building an end-to-end ETL workflow in Power BI to clean, validate, and consolidate fragmented sales datasets into reliable, analysis-ready reporting structures.
The workflow uses Adventure Works sales data distributed across multiple Excel sources and covers data cleaning, anomaly detection, append and merge operations, and relational integrity validation in Power Query.
A major focus of the project was identifying how incomplete relational coverage and incorrect join strategies can distort business metrics, highlighting the importance of validating data integrity before performing analytical reporting.
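The join effect described above can be illustrated with a minimal sketch in plain Python rather than Power Query. The table names, columns, and values are hypothetical and are not taken from the Adventure Works workbooks, and the `merge` helper is written only for this illustration. It assumes each order has at most one detail row.

```python
# Hypothetical sample data, not taken from the Adventure Works workbooks.
sales = [
    {"order_id": 1, "revenue": 100.0},
    {"order_id": 2, "revenue": 250.0},
    {"order_id": 3, "revenue": 400.0},
]
# Detail rows exist only for orders 1 and 2 (incomplete relational coverage).
details = [
    {"order_id": 1, "product": "Helmet"},
    {"order_id": 2, "product": "Gloves"},
]


def merge(left, right, key, how):
    """Join two lists of dicts on `key`; `how` is "inner" or "left"."""
    index = {}
    for row in right:
        index.setdefault(row[key], []).append(row)
    out = []
    for row in left:
        matches = index.get(row[key], [])
        if matches:
            out.extend({**row, **match} for match in matches)
        elif how == "left":
            # Keep the unmatched left row instead of dropping it.
            out.append(dict(row))
    return out


def total_revenue(rows):
    return sum(row["revenue"] for row in rows)


source_total = total_revenue(sales)
inner_total = total_revenue(merge(sales, details, "order_id", "inner"))
left_total = total_revenue(merge(sales, details, "order_id", "left"))

# Orders with no matching detail row: the coverage gap behind the difference.
detail_keys = {row["order_id"] for row in details}
unmatched = [row["order_id"] for row in sales if row["order_id"] not in detail_keys]

print(source_total, inner_total, left_total, unmatched)
# prints: 750.0 350.0 750.0 [3]
```

In this sketch the inner join silently drops order 3, so the merged total no longer matches the source total. Comparing totals and listing unmatched keys before and after a merge is one way to catch that kind of gap.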
Key points
Imported and consolidated multi-source Excel datasets
Cleaned transactional sales data using Power Query
Profiled columns to detect anomalies and outliers
Validated data integrity after merge operations
Combined yearly sales records using append queries
Identified incomplete relational coverage in transactional details
Investigated revenue inconsistencies caused by Inner Join behavior
Evaluated merge strategies and analytical reliability
Built analysis-ready datasets for reporting workflows
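The append and column-profiling steps listed above can be sketched roughly as follows, again in plain Python rather than Power Query. The yearly tables, column names, and the outlier rule (values more than ten times the median) are assumptions made for illustration, not the project's actual data or thresholds.

```python
from statistics import median

# Hypothetical yearly extracts, not the actual Adventure Works files.
sales_year_a = [
    {"order_id": 1, "amount": 120.0},
    {"order_id": 2, "amount": 80.0},
    {"order_id": 3, "amount": None},  # missing value to be surfaced by profiling
]
sales_year_b = [
    {"order_id": 4, "amount": 100.0},
    {"order_id": 5, "amount": 95000.0},  # suspiciously large value
]


def append_tables(*tables):
    """Stack tables with the same columns, similar in spirit to an append query."""
    combined = []
    for table in tables:
        combined.extend(table)
    return combined


def profile(rows, column):
    """Return basic column statistics: row count, missing count, min, max, median."""
    values = [row[column] for row in rows if row[column] is not None]
    return {
        "count": len(rows),
        "missing": len(rows) - len(values),
        "min": min(values),
        "max": max(values),
        "median": median(values),
    }


def flag_outliers(rows, column, factor=10):
    """Flag rows whose value exceeds `factor` times the median (an assumed rule)."""
    threshold = factor * profile(rows, column)["median"]
    return [
        row for row in rows
        if row[column] is not None and row[column] > threshold
    ]


combined = append_tables(sales_year_a, sales_year_b)
stats = profile(combined, "amount")
outlier_ids = [row["order_id"] for row in flag_outliers(combined, "amount")]

print(stats)
print(outlier_ids)
```

A median-based rule is used here only because a single extreme value distorts it less than it would a mean. Any real threshold would depend on the data being profiled.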