Duration: 30 minutes. Follow each step and track your completion status.
Define treatment and control columns in your dataset.
Check missing values and flag suspicious entries.
Create a cleaned subset excluding incomplete critical rows.
Visualize treatment vs control quickly (boxplot or scatter).
Document at least 3 potential data quality risks.
Mark complete when your dataset is analysis-ready for module 4.
Back to Workshop overview