Walgreens Interview Question

How do you handle data quality issues in a large dataset?

Interview Answer

Anonymous

Oct 11, 2024

I usually just remove any data that looks incorrect or incomplete, and I don’t spend too much time analyzing it. If there are missing values, I delete those rows, and if there are duplicates, I just remove them without checking further. I assume the data is good enough if it loads into the system without any errors