Making Data Useful

How Good Data Goes Bad

The data quality crisis no one is talking about

Cassie Kozyrkov
3 min readSep 26, 2023

--

A rule of thumb to save you tears in the long run is to assume every dataset is more like a hoarder’s storage locker than a well-curated museum until proven otherwise.

When in doubt, assume your data’s a junkyard.

But even if you’re not dealing with a dataset that’s a hoardsplosion of we-may-as-wells, there are two ways that fit-for-purpose data turns into garbage:

--

--

Cassie Kozyrkov

Chief Decision Scientist, Google. ❤️ Stats, ML/AI, data, puns, art, theatre, decision science. All views are my own. twitter.com/quaesita