Gotta love the 80/20 rule. It's a universal phenomenon that applies in every aspect of data management and exploitation. Here's an instance from a data scientist working in housing: “Anecdotally, a data scientist will spend 80% of his or her time cleaning a data set.” This is tragic. The valuable time and skills of motivated professionals are being squandered on dumb, repetitive tasks that could be removed at a stroke – with industry data standards.
Whenever we see people in this kind of data mess, we don't need to look far for either the cause or the solution. Every community that suffers needless data complexity is generating its own problem, which it can cure with data standards. Let's stop wasting people's lives with thoughtless data creation habits. This is a business hygiene issue. Down with dirty data! HACT