Discussion about this post

Mike Emeigh

I'm amazed that this lengthy article never once mentions the largest potential source of error in the data: bias, whether from undetected biases in the population or from inappropriately selective sampling. Data teams need to check continually that the data being used is truly representative of the population being modeled, and to evaluate whether population biases, not to mention their own, are creeping in.
