A framework for understanding label leakage in machine learning for health care.
The pitfalls of label leakage, contamination of model input features with outcome information, are well established. Unfortunately, avoiding label leakage in clinical prediction models requires more nuance than the common advice of applying "no time machine rule."
Author(s): Davis, Sharon E, Matheny, Michael E, Balu, Suresh, Sendak, Mark P
DOI: 10.1093/jamia/ocad178