Keeping synthetic patients on track: feedback mechanisms to mitigate performance drift in longitudinal health data simulation.
Synthetic data are increasingly relied upon to share electronic health record (EHR) data while maintaining patient privacy. Current simulation methods can generate longitudinal data, but the results are unreliable for several reasons. First, the synthetic data drifts from the real data distribution over time. Second, the typical approach to quality assessment, which is based on the extent to which real records can be distinguished from synthetic records using a critic [...]
Author(s): Zhang, Ziqi, Yan, Chao, Malin, Bradley A
DOI: 10.1093/jamia/ocac131