A framework for employing longitudinally collected multicenter electronic health records to stratify heterogeneous patient populations on disease history.
To facilitate patient disease subset and risk factor identification by constructing a pipeline which is generalizable, provides easily interpretable results, and allows replication by overcoming electronic health records (EHRs) batch effects.
Author(s): Maurits, Marc P, Korsunsky, Ilya, Raychaudhuri, Soumya, Murphy, Shawn N, Smoller, Jordan W, Weiss, Scott T, Huizinga, Thomas W J, Reinders, Marcel J T, Karlson, Elizabeth W, van den Akker, Erik B, Knevel, Rachel
DOI: 10.1093/jamia/ocac008