Democratizing EHR analyses with FIDDLE: a flexible data-driven preprocessing pipeline for structured clinical data.
In applying machine learning (ML) to electronic health record (EHR) data, many decisions must be made before any ML is applied; such preprocessing requires substantial effort and can be labor-intensive. As the role of ML in health care grows, there is an increasing need for systematic and reproducible preprocessing techniques for EHR data. Thus, we developed FIDDLE (Flexible Data-Driven Pipeline), an open-source framework that streamlines the preprocessing of data extracted [...]
Author(s): Tang, Shengpu, Davarmanesh, Parmida, Song, Yanmeng, Koutra, Danai, Sjoding, Michael W, Wiens, Jenna
DOI: 10.1093/jamia/ocaa139