Supervised embedding of textual predictors with applications in clinical diagnostics for pediatric cardiology.
Electronic health records possess critical predictive information for machine-learning-based diagnostic aids. However, many traditional machine learning methods fail to simultaneously integrate textual data into the prediction process because of its high dimensionality. In this paper, we present a supervised method using Laplacian Eigenmaps to enable existing machine learning methods to estimate both low-dimensional representations of textual data and accurate predictors based on these low-dimensional representations at the same time.
Author(s): Perry, Thomas Ernest, Zha, Hongyuan, Zhou, Ke, Frias, Patricio, Zeng, Dadan, Braunstein, Mark
DOI: 10.1136/amiajnl-2013-001792