High-throughput multimodal automated phenotyping (MAP) with application to PheWAS.
Electronic health records linked with biorepositories are a powerful platform for translational studies. A major bottleneck exists in the ability to phenotype patients accurately and efficiently. The objective of this study was to develop an automated high-throughput phenotyping method integrating International Classification of Diseases (ICD) codes and narrative data extracted using natural language processing (NLP).
Author(s): Liao, Katherine P; Sun, Jiehuan; Cai, Tianrun A; Link, Nicholas; Hong, Chuan; Huang, Jie; Huffman, Jennifer E; Gronsbell, Jessica; Zhang, Yichi; Ho, Yuk-Lam; Castro, Victor; Gainer, Vivian; Murphy, Shawn N; O'Donnell, Christopher J; Gaziano, J Michael; Cho, Kelly; Szolovits, Peter; Kohane, Isaac S; Yu, Sheng; Cai, Tianxi
DOI: 10.1093/jamia/ocz066