A semi-supervised approach for rapidly creating clinical biomarker phenotypes in the UK Biobank using different primary care EHR and clinical terminology systems.
The UK Biobank (UKB) is making primary care electronic health records (EHRs) for 500 000 participants available for COVID-19-related research. Data are extracted from four sources, recorded using five clinical terminologies and stored in different schemas. The aims of our research were to: (a) develop a semi-supervised approach for bootstrapping EHR phenotyping algorithms in UKB EHR, and (b) to evaluate our approach by implementing and evaluating phenotypes for 31 common biomarkers.
Author(s): Denaxas, Spiros, Shah, Anoop D, Mateen, Bilal A, Kuan, Valerie, Quint, Jennifer K, Fitzpatrick, Natalie, Torralbo, Ana, Fatemifar, Ghazaleh, Hemingway, Harry
DOI: 10.1093/jamiaopen/ooaa047