Quantitatively assessing the impact of the quality of SNOMED CT subtype hierarchy on cohort queries.
SNOMED CT provides a standardized terminology for clinical concepts, allowing cohort queries over heterogeneous clinical data including Electronic Health Records (EHRs). While it is intuitive that missing and inaccurate subtype (or is-a) relations in SNOMED CT reduce the recall and precision of cohort queries, the extent of these impacts has not been formally assessed. This study fills this gap by developing quantitative metrics to measure these impacts and performing statistical [...]
Author(s): Hao, Xubing, Li, Xiaojin, Huang, Yan, Shi, Jay, Abeysinghe, Rashmie, Tao, Cui, Roberts, Kirk, Zhang, Guo-Qiang, Cui, Licong
DOI: 10.1093/jamia/ocae272