Evaluating the state-of-the-art in automatic de-identification.
To facilitate and survey studies in automatic de-identification, as a part of the i2b2 (Informatics for Integrating Biology to the Bedside) project, authors organized a Natural Language Processing (NLP) challenge on automatically removing private health information (PHI) from medical discharge records. This manuscript provides an overview of this de-identification challenge, describes the data and the annotation process, explains the evaluation metrics, discusses the nature of the systems that addressed the [...]
Author(s): Uzuner, Ozlem, Luo, Yuan, Szolovits, Peter
DOI: 10.1197/jamia.M2444