Improving domain adaptation in de-identification of electronic health records through self-training.
De-identification is a fundamental task in electronic health records to remove protected health information entities. Deep learning models have proven to be promising tools to automate de-identification processes. However, when the target domain (where the model is applied) is different from the source domain (where the model is trained), the model often suffers a significant performance drop, commonly referred to as domain adaptation issue. In de-identification, domain adaptation issues can [...]
Author(s): Liao, Shun, Kiros, Jamie, Chen, Jiyang, Zhang, Zhaolei, Chen, Ting
DOI: 10.1093/jamia/ocab128