pyDeid: an improved, fast, flexible, and generalizable rule-based approach for deidentification of free-text medical records.
Deidentification of personally identifiable information in free-text clinical data is fundamental to making these data broadly available for research. However, existing deidentification tools are limited in functionality and flexibility, and they strike suboptimal tradeoffs between deidentification accuracy and speed. To address these gaps and tradeoffs, we develop pyDeid, a new Python-based deidentification tool.
Author(s): Sundrelingam, Vaakesan; Parimoo, Shireen; Pogacar, Frances; Koppula, Radha; Shin, Saeha; Pou-Prom, Chloe; Roberts, Surain B; Verma, Amol A; Razak, Fahad
DOI: 10.1093/jamiaopen/ooae152