Enabling qualitative research data sharing using a natural language processing pipeline for deidentification: moving beyond HIPAA Safe Harbor identifiers.
Sharing health research data is essential for accelerating the translation of research into actionable knowledge that can impact health care services and outcomes. Qualitative health research data are rarely shared due to the challenge of deidentifying text and the potential risks of participant reidentification. Here, we establish and evaluate a framework for deidentifying qualitative research data using automated computational techniques including removal of identifiers that are not considered HIPAA Safe [...]
Author(s): Gupta, Aditi; Lai, Albert; Mozersky, Jessica; Ma, Xiaoteng; Walsh, Heidi; DuBois, James M
DOI: 10.1093/jamiaopen/ooab069