Enabling realistic health data re-identification risk assessment through adversarial modeling.
Re-identification risk methods for biomedical data often assume a worst case, in which attackers know all identifiable features (eg, age and race) about a subject. Yet, worst-case adversarial modeling can overestimate risk and induce heavy editing of shared data. The objective of this study is to introduce a framework for assessing the risk considering the attacker's resources and capabilities.
Author(s): Xia, Weiyi, Liu, Yongtai, Wan, Zhiyu, Vorobeychik, Yevgeniy, Kantacioglu, Murat, Nyemba, Steve, Clayton, Ellen Wright, Malin, Bradley A
DOI: 10.1093/jamia/ocaa327