De-Identifying Government Datasets: Techniques and Governance

Category Science

tldr #

NIST has published a document detailing the de-identification techniques used to protect government datasets as well as the potential risks associated with traditional de-identification approaches when compared to formal privacy methods such as differential privacy.


content #

NIST has published Special Publication (SP) 800-188, De-Identifying Government Datasets: Techniques and Governance. De-identification removes identifying information from a data set so that the remaining data cannot be linked to specific individuals. Government agencies can use de-identification to reduce the privacy risks associated with collecting, processing, archiving, distributing, or publishing government data.

Differential privacy is an approach to securely share data while preserving the privacy of individual records.

Previously, NIST published NIST Internal Report (IR) 8053, De-Identification of Personal Information, which provided a survey of de-identification and re-identification techniques. SP 800-188 provides specific guidance to government agencies that wish to use de-identification.

This final document was authored by experts at NIST and the U.S. Census Bureau and references up-to-date research and practices for both traditional de-identification approaches as well as the use of formal privacy methods, such as differential privacy to create de-identified datasets.

Using de-identification, a data set can be released publicly without putting the individual at risk of re-identification.

This document also addresses other approaches for making datasets that contain sensitive information available to researchers and for public transparency. Where appropriate, this document cautions users about the inherent limitations of traditional de-identification approaches when compared to formal privacy methods, such as differential privacy.


hashtags #
worddensity #

Share