U.S. Department of Commerce

Research Reports

You are here: Census.govSubjects A to ZResearch Reports Sorted by Year › Abstract of RRS2002/03
Skip top of page navigation

Working Papers for Mixture Model Additive Noise for Microdata Masking

William E. Yancey

KEY WORDS: microdata masking, information loss statistics, record linkage re-identification


We consider some aspect of using an additive mixture noise model for real microdata masking as a generalization to using normally distributed masking noise introduced by Roque. We introduce a simplified procedure for computing additive mixture noise and consider the effectiveness of this approach from the point of view of information loss measures and record re-identification. We concentrate on the information loss statistics for the variance/covariance matrix of the full data set and for arbitrary subsets. We consider some of the information loss statistics introduced by Domingo-Ferrer and we introduce some analytic alternatives. We see that for the full data sets, the analytic properties are well preserved and the data masking is effective. The analytic properties are less well preserved on the subsets of this highly skewed data. We include some SAS programs used in the study.


Source: U.S. Census Bureau, Statistical Research Division

Created: 30-MAY-2002

Source: U.S. Census Bureau | Statistical Research Division | (301) 763-3215 (or chad.eric.russell@census.gov) |   Last Revised: October 08, 2010