combining rules; confidentiality; rejection sampling; statistical disclosure limitation; top code data.
A statistical analysis of data that have been multiplied by randomly drawn noise variables in order to protect the confidentiality of individual values has recently drawn some attention. If the distribution generating the noise variables has low to moderate variance, then noise multiplied data have been shown to yield accurate inferences in several typical parametric models under a formal likelihood based analysis. However, the likelihood based analysis is generally complicated due to the non-standard and often complex nature of the distribution of the noise perturbed sample even when the parent distribution is simple. This complexity places a burden on data users who must either develop the required statistical methods or implement the methods if already available or have access to specialized software perhaps yet to be developed. In this paper we propose an alternate analysis of noise multiplied data based on multiple imputation. Some advantages of this approach are that (1) the data user can analyze the released data as if it were never perturbed, and (2) the distribution of the noise variables does not need to be disclosed to the data user.
Martin Klein and Bimal Sinha. (2013). Statistical Analysis of Noise Multiplied Data Using Multiple Imputation. Center for Statistical Research & Methodology, Research and Methodology Directorate Research Report Series (Statistics #2013-01). U.S. Census Bureau. Available online at <http://www.census.gov/srd/papers/pdf/rrs2013-01.pdf>.
[PDF] or
denotes a file in Adobe’s Portable Document Format. To view the file, you will need the Adobe® Reader®
available free from Adobe.
This symbol
indicates a link to a non-government web site. Our linking to these sites does not constitute an endorsement of any products, services or the information found on them. Once you link to another site you are subject to the policies of the new site.