Research Reports

You are here: Census.govSubjects A to ZResearch Reports Sorted by Year › Abstract of RRS2007/19
Skip top of page navigation

Analytically Valid Discrete Microdata Files and Re-identification

William E. Winkler

KEY WORDS: Data Quality, Loglinear Model Fit, Missing Data, Convex Constraints


Loglinear modeling methods have become quite straightforward to apply to discrete data X. A good-fitting loglinear model can be used to generate synthetic copies of X1, …, Xn of X that preserve analytic properties but may allow reidentification of small cells. With fitting algorithms that use more general convex constraints and are designed to deal with missing data, we are able to disperse the counts associated with small cells over other cells in a manner that reduces reidentification risk while still maintaining most analytic properties.


Source: U.S. Census Bureau, Statistical Research Division

Created: December 10, 2007
Last revised: December 10, 2007

[PDF] or PDF denotes a file in Adobe’s Portable Document Format. To view the file, you will need the Adobe® Reader® Off Site available free from Adobe.

This symbol Off Site indicates a link to a non-government web site. Our linking to these sites does not constitute an endorsement of any products, services or the information found on them. Once you link to another site you are subject to the policies of the new site.

Source: U.S. Census Bureau | Statistical Research Division | (301) 763-3215 (or |   Last Revised: October 08, 2010