Simulation studies that are carefully designed under realistic sample survey or census conditions can be used to evaluate the quality of new statistical methodology for Census Bureau data. Furthermore, new computationally intensive statistical methodology is often beneficial because it can require less strict assumptions, offer more flexibility in sampling or modeling, accommodate complex features in the data, enable valid inference where other methods might fail, etc. Statistical modeling is at the core of the design of realistic simulation studies and the development of computationally intensive statistical methods. Modeling also enables one to efficiently use all available information when producing estimates. Such studies can benefit from software for data processing, especially large data sets from nontraditional sources. Data visualizations can help reveal insights. Statistical disclosure avoidance methods are also developed and properties studied.
Guin, A., Roy, A., and Sinha, B. (In Press). “Bayesian Analysis of Singly Imputed Partially Synthetic Data Generated by Plug-In Sampling and Posterior Predictive Sampling under the Multiple Linear Regression Model,” International Journal of Statistical Applications.
Moura, R., Klein, M., Zylstra, J., Coelho, C., and Sinha, B. (In Press). “Inference for Multivariate Regression Model Based on Synthetic Data Generated Under Plug-In Sampling,” Journal of the American Statistical Association (Theory & Methods).
Chai, J. and Nayak, T.K. (2021). “Minimax Randomized Response Methods for Protecting Respondent’s Privacy,” Communications in Statistics - Theory and Methods, https://doi.org/10.1080/03610926.2021.1973503
Klein, M., Wright, T., and Wieczorek, J. (2020). “A Joint Confidence Region for an Overall Ranking of Populations,” Journal of the Royal Statistical Society, Series C, 69, Part 3, 589-606.
Klein, M.D., Zylstra, J., and Sinha, B.K. (2019). “Finite Sample Inference for Multiply Imputed Synthetic Data under a Multiple Linear Regression Model,” Calcutta Statistical Association Bulletin. https://doi.org/10.1177/0008068318803814
Wright, T., Klein, M., and Wieczorek, J. (2019). “A Primer on Visualizations for Comparing Populations Including the Issue of Overlapping Confidence Intervals,” The American Statistician, Vol 73, No. 2, 165-178.
Chai, J. and Nayak, T.K. (2018). “A Criterion for Privacy Protection in Data Collection and Its Attainment via Randomized Response Procedures,” Electronic Journal of Statistics, 12, 4264-4287.
Klein, M. and Datta, G. (2018). “Statistical Disclosure Control via Sufficiency under the Multiple Linear Regression Model,” Journal of Statistical Theory and Practice, 12(1), 100-110.
Nayak, T.K., Zhang, C., and You, J. (2018). “Measuring Identification Risk in Microdata Release and Its Control by Post-randomisation,” International Statistical Review, 86(2), 300-321.
Moura, R., Klein, M., Coelho, C., and Sinha, B. (2017). "Inference for Multivariate Regression Model Based on Synthetic Data Generated under Fixed-Posterior Predictive Sampling: Comparison with Plug-in Sampling," REVSTAT – Statistical Journal, 15(2): 155-186.
Klein, M. and Sinha, B. (2016). “Likelihood Based Finite Sample Inference for Singly Imputed Synthetic Data under the Multivariate Normal and Multiple Linear Regression Models,” Journal of Privacy and Confidentiality, 7: 43-98.
Klein, M. and Sinha, B. (2015). “Inference for Singly Imputed Synthetic Data Based on Posterior Predictive Sampling under Multivariate Normal and Multiple Linear Regression Models,” Sankhya B: The Indian Journal of Statistics, 77-B, 293-311.
Klein, M. and Sinha, B. (2015). “Likelihood-Based Inference for Singly and Multiply Imputed Synthetic Data under a Normal Model,” Statistics and Probability Letters, 105, 168-175.
Klein, M. and Sinha, B. (2015). “Likelihood-Based Finite Sample Inference for Synthetic Data Based on Exponential Model,” Thailand Statistician: Journal of The Thai Statistical Association, 13, 33-47.
Wright, T., Klein, M., and Wieczorek, J. (2014). “Ranking Populations Based on Sample Survey Data,” Center for Statistical Research and Methodology, Research and Methodology Directorate Research Report Series (Statistics #2014-12). U.S. Census Bureau. Available online: https://www.census.gov/srd/papers/pdf/rrs2014-12.pdf.
Klein, M., Lineback, J.F., and Schafer, J. (2014). “Evaluating Imputation Techniques in the Monthly Wholesale Trade Survey,” Proceedings of the Joint Statistical Meetings, Alexandria, VA: American Statistical Association.
Klein, M., Mathew, T., and Sinha, B. (2014). “Noise Multiplication for Statistical Disclosure Control of Extreme Values in Log-normal Regression Samples.” Journal of Privacy and Confidentiality, 6, 77-125.
Klein, M., Mathew, T., and Sinha, B. (2014). “Likelihood Based Inference under Noise Multiplication,” Thailand Statistician: Journal of The Thai Statistical Association, 12, 1-23.
Wright, T., Klein, M., and Wieczorek, J. (2013). “An Overview of Some Concepts for Potential Use in Ranking Populations Based on Sample Survey Data,” The 59th International Statistical Institute World Statistics Congress, Hong Kong, China.
Klein, M. and Sinha, B. (2013). “Statistical Analysis of Noise Multiplied Data Using Multiple Imputation,” Journal of Official Statistics, 29, 425-465.
Klein, M. and Linton, P. (2013). “On a Comparison of Tests of Homogeneity of Binomial Proportions,” Journal of Statistical Theory and Applications, 12, 208-224.
Klein, M., Mathew, T., and Sinha, B. (2013). “A Comparison of Statistical Disclosure Control Methods: Multiple Imputation versus Noise Multiplication.” Center for Statistical Research and Methodology, Research and Methodology Directorate Research Report Series (Statistics #2013-02). U.S. Census Bureau. Available online: https://www.census.gov/srd/papers/pdf/rrs2013-02.pdf.
Shao, J., Klein, M., and Xu, J. (2012). “Imputation for Nonmonotone Nonresponse in the Survey of Industrial Research and Development,” Survey Methodology, 38, 143-155.
Klein, M. and Wright, T. (2011). “Ranking Procedures for Several Normal Populations: An Empirical Investigation,” International Journal of Statistical Sciences, 11, 37-58.
Nayak, T.K., Sinha, B., and Zayatz, L. (2011). “Statistical Properties of Multiplicative Noise Masking for Confidentiality Protection,” Journal of Official Statistics, 27 (3), 527-544.
Sinha, B., Nayak, T.K., and Zayatz, L. (2011). “Privacy Protection and Quantile Estimation from Noise Multiplied Data,” Sankhya, Ser. B, 73, 297-315.
Klein, M. and Creecy, R. (2010). “Steps toward Creating a Fully Synthetic Decennial Census Microdata File,” Proceedings of the Joint Statistical Meetings, Alexandria, VA: American Statistical Association.
Tommy Wright, Tapan Nayak, Bimal Sinha, Nathan Yau
0331 – Working Capital Fund / General Research Project
Various Decennial, Demographic, and Economic Projects