Missing data problems are endemic in the conduct of statistical experiments and data collection operations. The investigators almost never observe all the outcomes they had set to record. When dealing with sample surveys, this means that individuals or entities in the survey do not respond at all or give only part of the information they are being asked to provide. Even if a response is obtained, the information provided may be logically inconsistent making such responses in effect missing. Statistical agencies compensate for these types of missing data in computing reliable official statistics using methods such as imputation and survey weight adjustment. Such techniques utilize non-missing survey information to methodically "fill in” the missing data. As data collection becomes more expensive and response rates decrease, observational data sources such as administrative records and commercial data providing alternate information on individuals or entities becomes more available. Deeper model-based imputation and survey weight adjustment methods are useful for improving and/or evaluating how sample survey or census data can be supplemented with information obtained from quality observational data. All these missing data problems and associated techniques involve statistical modeling along with subject matter experience.
Ibrahim, S., Mazumder, R., Radchenko, P., and Ben-David, E. (In Press). "Predicting Census Survey Response Rates with Parsimonious Additive Models and Structured Interactions," The Annals of Applied Statistics.
Kaputa, S., Morris, D.S., and Holan, S. (2024). “Bayesian Multi-Source Hierarchical Models with Applications to the Monthly Retail Trade Survey,” Journal of Survey Statistics and Methodology.
Kang, J., Morris, D.S., Joyce, P., and Dompreh, I. (2023). “On Calibrated Inverse Probability Weighting and Generalized Boosting Propensity Score Models for Mean Estimation with Incomplete Survey Data,” Wiley Interdisciplinary Reviews (WIREs) Computational Statistics.
Morris, D.S. and Sellers, K.F. (2022). “A Flexible Mixed Model for Clustered Count Data,” Stats: Special Issue on Statistics, Data Analytics, and Inferences for Discrete Data, 5(1): 52–69. https://doi.org/10.3390/stats5010004.
Morris, D.S., Raim, A.M., and Sellers, K.F. (2020). “Conway-Maxwell-Multinomial Distribution for Flexible Modeling of Clustered Categorical Data,” Journal of Multivariate Analysis, 179.
Dumbacher, B., Morris, D.S., and Hogue, C. (2019). “Using Electronic Transaction Data to Add Geographic Granularity to Official Estimates of Retail Sales,” Journal of Big Data, 6(80).
Keller, A., Mule, V.T., Morris, D.S., and Konicki, S. (2018). "A Distance Metric for Modeling the Quality of Administrative Records for Use in the 2020 Census," Journal of Official Statistics, 34(3): 1-27.
Morris, D. S. (2017). “A Modeling Approach for Administrative Record Enumeration in the Decennial Census,” Public Opinion Quarterly: Special Issue on Survey Research, Today and Tomorrow, 81(S1): 357-384.
Thibaudeau Y., Slud, E., and Gottschalck, A. O. (2017). “Modeling Log-Linear Conditional Probabilities for Estimation in Surveys,” Annals of Applied Statistics 11(2), 680-697.
Morris, D.S., Keller, A., and Clark, B. (2016). “An Approach for Using Administrative Records to Reduce Contacts in the 2020 Census,” Statistical Journal of the International Association for Official Statistics, 32(2): 177-188.
Thibaudeau, Y. (2002). “Model Explicit Item Imputation for Demographic Categories,” Survey Methodology, 28(2), 135-143.
Powers, R., Eltinge, J., Martinez, W., and Morris, D.S. (2024). “Using Linked Micromaps for Evidence-Based Policy,” In JSM Proceedings, Section on Statistical Graphics. Alexandria, VA: American Statistical Association.
Morris, D.S. and Raim, A.M. (2023). “Comparing Trial and Variable Association in Contingency Table Data Using Multinomial Models for Clustered Data,” in Proceedings of the 37th International Workshop on Statistical Modelling. Dortmund, Germany: Statistical Modelling Society, 536-542.
Winkler, W. E. (2018). “Cleaning and Using Administrative Lists: Enhanced Practices and Computational Algorithms for Record Linkage and Modeling/Edit/Imputation,” Research Report Series (Statistics #2018-05), Center for Statistical Research and Methodology, U.S. Census Bureau, Washington, D.C.
Thibaudeau, Y. and Morris, D.S. (2016). “Bayesian Decision Theory to Optimize the Use of Administrative Records in Census NRFU,” Proceedings of the Joint Statistical Meetings. Alexandria, VA: American Statistical Association.
Bechtel, L., Morris, D.S., and Thompson, K.J. (2015). “Using Classification Trees to Recommend Hot Deck Imputation Methods: A Case Study,” in FCSM Proceedings. Washington, D.C: Federal Committee on Statistical Methodology.
Garcia, M., Morris, D.S., and Diamond, L.K. (2015). “Implementation of Ratio Imputation and Sequential Regression Multivariate Imputation on Economic Census Products,” Proceedings of the Joint Statistical Meetings.
Winkler, W. and Garcia, M. (2009). “Determining a Set of Edits,” Research Report Series (Statistics #2009-05), Statistical Research Division, U.S. Census Bureau, Washington, D.C.
Winkler, W. E. (2008). “General Methods and Algorithms for Imputing Discrete Data under a Variety of Constraints,” Research Report Series (Statistics #2008-08), Statistical Research Division, U.S. Census Bureau, Washington D.C.
Darcy Morris, Joseph Kang, Isaac Dompreh, Yves Thibaudeau, Jun Shao, Emanuel Ben-David, Sixia Chen (ASA/NSF/Census Research Fellow/University of Oklahoma Health Sciences)
0331 – Working Capital Fund / General Research Project
Various Decennial, Demographic, and Economic Projects