Estimates from the CPS ASEC provide the measures of income and poverty that serve as the dependent variables in the regression models. The CPS ASEC estimates were chosen over those from the decennial census for several reasons. First, the CPS ASEC provides the only timely, consistent series of income and poverty estimates during the intercensal period. Second, the CPS ASEC is the official source of national poverty estimates. Third, if we relied on estimates from the most recent census as the dependent variable we would have to assume that the relationship between the dependent and predictor (administrative) variables remained constant during the postcensal period; events have already proven that assumption false. Hence, using the CPS ASEC data permits us to develop new sets of equations for each target year during the decade and these equations will reflect then-current relationships between income, poverty, and their predictor variables.
Choosing of the CPS ASEC as the dependent variable in our regression models has two consequences. First, postcensal estimates are not updates of the census income and poverty measures, because the CPS and the decennial census are known to estimate different measures of income and poverty; the model-based intercensal estimates are not directly comparable to the census (further information is available by returning to the Methodology page and consulting the "Cautions" section). Second, because the CPS ASEC sample size is relatively small, we divide the task of providing intercensal estimates for states and counties into related but separate modeling efforts. While the CPS ASEC sample sizes for some states are large enough to permit the derivation of direct state estimates for some of the key statistics, they are not sufficient for all statistics in some states or any statistics in most states. Direct, useable estimates from the CPS ASEC are possible for only a handful of counties, and only slightly more than one-third of all counties contain any sample households. The strategy of separating the state and county models was adopted because we felt that models constructed for states would be superior in terms of goodness-of-fit, and that their results could provide "controls" to which the weaker county estimates could be adjusted.
For the state regression models, single-year CPS ASEC estimates are used as the dependent variable. In the case of the county regression models, a three-year average of the CPS ASEC income and poverty estimates (e.g., 1992-1994 for 1993) is used. A county regression equation is estimated on the basis of observations from the 1200 counties included in the CPS ASEC sample. From this estimated equation and known values of administrative variables, a regression "prediction" is obtained for each county. For each county with sample cases in the CPS, the model prediction is combined with the direct sample estimate, with each component receiving a weight. The sum of the two weights for each county is 1.0; the weight for the model prediction component is the ratio of the sampling variance of the direct estimate to the total variance (sampling plus "lack of fit") of the direct estimate. Using this technique, the more uncertain the direct sample estimate, the larger the contribution from the regression model. These weights are commonly referred to as "shrinkage weights" and the final estimates as "shrinkage estimates." For counties which are not in the CPS sample, the estimates are based solely on the regression equation.
Shrinkage techniques are commonly used in estimating values for small geographic areas. We use them to help reduce the uncertainty of our estimates and to take advantage of all the information we have. However, significant reductions in variances are achieved only in a few counties where the CPS ASEC sample size is relatively large. The average of the weights on the direct CPS estimate, over all counties, is less than 0.02 for related children 5-17 and all people under age 18 in poverty, and less than 0.03 for people of all ages in poverty. Of course, over half the counties have zero weights on their direct estimates because they are not in the CPS sample. For only a handful of counties (7 to 49 depending on the poverty population being estimated and the year) are the weights on the direct CPS estimates 0.25 or more, and only 1 to 3 of these weights exceed 0.50.
The last step in our state and county estimation procedure is to use a simple
ratio technique to control the sum of the number in poverty to the state-level
estimates and the CPS ASEC national estimates. Estimated medians for counties
are not similarly controlled to state or national medians. For ease of reference,
we usually refer to the final county estimates as 'model-based'.