Annual microdata files contain all Survey of Construction (SOC) sampled single-family houses started, sold, and/or completed during the year. Houses authorized by building permits but not started at the end of the year, under construction at the end of the year, or for sale at the end of the year are also included. Houses for which construction was abandoned after permit issuance or after start are not included.
SOC is a national sample survey of new houses selected from building permits and from a canvassing of areas that do not require permits. Builders or owners of the selected houses are interviewed for information including start, sale, and completion dates, and more than 40 physical and financial characteristics of the houses. The characteristics are collected throughout the construction process, so response rates are highest for completed homes. The Department of Housing and Urban Development (HUD) funds the collection of these characteristics and allows the Census Bureau to provide the data to the public free of charge. The overall national sampling rate is about 1 in 50 new houses, although this varies considerably by individual survey location based on activity.
Available geography on the files includes the nine Census Divisions (DIV) and whether a house is inside or outside a metropolitan area (METRO).
All new houses are classified into one of four categories (CAT):
To tabulate records by characteristics for a given year (for example, all houses completed in 2012), select on one of the available dates (start, sale, or completion), which appear in year/month (YYYYMM) format. The files also allow tabulations of the number of units authorized but not started, under construction (started but not completed), or for sale (built for sale but not sold) at the end of the year.
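The date-based selection described above can be sketched in Python. This is a minimal illustration, not an official procedure: the field names STRT, COMP, and ID are assumptions made for the example (consult the variable guide for the actual names), and treating blank or malformed dates as "no date" is also an assumption about how missing dates are coded.

```python
def parse_yyyymm(value):
    """Return (year, month) for a YYYYMM value, or None if blank or malformed."""
    text = str(value).strip()
    if len(text) != 6 or not text.isdigit():
        return None
    year, month = int(text[:4]), int(text[4:])
    if not 1 <= month <= 12:
        return None
    return year, month


def in_year(records, year, date_field):
    """Keep records whose date_field falls in the given calendar year."""
    kept = []
    for rec in records:
        parsed = parse_yyyymm(rec.get(date_field, ""))
        if parsed is not None and parsed[0] == year:
            kept.append(rec)
    return kept


def under_construction_at_year_end(records, year,
                                   start_field="STRT", comp_field="COMP"):
    """Keep records started by December of `year` and not completed by then."""
    cutoff = (year, 12)
    kept = []
    for rec in records:
        started = parse_yyyymm(rec.get(start_field, ""))
        completed = parse_yyyymm(rec.get(comp_field, ""))
        if (started is not None and started <= cutoff
                and (completed is None or completed > cutoff)):
            kept.append(rec)
    return kept


# Made-up records for illustration only; field names are hypothetical.
sample = [
    {"ID": 1, "STRT": "201109", "COMP": "201203"},
    {"ID": 2, "STRT": "201206", "COMP": "201301"},
    {"ID": 3, "STRT": "201211", "COMP": ""},
    {"ID": 4, "STRT": "201205", "COMP": "201212"},
]

completed_2012 = in_year(sample, 2012, "COMP")
under_construction_2012 = under_construction_at_year_end(sample, 2012)
print([r["ID"] for r in completed_2012])
print([r["ID"] for r in under_construction_2012])
```

Comparing (year, month) tuples rather than raw strings keeps the year-end cutoff explicit and avoids treating a blank date as an early one.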
Each record contains a sampling weight (WEIGHT) that indicates how many houses a sampled house represents. Tabulated totals from the files will not exactly match published estimates, primarily because the files were produced at different times. The files do not provide accurate tabulations of total units authorized by permits for a number of reasons.
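Because each sampled house stands in for WEIGHT houses, an estimated total is the sum of weights rather than a count of records. The sketch below illustrates this with made-up values; it assumes the records have already been read into dictionaries keyed by variable name, and the weights shown are invented for the example.

```python
from collections import defaultdict


def weighted_totals(records, group_field="DIV", weight_field="WEIGHT"):
    """Sum sampling weights within each value of group_field."""
    totals = defaultdict(float)
    for rec in records:
        totals[rec[group_field]] += float(rec[weight_field])
    return dict(totals)


# Made-up records for illustration only.
sample = [
    {"DIV": "5", "WEIGHT": "48.5"},
    {"DIV": "5", "WEIGHT": "51.5"},
    {"DIV": "9", "WEIGHT": "40.0"},
]

estimates = weighted_totals(sample)
for division, total in sorted(estimates.items()):
    print(division, total)
```

Here two sampled houses in one division represent an estimated 100 houses, while an unweighted count would report only 2.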
The files contain no information that would identify the specific addresses or builders of the sampled houses. In addition, certain variables such as sales price (SLPR) and square footage (SQFS) have been top-coded and bottom-coded to further reduce the possibility of disclosure. Cases modified in this way are identified with flags. The relationship of the modified variables with other variables will be altered.
Select a year and click "Download File" to download SOC Microdata. Because the files are large (up to 8 MB), they are provided in zipped form.
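Before computing averages on a top-coded or bottom-coded variable, it can help to check how much of the weighted total is affected. The following is a rough sketch under stated assumptions: the flag field name SLPR_F and the convention that any value other than blank or 0 marks a modified case are hypothetical, so check the variable guide for the actual flag names and codes.

```python
def flagged_share(records, flag_field="SLPR_F", weight_field="WEIGHT"):
    """Return the weighted share of records whose flag marks a modified value."""
    total = 0.0
    flagged = 0.0
    for rec in records:
        weight = float(rec[weight_field])
        total += weight
        # Assumption: blank or "0" means the value was not modified.
        if str(rec.get(flag_field, "")).strip() not in ("", "0"):
            flagged += weight
    return flagged / total if total else 0.0


# Made-up records for illustration only; SLPR_F is a hypothetical flag name.
sample = [
    {"SLPR": "250000", "SLPR_F": "0", "WEIGHT": "50"},
    {"SLPR": "900000", "SLPR_F": "1", "WEIGHT": "30"},
    {"SLPR": "310000", "SLPR_F": "0", "WEIGHT": "20"},
]

share = flagged_share(sample)
print(round(share, 3))
```

A large flagged share suggests that means and correlations involving that variable should be interpreted with caution; medians are generally less sensitive to top-coding.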
Information on each variable contained in the files is shown in the Microdata Variables Excel Guide [48 kb] or the Printer-friendly Microdata Guide PDF [261 kb]. Files for the previous year are released each year on the first workday of June.
If you are unsure how to use the Microdata files, you may want to download the Working with SOC Microdata guide to get started. This guide walks through the steps of creating basic data tables using Excel PivotTables.
To obtain SOC microdata file(s) in SAS format, please call (301) 763-5160.