FTP ASCII text data files
Contents
Uncompress ASCII text data files
Which data files do you need to download?
Uncompress ASCII text data files
The data files from the census FTP site contain a line feed character only at the end of each record. For successful use with many programs running in a Windows environment, these files need to be modified to use the ASCII carriage return/line feed sequence, chr(13) + chr(10), as a record terminator. This is an easy step in the UnZIP process using any UnZIP software which offers a conversion option. We tested PKZIP for Windows, version 4.00 following the steps outlined below. This PKZIP shareware can be downloaded from pkware.com. After installing PKZIP, perform the following steps:
Which data file(s) do you need to download?
All geographic area identifiers (area names and codes, ...) are in the geography file only. This file must be downloaded in order to identify the area being tabulated in the data files. See chapter 3: Subject Locator to identify tables of interest and then Chapter 2: How to Use This File to identify the data file(s) that contain these tables from the technical documentation.
Technical information
An SF1 data set consists of one geographic file and thirty nine data files. The geographic file has a fixed record length of 400 characters and is not field (or column) delimited. Each of the thirty nine data files are comma field delimited with variable record lengths. None of the ASCII text files contain header records (a first record with fieldnames).
The only relationship between any one file (geographic or data) and another is the field LOGRECNO (logical record number). This field is a unique key.
There is a one-to-one correspondence based on LOGRECNO between the geographic file and data files 01-11 and 37-39 only. Data files 12-36 contain only PCT tables. PCT tables contain data down to the census tract level only.
Imagine the geographic file and any or all of the thirty nine data files laid side by side or a very wide dataset sliced vertically. The data set was split into multiple files due to a limit in the number of fields allowed per record in most popular database and spreadsheet programs.
Electronic Products Development Branch
Last Updated: Wednesday, 26-Sep-2001 11:13:52 EDT
Administrative and Customer Services Division
U.S. Census Bureau