
Research Reports

Abstract of RRS2005/05

Evaluating String Comparator Performance for Record Linkage

William E. Yancey

KEY WORDS:

ABSTRACT

We compare variations of string comparators based on the Jaro-Winkler comparator and the edit distance comparator. We apply the comparators to Census data to see which are better classifiers for matches and non-matches, first by comparing their classification abilities using an ROC-curve-based analysis, then by directly comparing two candidate comparators in record linkage results.
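
For readers unfamiliar with the two comparator families named above, the following is a minimal sketch of their textbook forms. It is not the report's implementation: the variations evaluated in the report are not reproduced here, and the function names, the prefix weight of 0.1, the prefix cap of 4 characters, and the scaling of edit distance by the longer string's length are assumptions chosen for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum number of single-character insertions,
    deletions, and substitutions needed to turn a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def edit_similarity(a: str, b: str) -> float:
    """Edit distance rescaled to [0, 1], where 1 means identical.
    Dividing by the longer length is an assumed normalization."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a, b) / longest


def jaro(a: str, b: str) -> float:
    """Standard Jaro similarity in [0, 1]."""
    if a == b:
        return 1.0
    if not a or not b:
        return 0.0
    # Characters match only if they are equal and close enough in position.
    window = max(max(len(a), len(b)) // 2 - 1, 0)
    a_matched = [False] * len(a)
    b_matched = [False] * len(b)
    matches = 0
    for i, ca in enumerate(a):
        lo = max(0, i - window)
        hi = min(len(b), i + window + 1)
        for j in range(lo, hi):
            if not b_matched[j] and b[j] == ca:
                a_matched[i] = b_matched[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Matched characters that appear in a different order count as
    # half a transposition each.
    a_seq = [c for c, m in zip(a, a_matched) if m]
    b_seq = [c for c, m in zip(b, b_matched) if m]
    transpositions = sum(x != y for x, y in zip(a_seq, b_seq)) // 2
    return (matches / len(a)
            + matches / len(b)
            + (matches - transpositions) / matches) / 3


def jaro_winkler(a: str, b: str, p: float = 0.1, max_prefix: int = 4) -> float:
    """Jaro similarity boosted for agreement on the leading characters.
    p and max_prefix are the commonly quoted defaults (assumed here)."""
    base = jaro(a, b)
    prefix = 0
    for x, y in zip(a[:max_prefix], b[:max_prefix]):
        if x != y:
            break
        prefix += 1
    return base + prefix * p * (1.0 - base)
```

Both similarity functions return a score between 0 and 1, so a pair of name fields can be classified as a match or non-match by comparing the score against a threshold; sweeping that threshold is what produces the points of an ROC curve.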

CITATION:

Source: U.S. Census Bureau, Statistical Research Division

Created: June 13, 2005
Last revised: June 13, 2005

