Research Output

A comparison of techniques for name matching.

  Information explosion is a problem for everyone nowadays. It is a great challenge to all kinds of businesses to maintain high quality of data in their information applications, such as data integration, text and web mining, information retrieval, search engine, etc. In such applications, matching names is one of the popular tasks. There are a number of name matching techniques available. Unfortunately, there is no existing name matching technique that performs the best in all situations. Therefore, a problem that every researcher or a practitioner has to face is how to select an appropriate technique for a given dataset. This paper analyses and evaluates a set of popular name matching techniques on several carefully designed different datasets. The experimental comparison confirms the statement that there is no clear best technique. Some suggestions have been presented, which can be used as guidance for researchers and practitioners to select an appropriate name matching technique in a given dataset.

  • Type:

    Article

  • Date:

    30 November 2011

  • Publication Status:

    Published

  • ISSN:

    2010-2283

  • Library of Congress:

    QA76 Computer software

  • Dewey Decimal Classification:

    004 Data processing & computer science

Citation

Peng, T., Li, L. & Kennedy, J. (2011). A comparison of techniques for name matching. GSTF journal on computing. 2. ISSN 2010-2283

Authors

Keywords

Name matching; dataset;

Available Documents