Thumbnail
Access Restriction
Subscribed

Author Byung-Won On ♦ Jaewoo Kang ♦ Dongwon Lee ♦ Mitra, P.
Sponsorship ACM SIG on Inf. Retrieval ♦ ACM SIG on Hypertext, Hypermedia and the Web ♦ IEEE Tech. Comm. for Digital Libr
Source IEEE Xplore Digital Library
Content type Text
Publisher Institute of Electrical and Electronics Engineers, Inc. (IEEE)
File Format PDF
Copyright Year ©2005
Language English
Subject Domain (in DDC) Computer science, information & general works ♦ Data processing & computer science ♦ Library & information sciences
Subject Keyword Partitioning algorithms ♦ Computer science ♦ Large-scale systems ♦ Software libraries ♦ Portals ♦ Permission ♦ Books ♦ Error correction ♦ Information systems ♦ Information retrieval ♦ name disambiguation ♦ blocking ♦ measuring distances
Abstract In this paper, we consider the problem of ambiguous author names in bibliographic citations, and comparatively study alternative approaches to identify and correct such name variants (e.g., "Vannevar Bush" and "V. Vush"). Our study is based on a scalable two-step framework, where step 1 is to substantially reduce the number of candidates via blocking, and step 2 is to measure the distance of two names via coauthor information. Combining four blocking methods and seven distance measures on four data sets, we present extensive experimental results, and identify combinations that are scalable and effective to disambiguate author names in citations
Description Author affiliation: Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA (Byung-Won On)
ISBN 1581138768
Educational Role Student ♦ Teacher
Age Range above 22 year
Educational Use Research ♦ Reading
Education Level UG and PG
Learning Resource Type Article
Publisher Date 2005-06-07
Publisher Place USA
Rights Holder Association for Computing Machinery, Inc. (ACM)
Size (in Bytes) 1.01 MB
Page Count 10
Starting Page 344
Ending Page 353


Source: IEEE Xplore Digital Library