Augmenting concept definition in gloss vector semantic relatedness measure using Wikipedia articles

Semantic relatedness measures are widely used in text mining and information retrieval applications. Considering these automated measures, in this research paper we attempt to improve Gloss Vector relatedness measure for more accurate estimation of relatedness between two given concepts. Generally,...

Full description

Saved in:
Bibliographic Details
Main Authors: Pesaranghader, Ahmad, Pesaranghader, Ali, Rezaei, Azadeh
Format: Conference or Workshop Item
Language:English
Published: Springer 2013
Online Access:http://psasir.upm.edu.my/id/eprint/60362/1/Augmenting%20concept%20definition%20in%20gloss%20vector%20semantic%20relatedness%20measure%20using%20Wikipedia%20articles.pdf
http://psasir.upm.edu.my/id/eprint/60362/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Semantic relatedness measures are widely used in text mining and information retrieval applications. Considering these automated measures, in this research paper we attempt to improve Gloss Vector relatedness measure for more accurate estimation of relatedness between two given concepts. Generally, this measure, by constructing concepts definitions (Glosses) from a thesaurus, tries to find the angle between the concepts’ gloss vectors for the calculation of relatedness. Nonetheless, this definition construction task is challenging as thesauruses do not provide full coverage of expressive definitions for the particularly specialized concepts. By employing Wikipedia articles and other external resources, we aim at augmenting these concepts’ definitions. Applying both definition types to the biomedical domain, using MEDLINE as corpus, UMLS as the default thesaurus, and a reference standard of 68 concept pairs manually rated for relatedness, we show exploiting available resources on the Web would have positive impact on final measurement of semantic relatedness.