Rule based autonomous citation mining with TIERL

Citations management is an important task in managing digital libraries. Citations provide valuable information e.g., used in evaluating an author’s influences or scholarly quality (the impact factor of research journals). But although a reliable and effective autonomous citation management is essen...

Full description

Saved in:
Bibliographic Details
Main Authors: Muhammad Tanvir, Afzal, Maurer, Hermann, Balke, Wolf-Tilo, Narayanan, Kulathuramaiyer
Format: E-Article
Language:English
Published: Journal of Digital Information Management 2006
Subjects:
Online Access:http://ir.unimas.my/id/eprint/542/1/Rule_based_Autonomous_Citation_Mining_with_TIERL.pdf
http://ir.unimas.my/id/eprint/542/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Citations management is an important task in managing digital libraries. Citations provide valuable information e.g., used in evaluating an author’s influences or scholarly quality (the impact factor of research journals). But although a reliable and effective autonomous citation management is essential, manual citation management can be extremely costly. Automatic citation mining on the other hand is a non-trivial task mainly due to non-conforming citation styles, spelling errors and the difficulty of reliably extracting text from PDF documents. In this paper we propose a novel rule-based autonomous citation mining technique, to address this important task. We define a set of common heuristics that together allow to improve the state of the art in automatic citation mining. Moreover, by first disambiguating citations based on venues, our technique significantly enhances the correct discovery of citations. Our experiments show that the proposed approach is indeed able to overcome limitations of current leading citation indexes such as ISI Web of Knowledge, Citeseer and Google Scholar.