Rule based autonomous citation mining with TIERL

Citations management is an important task in managing digital libraries. Citations provide valuable information e.g., used in evaluating an author’s influences or scholarly quality (the impact factor of research journals). But although a reliable and effective autonomous citation management is essen...

Full description

Saved in:
Bibliographic Details
Main Authors: Muhammad Tanvir, Afzal, Maurer, Hermann, Balke, Wolf-Tilo, Narayanan, Kulathuramaiyer
Format: E-Article
Language:English
Published: Journal of Digital Information Management 2006
Subjects:
Online Access:http://ir.unimas.my/id/eprint/542/1/Rule_based_Autonomous_Citation_Mining_with_TIERL.pdf
http://ir.unimas.my/id/eprint/542/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.unimas.ir.542
record_format eprints
spelling my.unimas.ir.5422016-04-12T06:43:45Z http://ir.unimas.my/id/eprint/542/ Rule based autonomous citation mining with TIERL Muhammad Tanvir, Afzal Maurer, Hermann Balke, Wolf-Tilo Narayanan, Kulathuramaiyer T Technology (General) Z665 Library Science. Information Science Citations management is an important task in managing digital libraries. Citations provide valuable information e.g., used in evaluating an author’s influences or scholarly quality (the impact factor of research journals). But although a reliable and effective autonomous citation management is essential, manual citation management can be extremely costly. Automatic citation mining on the other hand is a non-trivial task mainly due to non-conforming citation styles, spelling errors and the difficulty of reliably extracting text from PDF documents. In this paper we propose a novel rule-based autonomous citation mining technique, to address this important task. We define a set of common heuristics that together allow to improve the state of the art in automatic citation mining. Moreover, by first disambiguating citations based on venues, our technique significantly enhances the correct discovery of citations. Our experiments show that the proposed approach is indeed able to overcome limitations of current leading citation indexes such as ISI Web of Knowledge, Citeseer and Google Scholar. Journal of Digital Information Management 2006 E-Article NonPeerReviewed text en http://ir.unimas.my/id/eprint/542/1/Rule_based_Autonomous_Citation_Mining_with_TIERL.pdf Muhammad Tanvir, Afzal and Maurer, Hermann and Balke, Wolf-Tilo and Narayanan, Kulathuramaiyer (2006) Rule based autonomous citation mining with TIERL. Journal of Digital Information Management, 8 (3). pp. 196-204.
institution Universiti Malaysia Sarawak
building Centre for Academic Information Services (CAIS)
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Sarawak
content_source UNIMAS Institutional Repository
url_provider http://ir.unimas.my/
language English
topic T Technology (General)
Z665 Library Science. Information Science
spellingShingle T Technology (General)
Z665 Library Science. Information Science
Muhammad Tanvir, Afzal
Maurer, Hermann
Balke, Wolf-Tilo
Narayanan, Kulathuramaiyer
Rule based autonomous citation mining with TIERL
description Citations management is an important task in managing digital libraries. Citations provide valuable information e.g., used in evaluating an author’s influences or scholarly quality (the impact factor of research journals). But although a reliable and effective autonomous citation management is essential, manual citation management can be extremely costly. Automatic citation mining on the other hand is a non-trivial task mainly due to non-conforming citation styles, spelling errors and the difficulty of reliably extracting text from PDF documents. In this paper we propose a novel rule-based autonomous citation mining technique, to address this important task. We define a set of common heuristics that together allow to improve the state of the art in automatic citation mining. Moreover, by first disambiguating citations based on venues, our technique significantly enhances the correct discovery of citations. Our experiments show that the proposed approach is indeed able to overcome limitations of current leading citation indexes such as ISI Web of Knowledge, Citeseer and Google Scholar.
format E-Article
author Muhammad Tanvir, Afzal
Maurer, Hermann
Balke, Wolf-Tilo
Narayanan, Kulathuramaiyer
author_facet Muhammad Tanvir, Afzal
Maurer, Hermann
Balke, Wolf-Tilo
Narayanan, Kulathuramaiyer
author_sort Muhammad Tanvir, Afzal
title Rule based autonomous citation mining with TIERL
title_short Rule based autonomous citation mining with TIERL
title_full Rule based autonomous citation mining with TIERL
title_fullStr Rule based autonomous citation mining with TIERL
title_full_unstemmed Rule based autonomous citation mining with TIERL
title_sort rule based autonomous citation mining with tierl
publisher Journal of Digital Information Management
publishDate 2006
url http://ir.unimas.my/id/eprint/542/1/Rule_based_Autonomous_Citation_Mining_with_TIERL.pdf
http://ir.unimas.my/id/eprint/542/
_version_ 1644508647552712704
score 13.160551