Cross-document Structural Relationship Identification Using Supervised Machine Learning

Multi document analysis has been a field of interest for decades and is still being actively researched until today. One example of such analysis could be for the task of multi document summarization which is meant to represent the concise description of the original documents. In this paper, we wil...

Full description

Saved in:
Bibliographic Details
Main Author: Jaya Kumar, Yogan
Format: Article
Language:English
Published: ELSEVIER 2012
Subjects:
Online Access:http://eprints.utem.edu.my/id/eprint/6663/2/Revised_manuscript.pdf
http://eprints.utem.edu.my/id/eprint/6663/
http://www.sciencedirect.com/science/article/pii/S1568494612002967
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utem.eprints.6663
record_format eprints
spelling my.utem.eprints.66632022-02-04T13:10:23Z http://eprints.utem.edu.my/id/eprint/6663/ Cross-document Structural Relationship Identification Using Supervised Machine Learning Jaya Kumar, Yogan T Technology (General) Multi document analysis has been a field of interest for decades and is still being actively researched until today. One example of such analysis could be for the task of multi document summarization which is meant to represent the concise description of the original documents. In this paper, we will focus on some special properties that multi document articles hold, specifically news articles. Information across news articles reporting on the same story are often related. Cross-document Structure Theory (CST) gives several relationships between pairs of sentences from different documents. Among them, we focus on four relations namely “Identity”, “Overlap”, “Subsumption”, and “Description”. Our aim is to automatically identify these CST relationships. We applied three machine learning techniques, i.e. SVM, Neural Network and our proposed Case-based reasoning (CBR) model. Comparison between these techniques shows that the proposed CBR model yields better results. ELSEVIER 2012 Article PeerReviewed text en http://eprints.utem.edu.my/id/eprint/6663/2/Revised_manuscript.pdf Jaya Kumar, Yogan (2012) Cross-document Structural Relationship Identification Using Supervised Machine Learning. Applied Soft Computing, 12 (10). pp. 3124-3131. ISSN 1568-4946 http://www.sciencedirect.com/science/article/pii/S1568494612002967 10.1016/j.asoc.2012.06.017
institution Universiti Teknikal Malaysia Melaka
building UTEM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknikal Malaysia Melaka
content_source UTEM Institutional Repository
url_provider http://eprints.utem.edu.my/
language English
topic T Technology (General)
spellingShingle T Technology (General)
Jaya Kumar, Yogan
Cross-document Structural Relationship Identification Using Supervised Machine Learning
description Multi document analysis has been a field of interest for decades and is still being actively researched until today. One example of such analysis could be for the task of multi document summarization which is meant to represent the concise description of the original documents. In this paper, we will focus on some special properties that multi document articles hold, specifically news articles. Information across news articles reporting on the same story are often related. Cross-document Structure Theory (CST) gives several relationships between pairs of sentences from different documents. Among them, we focus on four relations namely “Identity”, “Overlap”, “Subsumption”, and “Description”. Our aim is to automatically identify these CST relationships. We applied three machine learning techniques, i.e. SVM, Neural Network and our proposed Case-based reasoning (CBR) model. Comparison between these techniques shows that the proposed CBR model yields better results.
format Article
author Jaya Kumar, Yogan
author_facet Jaya Kumar, Yogan
author_sort Jaya Kumar, Yogan
title Cross-document Structural Relationship Identification Using Supervised Machine Learning
title_short Cross-document Structural Relationship Identification Using Supervised Machine Learning
title_full Cross-document Structural Relationship Identification Using Supervised Machine Learning
title_fullStr Cross-document Structural Relationship Identification Using Supervised Machine Learning
title_full_unstemmed Cross-document Structural Relationship Identification Using Supervised Machine Learning
title_sort cross-document structural relationship identification using supervised machine learning
publisher ELSEVIER
publishDate 2012
url http://eprints.utem.edu.my/id/eprint/6663/2/Revised_manuscript.pdf
http://eprints.utem.edu.my/id/eprint/6663/
http://www.sciencedirect.com/science/article/pii/S1568494612002967
_version_ 1724077953827995648
score 13.154949