Graphics and scene text classification in video

Achieving good accuracy for text detection and recognition is a challenging and interesting problem in the field of video document analysis because of the presences of both graphics text that has good clarity and scene text that is unpredictable in video frames. Therefore, in this paper, we pres...

Full description

Saved in:
Bibliographic Details
Main Authors: Xu, J., Shivakumara, P., Lu, T., Phan, T.Q., Tan, C.L.
Format: Conference or Workshop Item
Language:English
Published: 2014
Subjects:
Online Access:http://eprints.um.edu.my/13091/1/graphics_and_scene.pdf
http://eprints.um.edu.my/13091/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.um.eprints.13091
record_format eprints
spelling my.um.eprints.130912015-03-24T01:35:23Z http://eprints.um.edu.my/13091/ Graphics and scene text classification in video Xu, J. Shivakumara, P. Lu, T. Phan, T.Q. Tan, C.L. T Technology (General) Achieving good accuracy for text detection and recognition is a challenging and interesting problem in the field of video document analysis because of the presences of both graphics text that has good clarity and scene text that is unpredictable in video frames. Therefore, in this paper, we present a novel method for classifying graphics texts and scene texts by exploiting temporal information and finding the relationship between them in video. The method proposes an iterative procedure to identify Probable Graphics Text Candidates (PGTC) and Probable Scene Text Candidates (PSTC) in video based on the fact that graphics texts in general do not have large movements especially compared to scene texts which are usually embedded on background. In addition to PGTC and PSTC, the iterative process automatically identifies the number of video frames with the help of a converging criterion. The method further explores the symmetry between intra and inter character components to identify graphics text candidates and scene text candidates. Boundary growing method is employed to restore the complete text line. For each segmented text line, we finally introduce Eigen value analysis to classify graphics and scene text lines based on the distribution of respective Eigen values. Experimental results with the existing methods show that the proposed method is effective and useful to improve the accuracy of text detection and recognition. 2014-08 Conference or Workshop Item PeerReviewed application/pdf en http://eprints.um.edu.my/13091/1/graphics_and_scene.pdf Xu, J. and Shivakumara, P. and Lu, T. and Phan, T.Q. and Tan, C.L. (2014) Graphics and scene text classification in video. In: International Conference on Pattern Recognition (ICPR), 24-28 Aug 2014, Stockholm, Sweden. (Submitted)
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
language English
topic T Technology (General)
spellingShingle T Technology (General)
Xu, J.
Shivakumara, P.
Lu, T.
Phan, T.Q.
Tan, C.L.
Graphics and scene text classification in video
description Achieving good accuracy for text detection and recognition is a challenging and interesting problem in the field of video document analysis because of the presences of both graphics text that has good clarity and scene text that is unpredictable in video frames. Therefore, in this paper, we present a novel method for classifying graphics texts and scene texts by exploiting temporal information and finding the relationship between them in video. The method proposes an iterative procedure to identify Probable Graphics Text Candidates (PGTC) and Probable Scene Text Candidates (PSTC) in video based on the fact that graphics texts in general do not have large movements especially compared to scene texts which are usually embedded on background. In addition to PGTC and PSTC, the iterative process automatically identifies the number of video frames with the help of a converging criterion. The method further explores the symmetry between intra and inter character components to identify graphics text candidates and scene text candidates. Boundary growing method is employed to restore the complete text line. For each segmented text line, we finally introduce Eigen value analysis to classify graphics and scene text lines based on the distribution of respective Eigen values. Experimental results with the existing methods show that the proposed method is effective and useful to improve the accuracy of text detection and recognition.
format Conference or Workshop Item
author Xu, J.
Shivakumara, P.
Lu, T.
Phan, T.Q.
Tan, C.L.
author_facet Xu, J.
Shivakumara, P.
Lu, T.
Phan, T.Q.
Tan, C.L.
author_sort Xu, J.
title Graphics and scene text classification in video
title_short Graphics and scene text classification in video
title_full Graphics and scene text classification in video
title_fullStr Graphics and scene text classification in video
title_full_unstemmed Graphics and scene text classification in video
title_sort graphics and scene text classification in video
publishDate 2014
url http://eprints.um.edu.my/13091/1/graphics_and_scene.pdf
http://eprints.um.edu.my/13091/
_version_ 1643689460175470592
score 13.18916