Detection of multi-oriented moving text in videos / Vijeta Khare

Text, one of the most significant creations of humankind, has played a vital part in human life since ancient times. High-level semantics embodied in text are beneficial in a wide range of vision-based applications, for example image understanding, image indexing, geo-location, aut...

Bibliographic Details
Main Author: Vijeta, Khare
Format: Thesis
Published: 2016
Subjects: T Technology (General); TA Engineering (General). Civil engineering (General)
Online Access: http://studentsrepo.um.edu.my/6739/4/vijeta.pdf
http://studentsrepo.um.edu.my/6739/
id my.um.stud.6739
record_format eprints
spelling my.um.stud.6739 2019-10-07T19:12:44Z Detection of multi-oriented moving text in videos / Vijeta Khare Vijeta, Khare T Technology (General) TA Engineering (General). Civil engineering (General) 2016-08 Thesis NonPeerReviewed application/pdf http://studentsrepo.um.edu.my/6739/4/vijeta.pdf Vijeta, Khare (2016) Detection of multi-oriented moving text in videos / Vijeta Khare. PhD thesis, University of Malaya. http://studentsrepo.um.edu.my/6739/
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Student Repository
url_provider http://studentsrepo.um.edu.my/
topic T Technology (General)
TA Engineering (General). Civil engineering (General)
description Text, one of the most significant creations of humankind, has played a vital part in human life since ancient times. High-level semantics embodied in text are beneficial in a wide range of vision-based applications, for example image understanding, image indexing, geo-location, automatic navigation, license plate recognition, assistance for blind people and other surveillance applications. Approaches from content-based image retrieval have been applied to these problems. However, they are inadequate for generating semantic annotations of video or image content because of the gap between high-level and low-level features. Text detection and recognition in video have therefore become active and important research areas in computer vision and document analysis, since they make it possible to understand the content of video and images at a high level with the help of an Optical Character Recognizer (OCR). In recent years the field has seen a surge of research effort and considerable progress, but many challenges remain, e.g. low resolution, complex backgrounds, variations in color, font and font size, multiple orientations, multi-oriented text movement, noise, blur and distortion. The objectives of this work are fourfold. (1) To introduce a new descriptor called Histogram Oriented Moments (HOM) for detecting multi-oriented text in videos. The HOM is built from orientations calculated with second-order geometric moments. To verify the detected text, optical flow properties are used to estimate the motion between text candidates in temporal frames. However, the temporal information is used only to eliminate false positives, not as a main feature for finding text candidates.
(2) To propose new models for finding multi-oriented moving text in video and scene images through moments; motion vectors are used to identify moving regions that have constant velocity. However, the model is slightly sensitive to the window size used for moment calculation and to different scripts in video. (3) To develop automatic window size determination for detecting text in videos; this method explores the stroke width transform, based on the observation that stroke width remains constant throughout the characters. Temporal frames are further used to identify text candidates, based on the fact that caption text stays at the same location for a few frames. However, the performance of this method degrades when blur is present in the video frames, because moments and stroke width transforms are sensitive to blur. (4) To develop a method for text detection and recognition in blurred frames; a blind deconvolution model is introduced that enhances edge sharpness by suppressing blurred pixels. Each method has been tested on benchmark datasets and on datasets created by the author from different resources, using standard measures. Furthermore, the results of the proposed methods are compared with state-of-the-art methods to show that the proposed methods are competitive with existing methods.
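The description says the HOM descriptor is built from orientations calculated with second-order geometric moments, but this record gives no formula. The sketch below is only an illustration of that general idea, not the thesis's method. It uses the standard result that the principal-axis angle of an intensity patch is 0.5 * atan2(2*mu11, mu20 - mu02), where mu20, mu02 and mu11 are second-order central moments. The function names, the 8-pixel window and the 9-bin histogram are assumptions made for the example.

```python
import math


def window_orientation(window):
    """Principal-axis angle (radians, image coordinates with y pointing down)
    of a 2-D intensity window, from its second-order central moments.
    Returns None when the window has zero total intensity."""
    m00 = sum(sum(row) for row in window)
    if m00 == 0:
        return None
    # First-order moments give the intensity centroid (cx, cy).
    cx = sum(x * v for row in window for x, v in enumerate(row)) / m00
    cy = sum(y * v for y, row in enumerate(window) for v in row) / m00
    mu20 = mu02 = mu11 = 0.0
    for y, row in enumerate(window):
        for x, v in enumerate(row):
            dx, dy = x - cx, y - cy
            mu20 += dx * dx * v
            mu02 += dy * dy * v
            mu11 += dx * dy * v
    # For an isotropic window (mu11 == 0, mu20 == mu02) the angle is
    # undefined; atan2(0, 0) returns 0 in that case.
    return 0.5 * math.atan2(2.0 * mu11, mu20 - mu02)


def orientation_histogram(image, win=8, bins=9):
    """Histogram of per-window orientations over non-overlapping windows.
    `win` and `bins` are illustrative defaults, not values from the thesis."""
    hist = [0] * bins
    rows, cols = len(image), len(image[0])
    for r in range(0, rows - win + 1, win):
        for c in range(0, cols - win + 1, win):
            window = [row[c:c + win] for row in image[r:r + win]]
            theta = window_orientation(window)
            if theta is None:
                continue
            # Map theta from (-pi/2, pi/2] to a bin index in [0, bins - 1].
            idx = int((theta + math.pi / 2) / math.pi * bins)
            hist[min(idx, bins - 1)] += 1
    return hist
```

In this sketch a fixed window size has to be chosen by hand, which is the kind of sensitivity to window size that the description reports for the moment-based model.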
format Thesis
author Vijeta, Khare
title Detection of multi-oriented moving text in videos / Vijeta Khare
publishDate 2016
url http://studentsrepo.um.edu.my/6739/4/vijeta.pdf
http://studentsrepo.um.edu.my/6739/