Staff View: Detection of multi-oriented moving text in videos / Vijeta Khare

Detection of multi-oriented moving text in videos / Vijeta Khare

Text, as one of the most significant creations of humankind, has played a vital part in humanoid life, so far from olden periods. High level semantics embodied in the text are beneficial in a wide range of vision-based applications. For example, image understanding, image indexing, geo location, aut...

Full description

Saved in:

Bibliographic Details
Main Author:	Vijeta, Khare
Format:	Thesis
Published:	2016
Subjects:	T Technology (General) TA Engineering (General). Civil engineering (General)
Online Access:	http://studentsrepo.um.edu.my/6739/4/vijeta.pdf http://studentsrepo.um.edu.my/6739/
Tags:	Add Tag No Tags, Be the first to tag this record!

id	my.um.stud.6739
record_format	eprints
spelling	my.um.stud.67392019-10-07T19:12:44Z Detection of multi-oriented moving text in videos / Vijeta Khare Vijeta, Khare T Technology (General) TA Engineering (General). Civil engineering (General) Text, as one of the most significant creations of humankind, has played a vital part in humanoid life, so far from olden periods. High level semantics embodied in the text are beneficial in a wide range of vision-based applications. For example, image understanding, image indexing, geo location, automatic navigation, license plate recognition, assisting blind person and other surveillance applications. There are approaches in the field of content based image retrieval to solve the above mentioned problems. However, these approaches are inadequate to generate annotation based on semantics according to content of video or images due to opening between high level and low level features. Therefore text detection and recognition in videos grow into active and important research areas in computer vision and document analysis, which is capable of understanding the content of video and images at high level with the help of Optical Character Recognizer (OCR). Especially in recent years, the researchers has seen a flow of research efforts and considerable developments in these fields, however many challenges e.g. low resolution, complex background and variations in colors, font, font size, Multi-orientations, Multi-orientation text movements, noise, blur, and distortion still remain. The objectives of this work are in four folds: (1) to introduce a new descriptor called Histogram Oriented Moments (HOM) for detecting multi-oriented text from videos. The HOM is created by considering the orientations calculated with the second order geometrical moments. Further, to verify the detected text, optical flow properties are used to estimate the motion between text candidates in temporal frames. However, the use of temporal information is limited to false positive elimination but not as main features to find text candidates. (2) to propose new models for finding multi-oriented moving text from video and scene images through moments, motion vectors are utilized to identify moving regions that have constant velocity. However, the model is slightly sensitive to window size used for moment‟s calculation and different scripts in video. (3) To develop automatic window size determination for detecting text from videos, the next method explored stroke width transform based on the information that the stroke width remains constant throughout the characters. Further, the temporal frames are used for identifying text candidates based on the fact that caption text stays at the same unchanged location for few frames. However, the performance of the proposed method degrades when there is blur present in the video frames because moments and stroke width transforms are sensitive to blur. (4) To develop a method for text detection and recognition in blur frames, a blind deconvolution model is introduced that enhances the edge sharpness by suppressing blurred pixels. In summary, each work has been tested over benchmark datasets and authors‟ created datasets from different resources using standard measures. Furthermore, the results of the proposed methods are compared with the state of art methods to show that the proposed methods are competent to existing methods. 2016-08 Thesis NonPeerReviewed application/pdf http://studentsrepo.um.edu.my/6739/4/vijeta.pdf Vijeta, Khare (2016) Detection of multi-oriented moving text in videos / Vijeta Khare. PhD thesis, University of Malaya. http://studentsrepo.um.edu.my/6739/
institution	Universiti Malaya
building	UM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Malaya
content_source	UM Student Repository
url_provider	http://studentsrepo.um.edu.my/
topic	T Technology (General) TA Engineering (General). Civil engineering (General)
spellingShingle	T Technology (General) TA Engineering (General). Civil engineering (General) Vijeta, Khare Detection of multi-oriented moving text in videos / Vijeta Khare
description	Text, as one of the most significant creations of humankind, has played a vital part in humanoid life, so far from olden periods. High level semantics embodied in the text are beneficial in a wide range of vision-based applications. For example, image understanding, image indexing, geo location, automatic navigation, license plate recognition, assisting blind person and other surveillance applications. There are approaches in the field of content based image retrieval to solve the above mentioned problems. However, these approaches are inadequate to generate annotation based on semantics according to content of video or images due to opening between high level and low level features. Therefore text detection and recognition in videos grow into active and important research areas in computer vision and document analysis, which is capable of understanding the content of video and images at high level with the help of Optical Character Recognizer (OCR). Especially in recent years, the researchers has seen a flow of research efforts and considerable developments in these fields, however many challenges e.g. low resolution, complex background and variations in colors, font, font size, Multi-orientations, Multi-orientation text movements, noise, blur, and distortion still remain. The objectives of this work are in four folds: (1) to introduce a new descriptor called Histogram Oriented Moments (HOM) for detecting multi-oriented text from videos. The HOM is created by considering the orientations calculated with the second order geometrical moments. Further, to verify the detected text, optical flow properties are used to estimate the motion between text candidates in temporal frames. However, the use of temporal information is limited to false positive elimination but not as main features to find text candidates. (2) to propose new models for finding multi-oriented moving text from video and scene images through moments, motion vectors are utilized to identify moving regions that have constant velocity. However, the model is slightly sensitive to window size used for moment‟s calculation and different scripts in video. (3) To develop automatic window size determination for detecting text from videos, the next method explored stroke width transform based on the information that the stroke width remains constant throughout the characters. Further, the temporal frames are used for identifying text candidates based on the fact that caption text stays at the same unchanged location for few frames. However, the performance of the proposed method degrades when there is blur present in the video frames because moments and stroke width transforms are sensitive to blur. (4) To develop a method for text detection and recognition in blur frames, a blind deconvolution model is introduced that enhances the edge sharpness by suppressing blurred pixels. In summary, each work has been tested over benchmark datasets and authors‟ created datasets from different resources using standard measures. Furthermore, the results of the proposed methods are compared with the state of art methods to show that the proposed methods are competent to existing methods.
format	Thesis
author	Vijeta, Khare
author_facet	Vijeta, Khare
author_sort	Vijeta, Khare
title	Detection of multi-oriented moving text in videos / Vijeta Khare
title_short	Detection of multi-oriented moving text in videos / Vijeta Khare
title_full	Detection of multi-oriented moving text in videos / Vijeta Khare
title_fullStr	Detection of multi-oriented moving text in videos / Vijeta Khare
title_full_unstemmed	Detection of multi-oriented moving text in videos / Vijeta Khare
title_sort	detection of multi-oriented moving text in videos / vijeta khare
publishDate	2016
url	http://studentsrepo.um.edu.my/6739/4/vijeta.pdf http://studentsrepo.um.edu.my/6739/
_version_	1738505952026427392
score	13.214268

Detection of multi-oriented moving text in videos / Vijeta Khare

Similar Items