Detection of multi-oriented moving text in videos / Vijeta Khare
Text, as one of the most significant creations of humankind, has played a vital part in humanoid life, so far from olden periods. High level semantics embodied in the text are beneficial in a wide range of vision-based applications. For example, image understanding, image indexing, geo location, aut...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Published: |
2016
|
Subjects: | |
Online Access: | http://studentsrepo.um.edu.my/6739/4/vijeta.pdf http://studentsrepo.um.edu.my/6739/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.um.stud.6739 |
---|---|
record_format |
eprints |
spelling |
my.um.stud.67392019-10-07T19:12:44Z Detection of multi-oriented moving text in videos / Vijeta Khare Vijeta, Khare T Technology (General) TA Engineering (General). Civil engineering (General) Text, as one of the most significant creations of humankind, has played a vital part in humanoid life, so far from olden periods. High level semantics embodied in the text are beneficial in a wide range of vision-based applications. For example, image understanding, image indexing, geo location, automatic navigation, license plate recognition, assisting blind person and other surveillance applications. There are approaches in the field of content based image retrieval to solve the above mentioned problems. However, these approaches are inadequate to generate annotation based on semantics according to content of video or images due to opening between high level and low level features. Therefore text detection and recognition in videos grow into active and important research areas in computer vision and document analysis, which is capable of understanding the content of video and images at high level with the help of Optical Character Recognizer (OCR). Especially in recent years, the researchers has seen a flow of research efforts and considerable developments in these fields, however many challenges e.g. low resolution, complex background and variations in colors, font, font size, Multi-orientations, Multi-orientation text movements, noise, blur, and distortion still remain. The objectives of this work are in four folds: (1) to introduce a new descriptor called Histogram Oriented Moments (HOM) for detecting multi-oriented text from videos. The HOM is created by considering the orientations calculated with the second order geometrical moments. Further, to verify the detected text, optical flow properties are used to estimate the motion between text candidates in temporal frames. However, the use of temporal information is limited to false positive elimination but not as main features to find text candidates. (2) to propose new models for finding multi-oriented moving text from video and scene images through moments, motion vectors are utilized to identify moving regions that have constant velocity. However, the model is slightly sensitive to window size used for moment‟s calculation and different scripts in video. (3) To develop automatic window size determination for detecting text from videos, the next method explored stroke width transform based on the information that the stroke width remains constant throughout the characters. Further, the temporal frames are used for identifying text candidates based on the fact that caption text stays at the same unchanged location for few frames. However, the performance of the proposed method degrades when there is blur present in the video frames because moments and stroke width transforms are sensitive to blur. (4) To develop a method for text detection and recognition in blur frames, a blind deconvolution model is introduced that enhances the edge sharpness by suppressing blurred pixels. In summary, each work has been tested over benchmark datasets and authors‟ created datasets from different resources using standard measures. Furthermore, the results of the proposed methods are compared with the state of art methods to show that the proposed methods are competent to existing methods. 2016-08 Thesis NonPeerReviewed application/pdf http://studentsrepo.um.edu.my/6739/4/vijeta.pdf Vijeta, Khare (2016) Detection of multi-oriented moving text in videos / Vijeta Khare. PhD thesis, University of Malaya. http://studentsrepo.um.edu.my/6739/ |
institution |
Universiti Malaya |
building |
UM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaya |
content_source |
UM Student Repository |
url_provider |
http://studentsrepo.um.edu.my/ |
topic |
T Technology (General) TA Engineering (General). Civil engineering (General) |
spellingShingle |
T Technology (General) TA Engineering (General). Civil engineering (General) Vijeta, Khare Detection of multi-oriented moving text in videos / Vijeta Khare |
description |
Text, as one of the most significant creations of humankind, has played a vital part in humanoid life, so far from olden periods. High level semantics embodied in the text are beneficial in a wide range of vision-based applications. For example, image understanding, image indexing, geo location, automatic navigation, license plate recognition, assisting blind person and other surveillance applications. There are approaches in the field of content based image retrieval to solve the above mentioned problems. However, these approaches are inadequate to generate annotation based on semantics according to content of video or images due to opening between high level and low level features. Therefore text detection and recognition in videos grow into active and important research areas in computer vision and document analysis, which is capable of understanding the content of video and images at high level with the help of Optical Character Recognizer (OCR). Especially in recent years, the researchers has seen a flow of research efforts and considerable developments in these fields, however many challenges e.g. low resolution, complex background and variations in colors, font, font size, Multi-orientations, Multi-orientation text movements, noise, blur, and distortion still remain. The objectives of this work are in four folds: (1) to introduce a new descriptor called Histogram Oriented Moments (HOM) for detecting multi-oriented text from videos. The HOM is created by considering the orientations calculated with the second order geometrical moments. Further, to verify the detected text, optical flow properties are used to estimate the motion between text candidates in temporal frames. However, the use of temporal information is limited to false positive elimination but not as main features to find text candidates. (2) to propose new models for finding multi-oriented moving text from video and scene images through moments, motion vectors are utilized to identify moving regions that have constant velocity. However, the model is slightly sensitive to window size used for moment‟s calculation and different scripts in video. (3) To develop automatic window size determination for detecting text from videos, the next method explored stroke width transform based on the information that the stroke width remains constant throughout the characters. Further, the temporal frames are used for identifying text candidates based on the fact that caption text stays at the same unchanged location for few frames. However, the performance of the proposed method degrades when there is blur present in the video frames because moments and stroke width transforms are sensitive to blur. (4) To develop a method for text detection and recognition in blur frames, a blind deconvolution model is introduced that enhances the edge sharpness by suppressing blurred pixels. In summary, each work has been tested over benchmark datasets and authors‟ created datasets from different resources using standard measures. Furthermore, the results of the proposed methods are compared with the state of art methods to show that the proposed methods are competent to existing methods. |
format |
Thesis |
author |
Vijeta, Khare |
author_facet |
Vijeta, Khare |
author_sort |
Vijeta, Khare |
title |
Detection of multi-oriented moving text in videos / Vijeta Khare |
title_short |
Detection of multi-oriented moving text in videos / Vijeta Khare |
title_full |
Detection of multi-oriented moving text in videos / Vijeta Khare |
title_fullStr |
Detection of multi-oriented moving text in videos / Vijeta Khare |
title_full_unstemmed |
Detection of multi-oriented moving text in videos / Vijeta Khare |
title_sort |
detection of multi-oriented moving text in videos / vijeta khare |
publishDate |
2016 |
url |
http://studentsrepo.um.edu.my/6739/4/vijeta.pdf http://studentsrepo.um.edu.my/6739/ |
_version_ |
1738505952026427392 |
score |
13.214268 |