Improving OCR accuracy for scanned historical newspapers

OCR is part of the computer vision field, and its use has grown rapidly in the past decade owing to the persistent demand for document digitization. OCR tools use different techniques to process a variety of input formats (.pdf, .doc, .jpeg, etc.). However, to our knowledge, no research...

Bibliographic Details
Main Author: Naiker, Nithyananthan
Format: Final Year Project Report
Language: English
Published: Universiti Malaysia Sarawak (UNIMAS), 2013
Subjects: N Visual arts (General) For photography, see TR; Q Science (General)
Online Access: http://ir.unimas.my/id/eprint/39020/3/Naiker%20Nithyananthan%20ft.pdf
http://ir.unimas.my/id/eprint/39020/
id my.unimas.ir.39020
record_format eprints
spelling my.unimas.ir.39020 2023-11-14T07:57:53Z http://ir.unimas.my/id/eprint/39020/ Naiker, Nithyananthan (2013) Improving OCR accuracy for scanned historical newspapers. [Final Year Project Report] (Unpublished) NonPeerReviewed text en
institution Universiti Malaysia Sarawak
building Centre for Academic Information Services (CAIS)
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Sarawak
content_source UNIMAS Institutional Repository
url_provider http://ir.unimas.my/
language English
topic N Visual arts (General) For photography, see TR
Q Science (General)
description OCR is part of the computer vision field, and its use has grown rapidly in the past decade owing to the persistent demand for document digitization. OCR tools use different techniques to process a variety of input formats (.pdf, .doc, .jpeg, etc.). However, to our knowledge, no research has been done on applying the chain code technique to historical documents stored in image format. In this project, a variant of the chain code algorithm, known as the Compare images algorithm, is presented after being tuned to process samples of the Sarawak Gazette. Experimental results show a relatively high accuracy improvement (approximately 6.90%). Future work will focus on testing the algorithm on other historical documents.
format Final Year Project Report
author Naiker, Nithyananthan
title Improving OCR accuracy for scanned historical newspapers
publisher Universiti Malaysia Sarawak (UNIMAS)
publishDate 2013
url http://ir.unimas.my/id/eprint/39020/3/Naiker%20Nithyananthan%20ft.pdf
http://ir.unimas.my/id/eprint/39020/