Improving ocraccuracy for scanned historical newspapers
OCR is part of the computer vision field and the use of it has grown rapidly in recent decade due to the incessant demand in document digitization. Different techniques are used by OCR tools to process varieties of input formats(.pdf, doc, .jpeg, etc.). However, from our point of view, no research...
Saved in:
Main Author: | |
---|---|
Format: | Final Year Project Report |
Language: | English |
Published: |
Universiti Malaysia Sarawak, (UNIMAS)
2013
|
Subjects: | |
Online Access: | http://ir.unimas.my/id/eprint/39020/3/Naiker%20Nithyananthan%20ft.pdf http://ir.unimas.my/id/eprint/39020/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.unimas.ir.39020 |
---|---|
record_format |
eprints |
spelling |
my.unimas.ir.390202023-11-14T07:57:53Z http://ir.unimas.my/id/eprint/39020/ Improving ocraccuracy for scanned historical newspapers Naiker, Nithyananthan N Visual arts (General) For photography, see TR Q Science (General) OCR is part of the computer vision field and the use of it has grown rapidly in recent decade due to the incessant demand in document digitization. Different techniques are used by OCR tools to process varieties of input formats(.pdf, doc, .jpeg, etc.). However, from our point of view, no research has been done in applying the chain code technique on historical documents stored in image format. In this project, one variant of the chain code algorithm known as Compare images algorithm is presented when it has been tuned to process some samples of Sarawak Gazette. Experimental results show relatively high accuracy improvement (approximately 6.90%). Future works will focus on testing the algorithm to other historical documents. Universiti Malaysia Sarawak, (UNIMAS) 2013 Final Year Project Report NonPeerReviewed text en http://ir.unimas.my/id/eprint/39020/3/Naiker%20Nithyananthan%20ft.pdf Naiker, Nithyananthan (2013) Improving ocraccuracy for scanned historical newspapers. [Final Year Project Report] (Unpublished) |
institution |
Universiti Malaysia Sarawak |
building |
Centre for Academic Information Services (CAIS) |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaysia Sarawak |
content_source |
UNIMAS Institutional Repository |
url_provider |
http://ir.unimas.my/ |
language |
English |
topic |
N Visual arts (General) For photography, see TR Q Science (General) |
spellingShingle |
N Visual arts (General) For photography, see TR Q Science (General) Naiker, Nithyananthan Improving ocraccuracy for scanned historical newspapers |
description |
OCR is part of the computer vision field and the use of it has grown rapidly in recent decade due to the incessant demand in document digitization. Different techniques are used by OCR tools to process varieties of input formats(.pdf, doc, .jpeg, etc.). However, from our point of view, no research has been done in applying the chain code technique on historical documents stored in image format. In this project, one variant of the chain code algorithm known as Compare images algorithm is presented when it has been tuned to process some samples of Sarawak Gazette. Experimental results show relatively high accuracy improvement (approximately 6.90%). Future works will focus on testing the algorithm to other historical documents. |
format |
Final Year Project Report |
author |
Naiker, Nithyananthan |
author_facet |
Naiker, Nithyananthan |
author_sort |
Naiker, Nithyananthan |
title |
Improving ocraccuracy for scanned historical newspapers |
title_short |
Improving ocraccuracy for scanned historical newspapers |
title_full |
Improving ocraccuracy for scanned historical newspapers |
title_fullStr |
Improving ocraccuracy for scanned historical newspapers |
title_full_unstemmed |
Improving ocraccuracy for scanned historical newspapers |
title_sort |
improving ocraccuracy for scanned historical newspapers |
publisher |
Universiti Malaysia Sarawak, (UNIMAS) |
publishDate |
2013 |
url |
http://ir.unimas.my/id/eprint/39020/3/Naiker%20Nithyananthan%20ft.pdf http://ir.unimas.my/id/eprint/39020/ |
_version_ |
1783883538412601344 |
score |
13.214268 |