Two bigrams based language model for auto correction of Arabic OCR errors

In optical character recognition (OCR), the characteristics of Arabic text cause more errors than in English text. In this paper, a two bi-grams based language model that uses Wikipedia's database is presented. The method can perform auto detection and correction of non-word errors in Arabic OCR...

Bibliographic Details
Main Authors: Habeeb, Imad Q., Mohd Yusof, Shahrul Azmi, Ahmad, Faudziah
Format: Article
Language: English
Published: AICIT, Korea 2014
Online Access: http://repo.uum.edu.my/12602/1/JDCTA3630PPL.pdf
http://repo.uum.edu.my/12602/
http://www.aicit.org/jdcta/global/paper_detail.html?jname=JDCTA&q=3630