Journal Articles - Computer and Information Technology - 2019
Permanent URI for this collection
Browsing Journal Articles - Computer and Information Technology - 2019 by Subject "Error correction"
Results Per Page
Sort Options
-
PublicationOCR Error Correction for Unconstrained Vietnamese Handwritten Text( 2019)
;Quoc-Dung Nguyen ;Duc-Anh LeIvan ZelinkaPost-processing is an essential step in detecting and correcting errors in OCR-generated texts. In this paper, we present an automatic OCR post-processing model which comprises both error detection and error correction phases for OCR output texts of unconstrained Vietnamese handwriting. We propose a hybrid approach of generating and scoring correction candidates for both non-syllable and real-syllable errors based on the linguistic features as well as the error characteristics of OCR outputs. We evaluate our proposed model on a Vietnamese benchmark database at the line level. The experimental results show that our model achieves 4.17% of character error rate (CER) and 9.82% of word error rate (WER), which helps improve both CER and WER of an attention-based encoder-decoder approach by 0.5% and 3.5% respectively on the VNOnDB-Line dataset of the Vietnamese online handwritten text recognition competition (VOHTR2018). These results outperform those obtained by various recognition systems in the VOHTR2018 competition.