Research Article

A Survey on Various OCR Errors

by Atul Kumar
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 143 - Issue 4
Published: Jun 2016
Authors: Atul Kumar
10.5120/ijca2016910142

Atul Kumar. A Survey on Various OCR Errors. International Journal of Computer Applications. 143, 4 (Jun 2016), 8-10. DOI=10.5120/ijca2016910142

                        @article{ 10.5120/ijca2016910142,
                        author  = { Atul Kumar },
                        title   = { A Survey on Various OCR Errors },
                        journal = { International Journal of Computer Applications },
                        year    = { 2016 },
                        volume  = { 143 },
                        number  = { 4 },
                        pages   = { 8-10 },
                        doi     = { 10.5120/ijca2016910142 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2016
                        %A Atul Kumar
                        %T A Survey on Various OCR Errors
                        %J International Journal of Computer Applications
                        %V 143
                        %N 4
                        %P 8-10
                        %R 10.5120/ijca2016910142
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

Research on correcting words in OCR text has centered mainly on (1) non-word errors, (2) isolated-word error correction, and (3) context-dependent word correction. A variety of techniques have been developed for each. This paper surveys these techniques and assesses which of them perform better.
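
To make the first two categories concrete, here is a minimal Python sketch. It is not taken from the paper: the tiny lexicon, the function names, and the distance threshold are all illustrative assumptions. It flags a token as a non-word error when it is missing from the lexicon, then attempts isolated-word correction by picking the lexicon entry with the smallest Levenshtein distance.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance, computed row by row with dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


# Hypothetical toy lexicon; a real system would load a full dictionary.
LEXICON = {"modern", "the", "form", "from", "letter"}


def is_non_word(token: str, lexicon=LEXICON) -> bool:
    """A token is a non-word error if the lexicon does not contain it."""
    return token.lower() not in lexicon


def correct_isolated(token: str, lexicon=LEXICON, max_distance: int = 2):
    """Return the closest lexicon word, or None if nothing is close enough.

    The threshold max_distance is an assumed tuning parameter.
    """
    token = token.lower()
    # Sorting makes tie-breaking deterministic.
    best = min(sorted(lexicon), key=lambda w: edit_distance(token, w))
    return best if edit_distance(token, best) <= max_distance else None
```

In this sketch, an OCR output such as "rnodern" (where "m" was read as "rn") is flagged as a non-word and mapped back to "modern". A real-word error such as "form" in place of "from" passes the lexicon check unchanged, which is the case that context-dependent word correction is meant to handle.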

Index Terms
Computer Science
Information Sciences
Keywords

OCR Errors, NLP, Probability
