CFP last date
20 June 2024
Reseach Article

Bridging the Gap between Handwriting and Machine: AI-based Handwritten Text Recognition

by Dhruv Shah, Sai Bhargav, Dhanush B.
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 185 - Number 7
Year of Publication: 2023
Authors: Dhruv Shah, Sai Bhargav, Dhanush B.
10.5120/ijca2023922727

Dhruv Shah, Sai Bhargav, Dhanush B. . Bridging the Gap between Handwriting and Machine: AI-based Handwritten Text Recognition. International Journal of Computer Applications. 185, 7 ( May 2023), 39-45. DOI=10.5120/ijca2023922727

@article{ 10.5120/ijca2023922727,
author = { Dhruv Shah, Sai Bhargav, Dhanush B. },
title = { Bridging the Gap between Handwriting and Machine: AI-based Handwritten Text Recognition },
journal = { International Journal of Computer Applications },
issue_date = { May 2023 },
volume = { 185 },
number = { 7 },
month = { May },
year = { 2023 },
issn = { 0975-8887 },
pages = { 39-45 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume185/number7/32717-2023922727/ },
doi = { 10.5120/ijca2023922727 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T01:25:32.123965+05:30
%A Dhruv Shah
%A Sai Bhargav
%A Dhanush B.
%T Bridging the Gap between Handwriting and Machine: AI-based Handwritten Text Recognition
%J International Journal of Computer Applications
%@ 0975-8887
%V 185
%N 7
%P 39-45
%D 2023
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Due to the variety and complexity of handwriting styles, handwritten text recognition (HTR) is a difficult job. In HTR, artificial intelligence (AI) has demonstrated enormous promise, opening the door to the creation of effective and precise identification systems. An overview of the HTR process using AI methods, including preprocessing of the data, feature extraction, classification, and post-processing, is provided in this paper. To get the input data ready for feature extraction, data preprocessing includes image enhancement and segmentation. To feed the classification model with useful features, such as character shapes and patterns, pertinent features are extracted from the input picture. A neural network is frequently used in the classification model to map the extracted features to the associated characters. Spell checkers and language models are two post-processing tools that can be used to improve recognition outcomes. We discuss the challenges and opportunities for HTR using Neural Networks, including the importance of training data, model selection, and performance evaluation. The paper also outlines the methodology, design, and architecture of the Handwriting character recognition system and the testing and results of the system development. The aim is to demonstrate the effectiveness of neural networks for Handwriting character recognition.

References
  1. S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural Computation, vol. 9, no. 8, pp. 1735–1780, Nov. 1997.
  2. A. Graves, S. Ferna´ndez, F. Gomez, and J. Schmidhuber, “Connection- ist temporal classification: Labelling unsegmented sequence data with recurrent neural networks,” in ICML, 2006.
  3. A. Graves and J. Schmidhuber, “Offline handwriting recognition with multidimensional recurrent neural networks,” in NIPS, 2009.
  4. A. Graves, M. Liwicki, S. Fernndez, R. Bertolami, H. Bunke, and
  5. J. Schmidhuber, “A novel connectionist system for unconstrained hand- writing recognition,” PAMI, vol. 31, no. 5, pp. 855–868, 2009.
  6. V. Pham, T. Bluche, C. Kermorvant, and J. Louradour, “Dropout improves recurrent neural networks for handwriting recognition,” in ICFHR, 2014.
  7. P. Voigtlaender, P. Doetsch, and H. Ney, “Handwriting recognition with large multidimensional long short-term memory recurrent neural networks,” in ICFHR, 2016.
  8. T. Bluche and R. Messina, “Gated convolutional recurrent neural networks for multilingual handwriting recognition,” in ICDAR, vol. 01, 2017.
  9. J. Puigcerver, “Are multidimensional recurrent layers really necessary for handwritten text recognition?” in ICDAR, 2017.
  10. D. Castro, B. L. D. Bezerra, and M. Valena, “Boosting the deep multi-dimensional long-short-term memory network for handwritten recognition systems,” in ICFHR, 2018.
  11. D. Keysers, T. Deselaers, H. A. Rowley, L. Wang, and V. Carbune, “Multi-language online handwriting recognition,” PAMI, vol. 39, no. 6, pp. 1180–1194, 2017.
  12. V. Carbune, P. Gonnet, T. Deselaers, H. A. Rowley, A. Daryin, M. Calvo, L.-L. Wang, D. Keysers, S. Feuz, and P. Gervais, “Fast multi-language lstm-based online handwriting recognition,” ArXiV, 2019.
  13. J. Walker, Y. Fujii, and A. C. Popat, “A web-based ocr service for documents,” in DAS, Apr 2018, pp. 21–22.
  14. S. Ghosh and A. Joshi, “Text entry in Indian languages on mobile: User perspectives,” in India HCI, 2014.
  15. T. M. Breuel, “Tutorial on ocr and layout analysis,” in DAS, 2018.
  16. Y. Fujii, K. Driesen, J. Baccash, A. Hurst, and A. C. Popat, “Sequence-to-label script identification for multilingual ocr,” in ICDAR, 2017.
  17. M. Kozielski, P. Doetsch, and H. Ney, “Improvements in rwth’s system for off-line handwriting recognition,” in ICDAR, 2013.
  18. P. Doetsch, M. Kozielski, and H. Ney, “Fast and robust training of recurrent neural networks for offline handwriting recognition,” in ICFHR, 2014.
  19. P. Voigtlaender, P. Doetsch, S. Wiesler, R. Schlter, and H. Ney, “Sequence-discriminative training of recurrent neural networks,” in ICASSP, 2015.
  20. T. N. Sainath, O. Vinyals, A. Senior, and H. Sak, “Convolutional, long short-term memory, fully connected deep neural networks,” in ICASSP, 2015.
  21. C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan,
  22. V. Vanhoucke, and A. Rabinovich, “Going deeper with convolutions,” in CVPR, 2015.
  23. J. Wang and X. Hu, “Gated recurrent convolution neural network for ocr,” in NIPS, 2017.
  24. M. Liang and X. Hu, “Recurrent convolutional neural network for object recognition,” in CVPR, 2015, pp. 3367–3375.
  25. J. Sa´nchez, A. Toselli, V. Romero, and E. Vidal, “ICDAR 2015 Com- petition HTRtS: Handwritten Text Recognition on the tranScriptorium Dataset,” in ICDAR, 2015.
  26. A. Toselli, V. Romero, M. Villegas, E. Vidal, and J. Sa´nchez, “ICFHR2016 Competition on Handwritten Text Recognition on the READ Dataset,” in ICFHR, 2016.
  27. J. Sa´nchez, V. Romero, A. Toselli, M. Villegas, and E. Vidal, “IC- DAR2017 Competition on Handwritten Text Recognition on the READ Dataset,” in ICDAR, 2017.
  28. Folger Shakespeare Library, “Early Modern Manuscripts Online (EMMO).” [Online]. Available: https://emmo.folger.edu
  29. K. Chen, L. Tian, H. Ding, M. Cai, L. Sun, S. Liang, and Q. Huo, “A compact cnn-dblstm based character model for online handwritten chinese text recognition,” in ICDAR, 2017.
  30. A. Graves, “Generating sequences with recurrent neural networks,” ArXiV, 2013.
  31. U. Marti and H. Bunke, “The IAM-database: An English sentence database for off-line handwriting recognition,” IJDAR, vol. 5, pp. 39–46, 2002.
  32. M. Liwicki and H. Bunke, “Iam-ondb - an on-line English sentence database acquired from handwritten text on a whiteboard,” in ICDAR, 2005.
Index Terms

Computer Science
Information Sciences

Keywords

Handwriting Text Recognition Artificial Intelligence Machine Learning Artificial Neural Networks Image Processing Support Vector Machine Computer Vision.