Handwritten Character Recognition Using CNN

Abstract

Handwritten character recognition holds a significant position in the realms of pattern recognition and machine learning, finding applications across archival document digitization and automated form processing. Convolutional Neural Networks (CNNs) have emerged as a powerful tool for addressing this task due to their ability to learn intricate features and patterns from raw input data. This paper provides a comprehensive overview of CNNs' application in handwritten character recognition, covering various aspects of the recognition process including preprocessing techniques, dataset selection, network architecture design, training methods, and performance evaluation. The synthesis explores the evolution of CNN-based methodologies, examining architectural enhancements, regularization techniques, and data augmentation methods that collectively improve recognition accuracy. Additionally, the paper evaluates advanced approaches that integrate CNNs with other machine learning techniques such as recurrent networks and attention mechanisms, leading to further improvements in recognition performance. Through a thorough review of contemporary literature, the paper highlights achievements, ongoing challenges, and emerging research directions in the field of handwritten character recognition using CNNs. This survey serves as a valuable resource for researchers and practitioners interested in understanding the current landscape of the field and sheds light on potential breakthroughs and the trajectory of advancements in this dynamic domain.

Country : India

1 Dr. V. Suganthi2 S. Sankeerthna

  1. Associate Professor, Department of Computer Science, Sri Ramakrishna College of Arts & Science, Coimbatore, Tamilnadu, India
  2. Student, Department of Computer Science, Sri Ramakrishna College of Arts & Science, Coimbatore, Tamilnadu, India

IRJIET, Volume 8, Issue 3, March 2024 pp. 194-198

doi.org/10.47001/IRJIET/2024.803026

References

  1. Badrinarayanan, V., Kendall, A., & Cipolla, R. (2017). SegNet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(12), 2481-2495.
  2. Gupta, P., Majumdar, A., & Gupta, D. (2019). CNN based approach for multilingual handwritten character recognition. In 2019 3rd International Conference on Electronics, Communication and Aerospace Technology (ICECA) (pp. 1460-1464). IEEE.
  3. Ghosh, T., & Amin, M.A. (2020). Handwritten Bangla character recognition using deep learning with Transfer Learning. In 2020 International Conference on Electrical, Computer and Communication Engineering (ECCE) (pp. 1-4). IEEE.
  4. Hinton, G.E., Osindero, S., & Teh, Y.W. (2006). A fast learning algorithm for deep belief nets. Neural Computation, 18(7), 1527-1554.
  5. Nguyen, C.H., Dang, K., & Doermann, D. (2019). Multiscale CNNs for offline handwritten mathematical expression recognition. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 33, No. 1, pp. 7441-7448).
  6. Ptucha, R., & Savakis, A.E. (2013). A new approach to intelligent character recognition using computational intelligence. In 2013 IEEE International Conference on Systems, Man, and Cybernetics (pp. 3961-3966). IEEE.
  7. Pham, V.T., Bluche, T., Kermorvant, C., & Louradour, J. (2014). Dropout improves recurrent neural networks for handwriting recognition. In Frontiers in Handwriting Recognition (pp. 285-290). IEEE.
  8. Shi, B., Bai, X., & Yao, C. (2016). An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(11), 2298- 2304.
  9. Simard, P.Y., Steinkraus, D., & Platt, J.C. (2003). Best practices for convolutional neural networks applied to visual document analysis. In ICDAR (Vol. 3, pp. 958-962). IEEE.
  10. Wang, K., Babenko, B., & Belongie, S. (2011). End-to-end scene text recognition. In 2011 International Conference on Computer Vision (pp. 1457- 1464). IEEE.
  11. Ziran, Z., Lin, H., Huang, S., & Xiong, H. (2020). Word location recognition in historical document images using R-CNN-based framework. Journal of Imaging Science and Technology, 64(5), 050501.