Image Processing and Natural Language Processing Based Digitalized Document Generator

Abstract

The proposed application aims to provide users with advanced document conversion and classification capabilities, as well as convenient voice command capabilities. The image-to-text conversion feature allows users to convert handwritten documents quickly and easily into digitalized text files directly from their device. The app uses powerful image recognition algorithms to accurately recognize text and convert it into editable digital text. Speech-to-text functionality enables users to convert spoken words into digital text files with high accuracy and speed. Additionally, the app includes a document classification feature that automatically identifies document types based on their content. This feature uses machine learning algorithms to analyze text and classify it into one of several categories. B. Economics, computer science, engineering, etc. The voice command feature allows users to operate the app with their voice, making it easy to initiate document conversion and classification tasks. Overall, this web application a comprehensive document digitization and management solution with a powerful feature set that can be accessed with simple voice commands.

Country : Sri Lanka

1 Thilakarathne U.T2 Rajarathne R.M.H.M3 Weerasinghe V.P4 Kotuwegoda K.S.D5 N.H.P. Ravi Supunya Swarnakantha6 U.U. Samantha Rajapaksha

  1. Department of Computer System Engineering, Sri Lanka Institute of Information Technology, Malabe, Sri Lanka
  2. Department of Computer System Engineering, Sri Lanka Institute of Information Technology, Malabe, Sri Lanka
  3. Department of Computer System Engineering, Sri Lanka Institute of Information Technology, Malabe, Sri Lanka
  4. Department of Computer System Engineering, Sri Lanka Institute of Information Technology, Malabe, Sri Lanka
  5. Department of Information Technology, Sri Lanka Institute of Information Technology, Matara, Sri Lanka
  6. Department of Information Technology, Sri Lanka Institute of Information Technology, Malabe, Sri Lanka

IRJIET, Volume 7, Issue 6, June 2023 pp. 123-130

doi.org/10.47001/IRJIET/2023.706019

References

  1. V. Rabeux, N. Journet, A. Vialard and J. -P. Domenger, "Quality Evaluation of Ancient Digitized Documents for Binarization Prediction," 2013 12th International Conference on Document Analysis and Recognition, Washington, DC, USA, 2013, pp. 113-117, doi: 10.1109/ICDAR.2013.30.
  2. L. J. van Vliet, P. W. Verbeek and I. T. Young, "Quantitative imaging: how to measure size features in digitized images," 2004 2nd IEEE International Symposium on Biomedical Imaging: Nano to Macro (IEEE Cat No. 04EX821), Arlington, VA, USA, 2004, pp. 1227-1230 Vol. 2, doi: 10.1109/ISBI.2004.1398766.
  3. P. Dhande and R. Kharat, "Recognition of cursive English handwritten characters," 2017 International Conference on Trends in Electronics and Informatics (ICEI), Tirunelveli, India, 2017, pp. 199-203, doi: 10.1109/ICOEI.2017.8300915.
  4. M. Shopon, N. Mohammed and M. A. Abedin, "Image augmentation by blocky artifact in Deep Convolutional Neural Network for handwritten digit recognition," 2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR), Dhaka, Bangladesh, 2017, pp. 1-6, doi: 10.1109/ICIVPR.2017.7890867.
  5. J. Arlandis, J. . -C. Perez-Cortes and R. Llobet, "Handwritten character recognition using the continuous distance transformation," Proceedings 15th International Conference on Pattern Recognition. ICPR-2000, Barcelona, Spain, 2000, pp. 940-943 vol.1, doi: 10.1109/ICPR.2000.905602.
  6. Z. C. Li, C. Y. Suen, T. D. Bui and Q. L. Gu, "Harmonic models of shape transformations in digital images and patterns," [1990] Proceedings. 10th International Conference on Pattern Recognition, Atlantic City, NJ, USA, 1990, pp. 1-7 vol.2, doi: 10.1109/ICPR.1990.119319.
  7. D. Oliveira, R. Lins, G. Torreão, J. Fan and M. Thielo, "An Efficient Algorithm for Segmenting Warped Text-Lines in Document Images," 2013 12th International Conference on Document Analysis and Recognition, Washington, DC, USA, 2013, pp. 250-254, doi: 10.1109/ICDAR.2013.57.
  8. T. Hain et al., "Transcribing Meetings With the AMIDA Systems," in IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 2, pp. 486-498, Feb. 2012, doi: 10.1109/TASL.2011.2163395.
  9. Y. Li, S. Junginger, N. Stoll and K. Thurow, "4D simulation system for laboratory workflow of life science automation," 2012 IEEE International Instrumentation and Measurement Technology Conference Proceedings, Graz, Austria, 2012, pp. 1886-1890, doi: 10.1109/I2MTC.2012.6229314.
  10. S. Yu, L. Liu and M. Fu, "The Application Research on Knowledge Management of Project Manager," 2009 International Conference on Information Management, Innovation Management and Industrial Engineering, Xi'an, China, 2009, pp. 340-343, doi: 10.1109/ICIII.2009.391.
  11. İ. Ç. Taş and A. A. Müngen, "Using Pre-Processing Methods to Improve OCR Performances of Digital Historical Documents," 2021 Innovations in Intelligent Systems and Applications Conference (ASYU), Elazig, Turkey, 2021, pp. 1-5, doi: 10.1109/ASYU52992.2021.9598972.
  12. G. Cavalcanti and E. Filho, "An architecture for document management," Proceedings. International Conference on Image Processing, Rochester, NY, USA, 2002, pp. 973-976 vol.3, doi: 10.1109/ICIP.2002.1039137.
  13. M. Ramirez, E. Tapia, M. Block and R. Rojas, "Quantile Linear Algorithm for Robust Binarization of Digitalized Letters," Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), Curitiba, Brazil, 2007, pp. 1158-1162, doi: 10.1109/ICDAR.2007.4377097.
  14. M. K. Ugale, S. J. Patil and V. B. Musande, "Document management system: A notion towards paperless office," 2017 1st International Conference on Intelligent Systems and Information Management (ICISIM), Aurangabad, India, 2017, pp. 217-224, doi: 10.1109/ICISIM.2017.8122176.
  15. L. Schomaker, M. Bulacu and K. Franke, "Automatic writer identification using fragmented connected-component contours," Ninth International Workshop on Frontiers in Handwriting Recognition, Kokubunji, Japan, 2004, pp. 185-190, doi: 10.1109/IWFHR.2004.22.
  16. D. Singh, S. V. S and V. K, "A Proposed Approach for Identifying the Connotative Relationship of English Sentences and paragraphs using the NLP package of Python," 2022 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES), Chennai, India, 2022, pp. 1-7, doi: 10.1109/ICSES55317.2022.9914220.
  17. M. Kalaiselvan and A. V. Kathiravan, "A pioneering tool for text summarization using star map," 2013 International Conference on Pattern Recognition, Informatics and Mobile Engineering, Salem, India, 2013, pp. 277-281, doi: 10.1109/ICPRIME.2013.6496486.
  18. IBM. (2021, January 11). Optical Character Recognition (OCR): A Complete Guide. IBM Cloud Blog. [Online] Available:https://www.ibm.com/cloud/blog/optical-character-recognition. [Accessed: 7-Mar-2023].
  19. AI Multiple. (n.d.). Handwriting Recognition: A Comprehensive Guide. [Online] Available: https://research.aimultiple.com/handwriting-recognition/. [Accessed: 3-Mar-2023].
  20. Kundu, S. (2021). Handwritten Text Recognition: A Brief Review. In B. Gupta, D. Mandal, R. Kumar, & H. Gupta (Eds.), Emerging Technologies for Agriculture and Environment (pp. 309-318). Springer.
  21. V. K. Vaisakh and L. B. Das, "Handwritten Malayalam Character Recognition System using Artificial Neural Networks," 2020 IEEE International Students' Conference on Electrical, Electronics and Computer Science (SCEECS), Bhopal, India, 2020, pp. 1-4, doi: 10.1109/SCEECS48394.2020.101.
  22. M. Grůber, "Acoustic analysis of czech expressive recordings from a single speaker in terms of various communicative functions," 2011 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT), Bilbao, Spain, 2011, pp. 293-298, doi: 10.1109/ISSPIT.2011.6151576.
  23. S. M. Kim, C. J. Chun and H. K. Kim, "Multi-channel audio recording based on super directive beam forming for portable multimedia recording devices," in IEEE Transactions on Consumer Electronics, vol. 60, no. 3, pp. 429-435, Aug. 2014, doi: 10.1109/TCE.2014.6937327.
  24. S. Das et al., "Hand-Written and Machine-Printed Text Classification in Architecture, Engineering & Construction Documents," 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), Niagara Falls, NY, USA, 2018, pp. 546-551, doi: 10.1109/ICFHR-2018.2018.00101.
  25. F. Bourgault, T. Furukawa and H. F. Durrant-Whyte, "Process model, constraints, and the coordinated search strategy," IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004, New Orleans, LA, USA, 2004, pp. 5256-5261, doi: 10.1109/ROBOT.2004.1302552.
  26. F. M. Hanis, H. Khoshvaghti, M. Teimouri and H. Veisi, "A Language-Independent Approach to Classification of Textual File Fragments: Case Study of Persian, English, and Chinese Languages," 2021 11th International Conference on Computer Engineering and Knowledge (ICCKE), Mashhad, Iran, Islamic Republic of, 2021, pp. 254-259, doi: 10.1109/ICCKE54056.2021.9721512.
  27. J. Sanguinetti, "High Level Design: The Future is Now," 2005 18th Symposium on Integrated Circuits and Systems Design, Florianopolis, Brazil, 2005, pp. 5-5, doi: 10.1109/SBCCI.2005.4286820.
  28. G. Vamvakas, B. Gatos and S. J. Perantoni, "A Novel Feature Extraction and Classification Methodology for the Recognition of Historical Documents," 2009 10th International Conference on Document Analysis and Recognition, Barcelona, Spain, 2009, pp. 491-495, doi: 10.1109/ICDAR.2009.223.
  29. A.Khalid, W. Q. Khan, R. Q. Khan and H. M. P. Memon, "Google image suggestions, extension for unfamiliar terminologies," 2017 International Conference on Information and Communication Technologies (ICICT), Karachi, Pakistan, 2017, pp. 16-22, doi: 10.1109/ICICT.2017.8320158.