Impact Factor (2025): 6.9
DOI Prefix: 10.47001/IRJIET
Vol 7 No 10 (2023): Volume 7, Issue 10, October 2023 | Pages: 97-107
International Research Journal of Innovations in Engineering and Technology
OPEN ACCESS | Research Article | Published Date: 26-10-2023
SinLingua is a novel Python library designed to advance Sinhala Natural Language Processing (NLP). This work covers four areas: Singlish to Sinhala conversion, Sinhala text data cleaning and pre-processing, Sinhala grammar correction, and Sinhala text summarization and translation. Each component is designed to prioritize accuracy, speed, customization, and user experience. The Singlish to Sinhala converter recognizes Singlish text and translates it into formal Sinhala, addressing the scarcity of existing tools in this domain. The Sinhala text cleaning and pre-processing function employs optimized rule-based mechanisms to handle the intricacies of the Sinhala language's morphological structures. The Sinhala grammar checker transforms informal Sinhala sentences into formal ones. Finally, the text summarization and translation module condenses Sinhala articles and offers translation into English. It provides customization options for summarization parameters, such as word count limits and language translation. The results are promising, with identified prospects for future enhancements, particularly in handling intricate grammatical structures and extending user customization features.
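To make the idea of rule-based cleaning concrete, the following is a minimal sketch of one common rule: keep only characters from the Sinhala Unicode block (U+0D80 to U+0DFF) and collapse the remaining whitespace. It is an illustration under stated assumptions, not SinLingua's actual API or rule set; the function name `clean_sinhala` and the choice to preserve the zero-width joiner are hypothetical.

```python
import re
import unicodedata

# Assumption: "Sinhala text" means characters in the Sinhala Unicode block
# (U+0D80 to U+0DFF). The zero-width joiner (U+200D) is also kept because
# Sinhala conjunct forms rely on it.
_NON_SINHALA = re.compile(r"[^\u0D80-\u0DFF\u200D\s]")
_WHITESPACE = re.compile(r"\s+")


def clean_sinhala(text: str) -> str:
    """Hypothetical helper (not part of SinLingua): strip non-Sinhala characters."""
    # Normalize to NFC so split vowel signs are stored in composed form.
    text = unicodedata.normalize("NFC", text)
    # Replace every non-Sinhala, non-whitespace character with a space.
    text = _NON_SINHALA.sub(" ", text)
    # Collapse runs of whitespace and trim both ends.
    return _WHITESPACE.sub(" ", text).strip()


if __name__ == "__main__":
    print(clean_sinhala("Hello මම  ගෙදර යනවා! 123"))
```

A real pipeline would likely layer further rules on top of this (for example stop-word removal or punctuation handling), which this sketch does not attempt.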
Sinhala Natural Language Processing (NLP), Singlish to Sinhala conversion, Sinhala text data cleaning and preprocessing, Sinhala grammar correction, Sinhala text summarization and translation, Python library, Machine Learning (ML)
Supun Sameera, Sandaruwini Galappaththi, Sarada Wijesinghe, Binura Yasodya, Anjalie Gamage, Bhagyanie Chathurika, “SinLingua: Python Library for Sinhala Data Processing,” published in International Research Journal of Innovations in Engineering and Technology (IRJIET), Volume 7, Issue 10, pp. 97-107, October 2023. Article DOI: https://doi.org/10.47001/IRJIET/2023.710013
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.