Impact Factor (2025): 6.9
DOI Prefix: 10.47001/IRJIET
SinLingua,
a novel Python library designed to advance the domain of Sinhala Natural
Language Processing (NLP). The primary focus of this work encompasses four
distinct areas: Singlish to Sinhala conversion, Sinhala text data cleaning and
pre-processing, Sinhala grammar correction, and Sinhala text summarization and
translation. Each component is meticulously crafted to prioritize accuracy, speed,
customization, and user experience. The Singlish to Sinhala converter is
engineered to adeptly recognize and precisely translate Singlish text into
formal Sinhala, addressing the paucity of existing tools in this domain. The
Sinhala text cleaning and pre-processing function employs optimized rule-based
mechanisms to handle the intricacies of the Sinhala language's morphological
structures. Furthermore, the Sinhala grammar checker serves the purpose of
transforming informal Sinhala sentences into formal ones. Finally, the text
summarization and translation module proficiently condenses Sinhala articles
while offering translation into the English language. This system provides
customization options for summarization parameters, such as word count limits and
language translation. The results of this research demonstrate promise, with
identified prospects for future enhancements, particularly in the realm of
handling intricate grammatical structures and extending user customization
features.
Country : Sri Lanka
IRJIET, Volume 7, Issue 10, October 2023 pp. 97-107