TextMind: Design and Implementation of a Text-Based AI Assistant with Multi-Tier NLP Architecture and Contextual Dialogue Management

Apeksha Ashish RangariStudent, Department of Computer Science and Engineering, Shri Sai College of Engineering and Technology (SSCET), DBATU University, Bhadrawati, Chandrapur, Maharashtra, IndiaSanjana Sandip DeotaleStudent, Department of Computer Science and Engineering, Shri Sai College of Engineering and Technology (SSCET), DBATU University, Bhadrawati, Chandrapur, Maharashtra, IndiaSneha Suryabhan ChideStudent, Department of Computer Science and Engineering, Shri Sai College of Engineering and Technology (SSCET), DBATU University, Bhadrawati, Chandrapur, Maharashtra, IndiaAwantika Praful DhapteStudent, Department of Computer Science and Engineering, Shri Sai College of Engineering and Technology (SSCET), DBATU University, Bhadrawati, Chandrapur, Maharashtra, IndiaAnushri Dadaji AskarStudent, Department of Computer Science and Engineering, Shri Sai College of Engineering and Technology (SSCET), DBATU University, Bhadrawati, Chandrapur, Maharashtra, IndiaJayanti A. ParasharAssistant Professor, Department of Computer Science and Engineering, Shri Sai College of Engineering and Technology (SSCET), DBATU University, Bhadrawati, Chandrapur, Maharashtra, India

Vol 10 No 5 (2026): Volume 10, Issue 5, May 2026 | Pages: 47-55

International Research Journal of Innovations in Engineering and Technology

OPEN ACCESS | Research Article | Published Date: 05-05-2026

doi Logo doi.org/10.47001/IRJIET/2026.105007

Abstract

The emergence of large language models (LLMs) and advances in natural language processing (NLP) have fundamentally transformed human-computer interaction, making text-based conversational agents a viable medium for delivering intelligent, context-aware information services. This paper presents TextMind, a full-stack text-based AI assistant engineered at Shri Sai College of Engineering and Technology (SSCET), DBATU University, Chandrapur. The system implements a multi-tier NLP architecture comprising a fine-tuned BERT-based intent classifier, Retrieval-Augmented Generation (RAG) with FAISS dense vector search over a domain knowledge base, OpenAI GPT-3.5 Turbo and GPT-4 for open-domain response generation, a finite-state dialogue manager for multi-turn context retention, and a 16-category rule-based NLP fallback layer ensuring 100% non-null response coverage. A browser-accessible Python Flask backend exposes a RESTful chat API consumed by a responsive HTML/CSS/JavaScript frontend with real-time AJAX messaging. Evaluation across 500 test queries achieved intent classification accuracy of 94.6%, entity recognition F1 of 91.3%, BLEU-4 of 0.68, and BERTScore F1 of 0.87. Pilot evaluation with 45 student users yielded satisfaction scores of 4.3–4.5/5.0 across relevance, clarity, and ease-of-use dimensions. TextMind demonstrates that an open-source, lightweight web stack augmented with pre-trained LLM APIs can deliver expert-quality, domain-aware conversational assistance at near-zero infrastructure cost, establishing a reproducible reference architecture for AI assistant development in resource-constrained academic environments.

Keywords

Text-Based AI Assistant; Natural Language Processing; Intent Classification; BERT; Retrieval-Augmented Generation; FAISS; OpenAI GPT-4; Dialogue Management; Conversational AI; Named Entity Recognition; Flask; Rule-Based NLP; SSCET; DBATU University


Citation of this Article

Apeksha Ashish Rangari, Sanjana Sandip Deotale, Sneha Suryabhan Chide, Awantika Praful Dhapte, Anushri Dadaji Askar, & Jayanti A. Parashar. (2026). TextMind: Design and Implementation of a Text-Based AI Assistant with Multi-Tier NLP Architecture and Contextual Dialogue Management. International Research Journal of Innovations in Engineering and Technology - IRJIET, 10(5), 47-55. Article DOI https://doi.org/10.47001/IRJIET/2026.105007

References
  1. A.Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, "Attention is all you need," in Proc. Adv. Neural Inf. Process. Syst. (NeurIPS), vol. 30, pp. 5998–6008, 2017.
  2. OpenAI, "GPT-4 Technical Report," arXiv preprint arXiv:2303.08774, Mar. 2023. [Online]. Available: https://arxiv.org/abs/2303.08774
  3. Z. Ji, N. Lee, R. Frieske, T. Yu, D. Su, Y. Xu, E. Ishii, Y. Bang, A. Madotto, and P. Fung, "Survey of hallucination in natural language generation," ACM Comput. Surv., vol. 55, no. 12, pp. 1–38, Mar. 2023, doi: 10.1145/3571730.
  4. Ministry of Education, Government of India, "All India Survey on Higher Education (AISHE) 2021–22," New Delhi, 2023. [Online]. Available: https://aishe.gov.in
  5. J. Weizenbaum, "ELIZA — a computer program for the study of natural language communication between man and machine," Commun. ACM, vol. 9, no. 1, pp. 36–45, Jan. 1966, doi: 10.1145/365153.365168.
  6. J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, "BERT: Pre-training of deep bidirectional Transformers for language understanding," in Proc. NAACL-HLT, pp. 4171–4186, 2019.
  7. S. Bubeck, V. Chandrasekaran, R. Eldan, J. Gehrke, E. Horvitz, E. Kamar, P. Lee, et al., "Sparks of artificial general intelligence: Early experiments with GPT-4," arXiv preprint arXiv:2303.12712, 2023.
  8. R. E. Schapire and Y. Singer, "BoosTexter: A boosting-based system for text categorization," Mach. Learn., vol. 39, no. 2–3, pp. 135–168, 2000, doi: 10.1023/A:1007649029923.
  9. Y. Liu, M. Ott, N. Goyal, J. Du, M. Joshi, D. Chen, O. Levy, M. Lewis, L. Zettlemoyer, and V. Stoyanov, "RoBERTa: A robustly optimized BERT pretraining approach," arXiv preprint arXiv:1907.11692, 2019.
  10. E. F. Tjong Kim Sang and F. De Meulder, "Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition," in Proc. HLT-NAACL (CoNLL), pp. 142–147, 2003.
  11. P. Lewis, E. Perez, A. Piktus, F. Petroni, V. Karpukhin, N. Goyal, H. Küttler, M. Lewis, W. Yih, T. Rocktäschel, S. Riedel, and D. Kiela, "Retrieval-augmented generation for knowledge-intensive NLP tasks," in Proc. NeurIPS, vol. 33, pp. 9459–9474, 2020.
  12. K. Guu, K. Lee, Z. Tung, P. Pasupat, and M.-W. Chang, "REALM: Retrieval-augmented language model pre-training," in Proc. ICML, vol. 119, pp. 3929–3938, 2020.
  13. G. Izacard and E. Grave, "Leveraging passage retrieval with generative models for open domain question answering," in Proc. EACL, pp. 874–880, 2021.
  14. S. Young, M. Gašić, B. Thomson, and J. D. Williams, "POMDP-based statistical spoken dialogue systems: A review," Proc. IEEE, vol. 101, no. 5, pp. 1160–1179, 2013, doi: 10.1109/JPROC.2012.2228254.
  15. H. Hosseini-Asl, B. McCann, C.-S. Wu, S. Yavuz, and R. Socher, "A simple language model for task-oriented dialogue," in Proc. NeurIPS, vol. 33, pp. 20179–20191, 2020.
  16. J. A. Pérez, E. Daradoumis, and J. M. Puig, "Rediscovering the use of chatbots in education: A systematic literature review," Comput. Educ. Artif. Intell., vol. 1, p. 100011, 2020, doi: 10.1016/j.caeai.2020.100011.
  17. M. E. Peters, M. Neumann, M. Iyyer, M. Gardner, C. Clark, K. Lee, and L. Zettlemoyer, "Deep contextualized word representations," in Proc. NAACL-HLT, pp. 2227–2237, 2018.
  18. N. Reimers and I. Gurevych, "Sentence-BERT: Sentence embeddings using Siamese BERT-networks," in Proc. EMNLP-IJCNLP, pp. 3982–3992, 2019.
  19. J. Johnson, M. Douze, and H. Jégou, "Billion-scale similarity search with GPUs," IEEE Trans. Big Data, vol. 7, no. 3, pp. 535–547, Jun. 2021, doi: 10.1109/TBDATA.2019.2921572.
  20. Flask Documentation, Pallets Project, 2024. [Online]. Available: https://flask.palletsprojects.com/