Impact Factor (2025): 6.9
DOI Prefix: 10.47001/IRJIET
Vol 10 No 5 (2026): Volume 10, Issue 5, May 2026 | Pages: 271-283
International Research Journal of Innovations in Engineering and Technology
OPEN ACCESS | Research Article | Published Date: 15-05-2026
Domestic flight ticket pricing in India is influenced by multiple dynamic factors including airline carrier, travel route, departure timing, flight duration, and number of intermediate stops. Retrieving accurate fare estimates remains challenging due to continuously changing market demand, seasonal fluctuations, and varying airline pricing strategies. Existing travel and booking platforms primarily display current ticket prices and provide limited support for predictive fare estimation and intelligent booking analysis. Traditional statistical approaches often fail to capture complex nonlinear relationships between journey attributes and ticket prices, reducing prediction reliability across diverse travel scenarios.
Feature ranking by both mutual information regression and Random Forest importance scores confirms that Duration_total_mins is the dominant predictor, contributing 50.5% of ensemble split importance. Six regression algorithms are benchmarked under identical 75/25 train-test conditions: Linear Regression, Decision Tree, k-Nearest Neighbours (k=5), Gradient Boosting, XGBoost, and Random Forest. XGBoost achieves the highest test-set R² of 0.8312 with a Root Mean Squared Error of 1,812.78 INR and a Mean Absolute Percentage Error of 13.36%. The Random Forest tuned via RandomizedSearchCV with three-fold cross-validation attains R² = 0.8282 and RMSE = 1,828.80 INR. The serialised model artefact is stored as a Pickle file for deployment in booking advisory applications.
Flight price prediction, XGBoost, Random Forest, feature engineering, mutual information, ensemble regression, Indian domestic aviation, hyperparameter tuning, RandomizedSearchCV, machine learning, fare estimation, duration-based prediction.
Gosala Venkata Charvi, Gaddam Samuel Kiran Babu, Y Pavan Narsimha Rao, Dr.Meera Alphy, & Dr.M.Aruna. (2026). Comparative Analysis and Full-Stack Deployment of Machine Learning Models for Flight Fare Prediction. International Research Journal of Innovations in Engineering and Technology - IRJIET, 10(5), 271-283. Article DOI https://doi.org/10.47001/IRJIET/2026.105037
This work is licensed under Creative common Attribution Non Commercial 4.0 Internation Licence
F. Pedregosa et al., “Scikit-learn: Machine Learning in Python,” Journal of Machine Learning Research, vol. 12, pp. 2825–2830, 2011.
T. Chen and C. Guestrin, “XGBoost: A Scalable Tree Boosting System,” in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016, pp. 785–794.
J. H. Friedman, “Greedy Function Approximation: A Gradient Boosting Machine,” Annals of Statistics, vol. 29, no. 5, pp. 1189–1232, 2001.
L. Bottou and V. Vapnik, “Local Learning Algorithms,” Neural Computation, vol. 4, no. 6, pp. 888–900, 1992.
D. Dua and C. Graff, “UCI Machine Learning Repository,” University of California, Irvine, School of Information and Computer Sciences, 2019.
Pandas Development Team, “Pandas Documentation,” 2025.
NumPy Developers, “NumPy Documentation,” 2025.
Scikit-learn Developers, “Scikit-learn User Guide,” 2025.
FastAPI Developers, “FastAPI Documentation,” 2025.
React Team, “React Documentation,” 2025.
Uvicorn Contributors, “Uvicorn ASGI Server Documentation,” 2025.
Vercel Inc., “Vercel Platform Documentation,” 2025.
Kaggle, “Flight Fare Prediction Dataset,” 2025.
S. Raschka and V. Mirjalili, Python Machine Learning: Machine Learning and Deep Learning with Python, Scikit-learn, and TensorFlow, 3rd ed., Packt Publishing, 2019.