Impact Factor (2025): 6.9
DOI Prefix: 10.47001/IRJIET
Vol 9 No 25 (2025): Volume 9, Special Issue of INSPIRE’25 April 2025 | Pages: 278-283
International Research Journal of Innovations in Engineering and Technology
OPEN ACCESS | Research Article | Published Date: 24-04-2025
PDFs are widely used for document sharing, but their popularity also makes them a common target for malware. The software, titled "PDF Malware Detection Using Machine Learning Models," aims to develop and compare ml learning models for detecting malware in PDFs. Using a Kaggle dataset containing examples of both hazardous and secure PDFs, various methods such as Random Forest, C5.0, J48, Support Vector Machines, AdaBoost, Deep Neural Networks, Gradient Boosting Machines, and K-Nearest Neighbors will be employed. The main goal is to attain high detection accuracy while integrating explainability to gain a deeper understanding of the models' behaviour. By leveraging machine learning techniques, this project seeks to enhance cybersecurity measures, offering a robust solution to identify and mitigate potential threats embedded in PDF documents.
PDF malware detection, machine learning, Random Forest, SVM, DNN, cybersecurity, malicious PDF, classification algorithms, Kaggle dataset
A.Komala, Boya Chandu, & Medivala Reddy Hemanth. (2025). PDF Malware Detection Using Machine Learning Models. In proceeding of International Conference on Sustainable Practices and Innovations in Research and Engineering (INSPIRE'25), published by IRJIET, Volume 9, Special Issue of INSPIRE’25, pp 278-283. Article DOI https://doi.org/10.47001/IRJIET/2025.INSPIRE45
This work is licensed under Creative common Attribution Non Commercial 4.0 Internation Licence