dc.description.abstract |
The rapid spread of misinformation on social media poses significant societal risks. This study develops an interpretable and accurate fake-news detection system that combines traditional supervised learning with Explainable AI (XAI). Two large, publicly available datasets were merged into a single corpus and preprocessed (cleaning, tokenization, stop-word removal, and lemmatization). Features were extracted with TF-IDF and n-gram vectorization (unigrams and bigrams). We trained a Support Vector Machine (SVM), Logistic Regression, and a Passive-Aggressive classifier, and evaluated them on a held-out test set. The SVM achieved the highest accuracy (93.5%), outperforming Logistic Regression (91.2%) and the Passive-Aggressive classifier (90.5%); precision, recall, and F1-score supported the same ranking. Confusion-matrix analysis indicated balanced detection across the fake and real classes. To improve transparency, we applied SHAP and LIME to highlight the tokens and n-grams most influential in global trends and in individual predictions, enabling users to verify the cues driving each classification. These results show that pairing efficient linear text models with post-hoc explanations can deliver competitive accuracy while mitigating black-box concerns. Limitations include vulnerability to evolving adversarial writing styles, limited multilingual coverage, and the lack of real-time inference support. Future work will explore continual/online learning, cross-lingual transfer, and streaming deployment to improve robustness and broaden applicability. |
en_US |