An Ensemble Machine Learning Approach for Reliable Fake News Detection on Social Media Platforms

Authors

  • Javed Ali Mohammad Masters in telecommunications, Middlesex University. Author
  • Sri Harsha Panchali Information Systems Engineer, CrowdStrike Inc. Author
  • Usha Mohani kavirayani Kent State University, MS in Computer Science. Author
  • Krishna Bhardwaj Mylavarapu MS in Computer Science, University of Illinois Springfield. Author
  • Jenitha Pilli MS in Computer Science, University of Louisiana at Lafayette. Author
  • Prathik Kumar Jannu Computer Science Engineering, JNTU Hyderabad. Author

DOI:

https://doi.org/10.63282/3050-9262.IJAIDSML-V3I1P116

Keywords:

Fake News Detection, social media, Machine Learning, Text Classification, ROC-AUC, Class Imbalance, Machine learning

Abstract

The high rates at which such misinformation is spread on social media platforms have further increased the need to have a strong and automated fake news detection system. This paper uses the Fake News Net dataset, which combines news and social characteristics, to offer a workable machine learning (ML)-based method for detecting bogus news. The first step in the methodology consists of a thorough text preprocessing, such as tokenization, lemmatization, noise, normalization of cases, and elimination of stop words, to ensure high-quality input to model training. After the cleaned dataset has been further divided into train and test subsets, the two classification models, Light Gradient Boosting Machine (Light GBM) and Decision Tree (DT), are used. Light GBM has been proven to be better as it can reliably deal with high volumes of textual data, potentially involving intricate patterns in context and is fast thanks to its scalable nature and leaf-wise tree growing approach. Experimental investigations show that Light GBM achieves an accuracy score of 86.02, outperforming the Decision Tree model and popular models like Random Forest and Naive Bayes. Furthermore, performance measurements including as accuracy, recall, F1-score, confusion matrices, and ROC curves confirm the reliability of the proposed technique. The comparative analysis justifies that Light GBM has a greater generalized and stable classification ability. All things considered, the essay presents a practical and successful method for identifying fake news, which raises the material's trustworthiness and stops false information from spreading on social media.

References

[1] K. Rajesh, A. Kumar, and R. Kadu, “Fraudulent News Detection using Machine Learning Approaches,” in 2019 Global Conference for Advancement in Technology (GCAT), IEEE, Oct. 2019, pp. 1–5. doi: 10.1109/GCAT47503.2019.8978436.

[2] K. Poddar, G. B. Amali D., and K. S. Umadevi, “Comparison of Various Machine Learning Models for Accurate Detection of Fake News,” in 2019 Innovations in Power and Advanced Computing Technologies (i-PACT), IEEE, Mar. 2019, pp. 1–5. doi: 10.1109/i-PACT44901.2019.8960044.

[3] S. Garg, “Predictive Analytics and Auto Remediation using Artificial Inteligence and Machine learning in Cloud Computing Operations,” Int. J. Innov. Res. Eng. Multidiscip. Phys. Sci., vol. 7, no. 2, 2019, doi: 10.5281/zenodo.15362327.

[4] T. Traylor, J. Straub, S. Chaudhary, and N. Snell, “Classifying Fake News Articles Using Natural Language Processing to Identify In-Article Attribution as a Supervised Learning Estimator,” in 2019 IEEE 13th International Conference on Semantic Computing (ICSC), IEEE, Jan. 2019, pp. 445–449. doi: 10.1109/ICOSC.2019.8665593.

[5] S. Shabani and M. Sokhn, “Hybrid Machine-Crowd Approach for Fake News Detection,” in 2018 IEEE 4th International Conference on Collaboration and Internet Computing (CIC), IEEE, Oct. 2018, pp. 299–306. doi: 10.1109/CIC.2018.00048.

[6] N. Singh, T. Sharma, A. Thakral, and T. Choudhury, “Detection of Fake Profile in Online Social Networks Using Machine Learning,” in 2018 International Conference on Advances in Computing and Communication Engineering (ICACCE), IEEE, Jun. 2018, pp. 231–234. doi: 10.1109/ICACCE.2018.8441713.

[7] S. Achouche, S. Clara, CA, ( US ) Udaya Bhaskar Yalamanchi, U. City, and C. ( U. ) Nisheedh, “Method , Apparatus , And Computer - Readable Medium For Performing A Data Exchange,” 2019

[8] R. K. Kaliyar, A. Goswami, and P. Narang, “Multiclass Fake News Detection using Ensemble Machine Learning,” in 2019 IEEE 9th International Conference on Advanced Computing (IACC), IEEE, Dec. 2019, pp. 103–107. doi: 10.1109/IACC48062.2019.8971579.

[9] E. Qawasmeh, M. Tawalbeh, and M. Abdullah, “Automatic Identification of Fake News Using Deep Learning,” in 2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS), IEEE, Oct. 2019, pp. 383–388. doi: 10.1109/SNAMS.2019.8931873.

[10] I. Kareem and S. M. Awan, “Pakistani Media Fake News Classification using Machine Learning Classifiers,” in 2019 International Conference on Innovative Computing (ICIC), IEEE, Nov. 2019, pp. 1–6. doi: 10.1109/ICIC48496.2019.8966734.

[11] C. K. Hiramath and G. C. Deshpande, “Fake News Detection Using Deep Learning Techniques,” in 2019 1st International Conference on Advances in Information Technology (ICAIT), IEEE, Jul. 2019, pp. 411–415. doi: 10.1109/ICAIT47043.2019.8987258.

[12] S. Helmstetter and H. Paulheim, “Weakly Supervised Learning for Fake News Detection on Twitter,” in 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), IEEE, Aug. 2018, pp. 274–277. doi: 10.1109/ASONAM.2018.8508520.

[13] S. Girgis, E. Amer, and M. Gadallah, “Deep Learning Algorithms for Detecting Fake News in Online Text,” in 2018 13th International Conference on Computer Engineering and Systems (ICCES), IEEE, Dec. 2018, pp. 93–97. doi: 10.1109/ICCES.2018.8639198.

[14] M. L. Della Vedova, E. Tacchini, S. Moret, G. Ballarin, M. DiPierro, and L. de Alfaro, “Automatic Online Fake News Detection Combining Content and Social Signals,” in 2018 22nd Conference of Open Innovations Association (FRUCT), IEEE, May 2018, pp. 272–279. doi: 10.23919/FRUCT.2018.8468301.

[15] B. M. Amine, A. Drif, and S. Giordano, “Merging deep learning model for fake news detection,” in 2019 International Conference on Advanced Electrical Engineering (ICAEE), IEEE, Nov. 2019, pp. 1–4. doi: 10.1109/ICAEE47123.2019.9015097.

[16] K. Shu, S. Wang, and H. Liu, “Beyond news contents: The role of social context for fake news detection,” WSDM 2019 - Proc. 12th ACM Int. Conf. Web Search Data Min., no. i, pp. 312–320, 2019, doi: 10.1145/3289600.3290994.

[17] A. Jain, A. Shakya, H. Khatter, and A. K. Gupta, “A smart System for Fake News Detection Using Machine Learning,” in 2019 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT), IEEE, Sep. 2019, pp. 1–4. doi: 10.1109/ICICT46931.2019.8977659.

[18] W. Han and V. Mehta, “Fake News Detection in Social Networks Using Machine Learning and Deep Learning: Performance Evaluation,” in 2019 IEEE International Conference on Industrial Internet (ICII), IEEE, Nov. 2019, pp. 375–380. doi: 10.1109/ICII.2019.00070.

[19] Polu, A. R., Buddula, D. V. K. R., Narra, B., Gupta, A., Vattikonda, N., & Patchipulusu, H. (2021). Evolution of AI in Software Development and Cybersecurity: Unifying Automation, Innovation, and Protection in the Digital Age. Available at SSRN 5266517.

[20] Padur, S. K. R. (2020). From centralized control to democratized insights: Migrating enterprise reporting from IBM Cognos to Microsoft Power BI. Int. J. Sci. Res. Comput. Sci. Eng. Inf. Technol, 6(1), 218-225.

[21] Bitkuri, V., Kendyala, R., Kurma, J., Mamidala, V., Enokkaren, S. J., & Attipalli, A. (2021). Systematic Review of Artificial Intelligence Techniques for Enhancing Financial Reporting and Regulatory Compliance. International Journal of Emerging Trends in Computer Science and Information Technology, 2(4), 73-80.

[22] Padur, S. K. R. (2019). Machine learning for predictive capacity planning: Evolution from analytical modeling to autonomous infrastructure. International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 5(5), 285-293.

[23] Attipalli, A., Enokkaren, S., BITKURI, V., Kendyala, R., KURMA, J., & Mamidala, J. V. (2021). Enhancing Cloud Infrastructure Security Through AI-Powered Big Data Anomaly Detection. Available at SSRN 5741305.

[24] Singh, A. A. S., Tamilmani, V., Maniar, V., Kothamaram, R. R., Rajendran, D., & Namburi, V. D. (2021). Predictive Modeling for Classification of SMS Spam Using NLP and ML Techniques. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 2(4), 60-69.

[25] Padur, S. K. R. (2020). AI augmented disaster recovery simulations: From chaos engineering to autonomous resilience orchestration. International Journal of Scientific Research in Science, Engineering and Technology, 7(6), 367-378.

[26] Reddy Padur, S. K. (2021). From Scripts to Platforms-as-Code: The Role of Terraform and Ansible in Declarative Infrastructure Rollouts. International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 621-628.

[27] Kothamaram, R. R., Rajendran, D., Namburi, V. D., Singh, A. A. S., Tamilmani, V., & Maniar, V. (2021). A Survey of Adoption Challenges and Barriers in Implementing Digital Payroll Management Systems in Across Organizations. International Journal of Emerging Research in Engineering and Technology, 2(2), 64-72.

[28] Padur, S. K. R. (2018). Autonomous cloud economics: AI driven right sizing and cost optimization in hybrid infrastructures. International Journal of Scientific Research in Science and Technology, 4(5), 2090-2097.

[29] Rajendran, D., Namburi, V. D., Singh, A. A. S., Tamilmani, V., Maniar, V., & Kothamaram, R. R. (2021). Anomaly Identification in IoT-Networks Using Artificial Intelligence-Based Data-Driven Techniques in Cloud Environmen. International Journal of Emerging Trends in Computer Science and Information Technology, 2(2), 83-91.

[30] Padur, S. K. R. (2021). Bridging Human, System, and Cloud Integration through RESTful Automation and Governance. the International Journal of Science, Engineering and Technology, 9(6).

[31] Attipalli, A., BITKURI, V., KURMA, J., Enokkaren, S., Kendyala, R., & Mamidala, J. V. (2021). A Survey of Artificial Intelligence Methods in Liquidity Risk Management: Challenges and Future Directions. Available at SSRN 5741342.

[32] Padur, S. K. R. (2021). From Control to Code: Governance Models for Multi-Cloud ERP Modernization. International Journal of Scientific Research & Engineering Trends, 7(3).

[33] Routhu, K. K. (2021). Harnessing AI Dashboards in Oracle Cloud HCM: Advancing Predictive Workforce Intelligence and Managerial Agility. International Journal of Scientific Research & Engineering Trends, 7(6).

[34] Padur, S. K. R. (2021). Deep learning and process mining for ERP anomaly detection: Toward predictive and self-monitoring enterprise platforms. Available at SSRN 5605531.

Published

2022-03-30

Issue

Section

Articles

How to Cite

1.
Mohammad JA, Panchali SH, kavirayani UM, Mylavarapu KB, Pilli J, Jannu PK. An Ensemble Machine Learning Approach for Reliable Fake News Detection on Social Media Platforms. IJAIDSML [Internet]. 2022 Mar. 30 [cited 2026 Apr. 29];3(1):152-61. Available from: https://ijaidsml.org/index.php/ijaidsml/article/view/484