Hate Speech Detection Using Social Media Discourse: A Multilingual Approach With Large Language Model

Authors

  • Muhammad Ahmad Author
  • Muhammad Usman Author
  • Sulaiman Khan Author
  • Muhammad Muzamil Author
  • Ameer Hamza Author
  • Muhammad Jalal Author
  • Ildar Batyrshin Author
  • Usman Sardar Author
  • Carlos Aguilar-Ibañez Author

DOI:

https://doi.org/10.53555/AJBR.v28i2S.6805

Keywords:

LLM, GPT, Machine learning, CNN, BERT algorithm, Social media, SVM

Abstract

Online social networks (OSN) and microblogging websites are attracting Internet users and have revolutionized how we communicate with individuals, share their feelings, and exchange ideas across the world with ease. In the extensive age of social media, there is increasing online hate speech, which can provoke violence and contribute to societal division. Hate speech based on race, gender, or religion puts those affected at risk of mental health problems and exacerbates social problems. While current protocols have reduced overt hate speech, subtler forms known as implicit hate speech have emerged, making detection more challenging. This study focuses on hate speech detection using social media discourse, by creating a comprehensive multilingual dataset [25] in Urdu and English and applied multiple machine learning, deep learning, transfer learning, and Large Language model models, such as GPT-3.5 Turbo. By comparing GPT-3.5 Turbo, we identified the effectiveness of large language models in detecting both explicit and implicit forms of hate speech. Our analysis underscores the potential of automated classification systems to reduce reliance on human intervention and to promote constructive online discourse. Our proposed methodology achieved the highest accuracy of 0.91, and achieved the highest performance improvement of 5.81% over transformer models such as BERT. This research adds to the growing body of work on multilingual natural language processing (NLP) and offers insights for reducing hate speech and fostering respectful communication across diverse communities.

Author Biographies

  • Muhammad Ahmad

    Centro de Investigación en Computación, Instituto Politécnico Nacional (CIC- PN), Mexico City 07738, Mexico

  • Muhammad Usman

    Centro de Investigación en Computación, Instituto Politécnico Nacional (CIC- PN), Mexico City 07738, Mexico

  • Sulaiman Khan

    Centro de Investigación en Computación, Instituto Politécnico Nacional (CIC- PN), Mexico City 07738, Mexico

  • Muhammad Muzamil

    Department of Artificial Intelligence, Computer Science and Software Engineering, The Islamia University of Bahawalpur, 63100, Pakistan

  • Ameer Hamza

    Department of Artificial Intelligence, Computer Science and Software Engineering, The Islamia University of Bahawalpur, 63100, Pakistan

  • Muhammad Jalal

    Department of Artificial Intelligence, Computer Science and Software Engineering, The Islamia University of Bahawalpur, 63100, Pakistan

  • Ildar Batyrshin

    Centro de Investigación en Computación, Instituto Politécnico Nacional (CIC- PN), Mexico City 07738, Mexico

  • Usman Sardar

    School of Informatics and Robotics, Institute of Arts and Culture, Lahore 54000, Pakistan

  • Carlos Aguilar-Ibañez

    Centro de Investigación en Computación, Instituto Politécnico Nacional (CIC- PN), Mexico City 07738, Mexico

Downloads

Published

2025-02-08

Issue

Section

Research Article

How to Cite

Hate Speech Detection Using Social Media Discourse: A Multilingual Approach With Large Language Model. (2025). African Journal of Biomedical Research, 28(2S), 321-328. https://doi.org/10.53555/AJBR.v28i2S.6805

Most read articles by the same author(s)