loading page

Hate Speech Recognition in multilingual text: Hinglish Documents
  • +3
  • arun kumar yadav ,
  • Abhishek Kumar ,
  • Shivani . ,
  • Kusum . ,
  • Mohit Kumar ,
  • Divakar Yadav
arun kumar yadav
Author Profile
Abhishek Kumar
Author Profile
Shivani .
Author Profile
Mohit Kumar
National Institute of Technology Hamirpur

Corresponding Author:[email protected]

Author Profile
Divakar Yadav
Author Profile


In this paper, we apply and evaluate several machine learning and deep learning methods, along with various feature extraction and word-embedding techniques, on a consolidated dataset of 20600 instances, for hate speech detection from tweets and comments in Hinglish. The experimental results reveal that deep learning models perform better than machine learning models in general. Among the deep learning models, the CNN-BiLSTM model with word2vec word embedding provides the best results. The model yields 0.876 accuracy, 0.830 precision, 0.840 recall and 0.835 F1-score. These results surpass the recent state-of-art approaches.
Mar 2023Published in International Journal of Information Technology volume 15 issue 3 on pages 1319-1331. 10.1007/s41870-023-01211-z