loading page

Regional language toxic comment classification
  • Yashkumar Parikh ,
  • Jinan Fiaidhi
Yashkumar Parikh
Lakehead University

Corresponding Author:[email protected]

Author Profile
Jinan Fiaidhi
Author Profile

Abstract

Social media sites are gaining popularity day by day. They are best for communication, business, entertainment, and many other things. After more than a decade, social media have become very influential. On the flip side, fake news, hate speech, and online trolls are the biggest concerns because of social media. So, a solution to curb this issue is needed, especially in regional languages. Many social media platforms support regional languages. This paper will provide a machine learning-based solution to this problem. The focus of this paper is to classify comments written in regional languages. Firstly, a dataset has been created in Gujarati, Hindi, English, Marathi, and Punjabi languages. After that, different machine learning and deep learning models are applied to the multilingual dataset. At last, a comparison of all model performances was made.