Eshan, Shahnoor C. and HASAN, Mohammad (2017) An application of Machine Learning to Detect Abusive Bengali Text. In: 20th International Conference on Computer and Information Technology (ICCIT 2017), 22 Dec 2017, Dhaka, Bangladesh.
shahnoor-Abusive-Bengali-text-detection-v8-CR-temp.pdf - AUTHOR'S ACCEPTED Version (default)
Available under License Type All Rights Reserved.
Download (1MB) | Preview
Abstract or description
Bengali abusive text detection can be useful to prevent cyberbullying and online harassment as these types of crimes are increasing rapidly in Bangladesh. Machine learning approach can be useful to keep the system always updated with the new types of approaches used by the abusers. This paper investigates machine learning algorithms e.g. Random Forest, Multinomial Naïve Bayes, Support Vector Machine (SVM) with Linear, Radial Basis Function (RBF), Polynomial and Sigmoid kernel and have compared with unigram, bigram and trigram based CountVectorizer and TfidfVectorizer features. The results show that SVM Linear kernel performs the best with trigram TfidfVectorizer features.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Faculty: | School of Computing and Digital Technologies > Computing |
Event Title: | 20th International Conference on Computer and Information Technology (ICCIT 2017) |
Event Location: | Dhaka, Bangladesh |
Event Dates: | 22 Dec 2017 |
Depositing User: | Mohammad HASAN |
Date Deposited: | 12 Mar 2018 14:08 |
Last Modified: | 24 Feb 2023 13:50 |
URI: | https://eprints.staffs.ac.uk/id/eprint/4244 |