Abstract
Cyberbullying detection has become a pressing need in Internet usage governance because of the harmful consequences of cyberbullying. Various approaches have been proposed to tackle this problem, including deep learning. In this paper, an empirical study is conducted to evaluate the effectiveness and efficiency of deep learning algorithms, coupled with word embeddings, in detecting cyberbullying texts. Three deep learning algorithms were evaluated, namely GRU, LSTM and BLSTM. Data pre-processing steps, including oversampling, were performed on the selected social media datasets. For feature representation, four word embedding models were explored: word2vec, GloVe, Reddit and ELMo. ELMo takes word context into account by capturing information from a word's surroundings, which eliminates some of the shortcomings of pre-trained word embedding models. For more reliable results, 10-fold cross-validation was used. The experimental results show that BLSTM performs best with ELMo in detecting cyberbullying texts. The efficiency of each model is also measured by calculating the average time taken to train it. GRU is the most time-efficient. Based on an analysis of false negative cases, three observations were made, which highlight the limitations of word embedding models on top of the GRU algorithm in cyberbullying detection.
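The evaluation protocol summarised in the abstract (oversampling, 10-fold cross-validation, average training time per model) can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation. The helper names (`k_fold_indices`, `oversample`, `cross_validate`), the placeholder `train_fn` and the toy labels are all hypothetical. Oversampling only the training folds is also an assumption, since the abstract does not say at which stage oversampling was applied.

```python
import random
import time
from collections import Counter
from statistics import mean


def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle sample indices and split them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def oversample(indices, labels, seed=0):
    """Randomly duplicate minority-class samples until classes are balanced."""
    rng = random.Random(seed)
    by_class = {}
    for i in indices:
        by_class.setdefault(labels[i], []).append(i)
    target = max(len(members) for members in by_class.values())
    balanced = []
    for members in by_class.values():
        balanced.extend(members)
        # Draw the missing samples with replacement from the same class.
        balanced.extend(rng.choices(members, k=target - len(members)))
    return balanced


def cross_validate(labels, train_fn, k=10, seed=0):
    """Run k-fold CV; return per-fold class counts and mean training time."""
    folds = k_fold_indices(len(labels), k, seed)
    durations, class_counts = [], []
    for held_out in range(k):
        train_idx = [i for f, fold in enumerate(folds) if f != held_out for i in fold]
        # Assumption: oversample the training portion only, so duplicated
        # samples never appear in the held-out fold.
        train_idx = oversample(train_idx, labels, seed)
        start = time.perf_counter()
        train_fn(train_idx)  # placeholder for fitting a GRU, LSTM or BLSTM
        durations.append(time.perf_counter() - start)
        class_counts.append(Counter(labels[i] for i in train_idx))
    return class_counts, mean(durations)


# Toy imbalanced labels (made-up data): 1 = bullying, 0 = not bullying.
labels = [1] * 20 + [0] * 80
counts, avg_seconds = cross_validate(labels, train_fn=lambda idx: None)
```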
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 2019 2nd International Conference on Computational Intelligence and Intelligent Systems (CIIS 2019) |
| Editors | Ray C.C. Cheung, Prasad KDV Yarlagadda |
| Place of Publication | New York NY USA |
| Publisher | Association for Computing Machinery (ACM) |
| Pages | 17-21 |
| Number of pages | 5 |
| ISBN (Electronic) | 9781450372596 |
| Publication status | Published - 2019 |
| Event | International Conference on Computing Intelligence and Information System 2019, Bangkok, Thailand. Duration: 23 Nov 2019 → 25 Nov 2019. Conference number: 2nd. Proceedings: https://dl.acm.org/doi/proceedings/10.1145/3372422 |
Conference
| Conference | International Conference on Computing Intelligence and Information System 2019 |
|---|---|
| Abbreviated title | CIIS 2019 |
| Country/Territory | Thailand |
| City | Bangkok |
| Period | 23/11/19 → 25/11/19 |
Keywords
- Cyberbullying
- Deep Learning
- ELMo
- Word Embeddings