Leveraging part-of-speech tagging for sentiment analysis in short texts and regular texts

Wai-Howe Khong, Lay-Ki Soon, Hui-Ngo Goh, Su-Cheng Haw

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

22 Citations (Scopus)

Abstract

Sentiment analysis has been approached from a spectrum of methodologies, including statistical learning using labelled corpus and rule-based approach where rules may be constructed based on the observations on the lexicons as well as the output from natural language processing tools. In this paper, the experiments to transform labelled datasets by using NLP tools and subsequently performing sentiment analysis via statistical learning algorithms are detailed. In addition to the common data pre-processing prior to sentiment analysis, we represent the tokens in the datasets using Part-Of-Speech (POS) tags. The aim of the experiments is to investigate the impact of POS tags on sentiment analysis, particularly on both short texts and regular texts. The experimental results on short texts show that the combination of adjective and adverb predicts the sentiment of short texts the best. While noun is generally deemed to be neutral in sentiment polarity, the experimental results show that it helps to increase the accuracy of sentiment analysis on regular texts. Besides, the role of negation analysis in the datasets has also been investigated and reported based on the experimental results obtained.

Original languageEnglish
Title of host publicationSemantic Technology
Subtitle of host publication8th Joint International Conference, JIST 2018 Awaji, Japan, November 26–28, 2018 Proceedings
EditorsRyutaro Ichise, Freddy Lecue, Takahiro Kawamura, Dongyan Zhao, Stephen Muggleton, Kouji Kozaki
Place of PublicationCham Switzerland
PublisherSpringer
Pages182-197
Number of pages16
ISBN (Electronic)9783030042844
ISBN (Print)9783030042837
DOIs
Publication statusPublished - 2018
EventJoint International Semantic Technology Conference 2018 - Awaji, Japan
Duration: 26 Nov 201828 Nov 2018
Conference number: 8th
http://JIST 2018 (Proceedings)

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume11341
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceJoint International Semantic Technology Conference 2018
Abbreviated titleJIST 2018
Country/TerritoryJapan
CityAwaji
Period26/11/1828/11/18
Internet address

Keywords

  • Part-of-speech tagging
  • Sentiment analaysis

Cite this