An exploratory study on Latent-Dirichlet Allocation models for aspect identification on short sentences

Ameer Abu Bakar, Lay-Ki Soon, Hui-Ngo Goh

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

1 Citation (Scopus)

Abstract

This paper reports an exploratory study conducted to investigate the performance of topic modelling algorithms in aspect identification. Aspect identification is an important step in aspect-based sentiment analysis. Latent-Dirichlet Allocation model serves as the baseline of topic models in the experiments. One of the variations of LDA, namely Phrase-LDA was experimented to benchmark its performance against the original LDA. Although it was reported that PLDA performs better compared to LDA in aspect-based sentiment analysis, our experimental results indicate that LDA works better on dataset with short sentences. A new PLDA model was also proposed by using different types of dependencies to extract the phrases.

Original languageEnglish
Title of host publicationComputational Science and Technology
Subtitle of host publication4th ICCST 2017, Kuala Lumpur, Malaysia, 29–30 November, 2017
EditorsRayner Alfred, Hiroyuki Iida, Ag. Asri Ag. Ibrahim, Yuto Lim
Place of PublicationSingapore Singapore
PublisherSpringer
Pages314-323
Number of pages10
ISBN (Electronic)9789811082764
ISBN (Print)9789811082757
DOIs
Publication statusPublished - 2018
Externally publishedYes
EventInternational Conference on Computational Science and Technology 2017 - Kuala Lumpur, Malaysia
Duration: 29 Nov 201730 Nov 2017
Conference number: 4th
https://link.springer.com/book/10.1007/978-981-10-8276-4 (Proceedings)

Publication series

NameLecture Notes in Electrical Engineering
PublisherSpringer
Volume488
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

ConferenceInternational Conference on Computational Science and Technology 2017
Abbreviated titleICCST 2017
Country/TerritoryMalaysia
CityKuala Lumpur
Period29/11/1730/11/17
Internet address

Keywords

  • Aspect identification
  • Latent-Dirichlet Allocation
  • Sentiment analysis
  • Topic modeling

Cite this