Recognizing emotions in dialogues with disfluencies and non-verbal vocalisations

Leimin Tian, Catherine Lai, Johanna D. Moore

Research output: Chapter in Book/Report/Conference proceedingConference PaperOther

Abstract

We investigate the usefulness of DISfluencies and Non-verbal Vocalisations (DIS-NV) for recognizing human emotions in dialogues. The proposed features measure filled pauses, fillers, stutters, laughter, and breath in utterances. The predictiveness of DISNV features is compared with lexical features and state-of-the-art low-level acoustic features. Our experimental results show that using DIS-NV features alone is not as predictive as using lexical or acoustic features. However, adding them to lexical or acoustic feature set yields improvement compared to using lexical or acoustic features alone. This indicates that disfluencies and non-verbal vocalisations provide useful information overlooked by the other two types of features for emotion recognition.
Original languageEnglish
Title of host publicationProceedings of the 4th Interdisciplinary Workshop on Laughter and Other Non-verbal Vocalisations in Speech 2015
Subtitle of host publication14–15 April 2015
EditorsKhiet Truong, Dirk Heylen, Jürgen Trouvain, Nick Campbell
Place of PublicationPiscataway NJ USA
PublisherIEEE, Institute of Electrical and Electronics Engineers
Pages39-41
Number of pages3
Publication statusPublished - 2015
Externally publishedYes
EventInterdisciplinary Workshop on Laughter and Other Non-verbal Vocalisations in Speech 2015 - Enschede, Netherlands
Duration: 14 Apr 201515 Apr 2015
Conference number: 4th
https://laughterworkshop2015.wordpress.com/

Conference

ConferenceInterdisciplinary Workshop on Laughter and Other Non-verbal Vocalisations in Speech 2015
Country/TerritoryNetherlands
CityEnschede
Period14/04/1515/04/15
Internet address

Keywords

  • emotion recognition
  • dialogue
  • disfluency
  • speech processing
  • HCI

Cite this