Abstract
Quantifying behavioural changes in depression using affective computing techniques is the first step in developing an objective diagnostic aid, with clinical utility, for clinical depression. As part of the AVEC 2013 Challenge, we present a multimodal approach for the Depression Sub-Challenge using a GMM-UBM system with three different kernels for the audio subsystem and Space Time Interest Points in a Bag-of-Words approach for the vision subsystem. These are then fused at the feature level to form the combined AV system. Key results include the strong performance of acoustic audio features and the bag-of-words visual features in predicting an individual's level of depression using regression. Interestingly, in the context of the small amount of literature on the subject, is that our feature level multimodal fusion technique is able to outperform both the audio and visual challenge baselines. ©
Original language | English |
---|---|
Title of host publication | AVEC 2013 - Proceedings of the 3rd ACM International Workshop on Audio/Visual Emotion Challenge |
Publisher | Association for Computing Machinery (ACM) |
Pages | 11-20 |
Number of pages | 10 |
ISBN (Print) | 9781450323956 |
DOIs | |
Publication status | Published - 1 Jan 2013 |
Externally published | Yes |
Event | ACM International Workshop on Audio/Visual Emotion Challenge, AVEC 2013 - Barcelona, Spain Duration: 21 Oct 2013 → 21 Oct 2013 Conference number: 3rd |
Publication series
Name | AVEC 2013 - Proceedings of the 3rd ACM International Workshop on Audio/Visual Emotion Challenge |
---|
Conference
Conference | ACM International Workshop on Audio/Visual Emotion Challenge, AVEC 2013 |
---|---|
Country/Territory | Spain |
City | Barcelona |
Period | 21/10/13 → 21/10/13 |
Keywords
- Acoustic speech features
- Bag-of-words
- Behavioural signals
- Depression
- Multimodal fusion
- Multimodal technologies
- Pyramid of histogram of gradients
- Space-time interest points
- Support vector regression