Fusion of face and voice for automatic human age estimation

Xin Geng, Erqing Fang, Kate Smith-Miles

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

Abstract

People usually estimate other people s age based on multiple aging traits, such as face, voice, gait, hair, and body build. However, almost all existing work on automatic age estimation is based on unimodal traits. This paper presents one of the first attempts toward multimodal automatic age estimation, which is based on the fusion of face and voice. The age estimators based on face and voice are all trained by the IIS-LLD algorithm, and the fusion is at the score level through weighted sum. Experimental results show that the fusion of face and voice can achieve better age estimation performance than that based on face only.
Original languageEnglish
Title of host publicationProceedings of the 3rd International Conference on Computer Design and Applications
EditorsZ X Yin
Place of PublicationSingapore
PublisherIEEE, Institute of Electrical and Electronics Engineers
Pages311 - 314
Number of pages4
Volume4
ISBN (Print)9781424494897
Publication statusPublished - 2011
EventInternational Conference on Computer Design and Application (ICCDA 2011) - Shanxi Normal University, Xi'an, China
Duration: 27 May 201127 May 2011
Conference number: 3rd

Conference

ConferenceInternational Conference on Computer Design and Application (ICCDA 2011)
Abbreviated titleICCDA 2011
CountryChina
CityXi'an
Period27/05/1127/05/11

Cite this

Geng, X., Fang, E., & Smith-Miles, K. (2011). Fusion of face and voice for automatic human age estimation. In Z. X. Yin (Ed.), Proceedings of the 3rd International Conference on Computer Design and Applications (Vol. 4, pp. 311 - 314). Singapore: IEEE, Institute of Electrical and Electronics Engineers.
Geng, Xin ; Fang, Erqing ; Smith-Miles, Kate. / Fusion of face and voice for automatic human age estimation. Proceedings of the 3rd International Conference on Computer Design and Applications. editor / Z X Yin. Vol. 4 Singapore : IEEE, Institute of Electrical and Electronics Engineers, 2011. pp. 311 - 314
@inproceedings{a308e65f971f468c8398c0a48b7c3e55,
title = "Fusion of face and voice for automatic human age estimation",
abstract = "People usually estimate other people s age based on multiple aging traits, such as face, voice, gait, hair, and body build. However, almost all existing work on automatic age estimation is based on unimodal traits. This paper presents one of the first attempts toward multimodal automatic age estimation, which is based on the fusion of face and voice. The age estimators based on face and voice are all trained by the IIS-LLD algorithm, and the fusion is at the score level through weighted sum. Experimental results show that the fusion of face and voice can achieve better age estimation performance than that based on face only.",
author = "Xin Geng and Erqing Fang and Kate Smith-Miles",
year = "2011",
language = "English",
isbn = "9781424494897",
volume = "4",
pages = "311 -- 314",
editor = "Yin, {Z X}",
booktitle = "Proceedings of the 3rd International Conference on Computer Design and Applications",
publisher = "IEEE, Institute of Electrical and Electronics Engineers",
address = "United States of America",

}

Geng, X, Fang, E & Smith-Miles, K 2011, Fusion of face and voice for automatic human age estimation. in ZX Yin (ed.), Proceedings of the 3rd International Conference on Computer Design and Applications. vol. 4, IEEE, Institute of Electrical and Electronics Engineers, Singapore, pp. 311 - 314, International Conference on Computer Design and Application (ICCDA 2011), Xi'an, China, 27/05/11.

Fusion of face and voice for automatic human age estimation. / Geng, Xin; Fang, Erqing; Smith-Miles, Kate.

Proceedings of the 3rd International Conference on Computer Design and Applications. ed. / Z X Yin. Vol. 4 Singapore : IEEE, Institute of Electrical and Electronics Engineers, 2011. p. 311 - 314.

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

TY - GEN

T1 - Fusion of face and voice for automatic human age estimation

AU - Geng, Xin

AU - Fang, Erqing

AU - Smith-Miles, Kate

PY - 2011

Y1 - 2011

N2 - People usually estimate other people s age based on multiple aging traits, such as face, voice, gait, hair, and body build. However, almost all existing work on automatic age estimation is based on unimodal traits. This paper presents one of the first attempts toward multimodal automatic age estimation, which is based on the fusion of face and voice. The age estimators based on face and voice are all trained by the IIS-LLD algorithm, and the fusion is at the score level through weighted sum. Experimental results show that the fusion of face and voice can achieve better age estimation performance than that based on face only.

AB - People usually estimate other people s age based on multiple aging traits, such as face, voice, gait, hair, and body build. However, almost all existing work on automatic age estimation is based on unimodal traits. This paper presents one of the first attempts toward multimodal automatic age estimation, which is based on the fusion of face and voice. The age estimators based on face and voice are all trained by the IIS-LLD algorithm, and the fusion is at the score level through weighted sum. Experimental results show that the fusion of face and voice can achieve better age estimation performance than that based on face only.

M3 - Conference Paper

SN - 9781424494897

VL - 4

SP - 311

EP - 314

BT - Proceedings of the 3rd International Conference on Computer Design and Applications

A2 - Yin, Z X

PB - IEEE, Institute of Electrical and Electronics Engineers

CY - Singapore

ER -

Geng X, Fang E, Smith-Miles K. Fusion of face and voice for automatic human age estimation. In Yin ZX, editor, Proceedings of the 3rd International Conference on Computer Design and Applications. Vol. 4. Singapore: IEEE, Institute of Electrical and Electronics Engineers. 2011. p. 311 - 314