Abstract
The design of robust new interfaces that process conversational speech is a challenging research direction largely because users' spoken language is so variable, which is especially true of children. The present research explored whether children's response latencies before initiating a conversational turn converge with those heard in the text-to-speech (TTS) of a computer partner. A study was conducted in which twenty-four 7-to-10-year-old children conversed with animated characters that responded with different types of TTS voices during an educational software application. Analyses confirmed that, while interacting with opposite TTS voices, children's average response latencies adapted 18.4% in the direction of their computer partner's speech. These adaptations were dynamic, bi-directional, and generalized across different types of users and TTS voices. The long-term goal of this research is the predictive modeling of human-computer communication patterns to guide the design of well synchronized, robust, and adaptive conversational interfaces.
Original language | English |
---|---|
Title of host publication | 7th International Conference on Spoken Language Processing, ICSLP 2002 |
Subtitle of host publication | Denver; United States; 16 September 2002 through 20 September 2002 |
Pages | 561-564 |
Number of pages | 4 |
Publication status | Published - 2002 |
Externally published | Yes |
Event | 7th International Conference on Spoken Language Processing, ICSLP 2002 - Denver, United States of America Duration: 16 Sept 2002 → 20 Sept 2002 |
Conference
Conference | 7th International Conference on Spoken Language Processing, ICSLP 2002 |
---|---|
Country/Territory | United States of America |
City | Denver |
Period | 16/09/02 → 20/09/02 |