Modeling Multimodal Behaviors from Speech Prosody

Yu Ding; Catherine Pelachaud; Thierry Artières

doi:10.1007/978-3-642-40415-3_19

Conference paper. Year: 2013

Yu Ding

Role: Author

Multimédia

Département Traitement du Signal et des Images

Catherine Pelachaud

Role: Author
PersonId : 754946
IdHAL : liu-yang
IdRef : 276267176

Multimédia

Département Traitement du Signal et des Images

Thierry Artières

Role: Author

Machine Learning and Information Access

Abstract

Head and eyebrow movements are an important means of communication and are highly synchronized with speech prosody. Endowing virtual agents with synchronized verbal and nonverbal behavior enhances their communicative performance. In this paper, we propose an animation model for virtual agents based on a statistical model linking speech prosody and facial movement. A fully parameterized Hidden Markov Model is proposed, first to capture the tight relationship between speech and the facial movements of a human face extracted from a video corpus, and then to automatically drive the virtual agent's behaviors from speech signals. The correlation between head and eyebrow movements is also taken into account when building the model. Subjective and objective evaluations were conducted to validate the model.
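As a rough illustration of the idea described in the abstract, the sketch below shows one way the parameters of a Hidden Markov Model can be made functions of prosody. It is a toy under stated assumptions, not the authors' formulation: the two states, the linear dependence of emission means and transition scores on a prosody vector, the greedy state selection, and every numeric value are hypothetical and chosen only to keep the example small.

```python
import math

# Hypothetical 2-state model of one motion channel (e.g. head pitch, in degrees).
# Every number below is illustrative and NOT taken from the paper.
# A prosody frame is theta = (normalized pitch, normalized energy).
MEAN_W = [(0.0, 0.0), (4.0, 2.0)]   # per-state weights applied to theta
MEAN_B = [0.0, 1.0]                  # per-state bias of the emission mean
# TRANS_W[i][j] weights theta when scoring a move from state i to state j.
TRANS_W = [[(0.0, 0.0), (3.0, 3.0)],
           [(0.0, 0.0), (3.0, 3.0)]]
TRANS_B = [[1.0, -1.0],
           [-1.0, 1.0]]


def dot(w, theta):
    """Inner product of a weight tuple and a prosody frame."""
    return sum(wi * ti for wi, ti in zip(w, theta))


def emission_mean(state, theta):
    """Prosody-dependent emission mean: mu_state(theta) = w . theta + b."""
    return dot(MEAN_W[state], theta) + MEAN_B[state]


def transition_probs(state, theta):
    """Prosody-dependent transition row, computed as a softmax over scores."""
    scores = [dot(TRANS_W[state][j], theta) + TRANS_B[state][j]
              for j in range(len(MEAN_B))]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def synthesize(prosody, start_state=0):
    """Greedy synthesis: follow the most probable transition at each frame
    and output the emission mean of the state reached."""
    state = start_state
    motion = []
    for theta in prosody:
        probs = transition_probs(state, theta)
        state = max(range(len(probs)), key=probs.__getitem__)
        motion.append(emission_mean(state, theta))
    return motion


if __name__ == "__main__":
    # Quiet frame, stressed frame, quiet frame.
    print(synthesize([(0.0, 0.0), (1.0, 1.0), (0.0, 0.0)]))
```

A full system would typically learn these parameters from the video corpus and replace the greedy step with proper inference over state sequences; the sketch only shows how prosody can modulate both which state is active and what motion value that state emits.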

Keywords

virtual agent; speech to motion synthesis; head motion synthesis; eyebrow motion synthesis; Hidden Markov model; speech driven

Domains

Multimedia [cs.MM]; Human-Computer Interaction [cs.HC]; Machine Learning [cs.LG]; Image Synthesis and Virtual Reality [cs.GR]; Machine Learning [stat.ML]

https://telecom-paris.hal.science/hal-02412034

Submitted on: Sunday, December 15, 2019, 12:42:39

Last modified on: Monday, October 9, 2023, 12:49:39

Dates and versions

hal-02412034 , version 1 (15-12-2019)

Identifiers

HAL Id : hal-02412034 , version 1
DOI : 10.1007/978-3-642-40415-3_19

Cite

Yu Ding, Catherine Pelachaud, Thierry Artières. Modeling Multimodal Behaviors from Speech Prosody. IVA 2013 - 13th International Conference on Intelligent Virtual Agents, Aug 2013, Edinburgh, United Kingdom. pp.217-228, ⟨10.1007/978-3-642-40415-3_19⟩. ⟨hal-02412034⟩

Collections

INSTITUT-TELECOM UPMC CNRS PARISTECH LIP6 SORBONNE-UNIVERSITE LTCI IDS MM SU-SCIENCES
