dc.contributor.author
Terissi, Lucas Daniel  
dc.contributor.author
Cerda, Mauricio  
dc.contributor.author
Gómez, Juan Carlos  
dc.contributor.author
Hitschfeld-Kahler, Nancy  
dc.contributor.author
Girau, Bernard  
dc.date.available
2015-07-23T14:33:42Z  
dc.date.issued
2013-02  
dc.identifier.citation
Terissi, Lucas Daniel; Cerda, Mauricio; Gómez, Juan Carlos; Hitschfeld-Kahler, Nancy; Girau, Bernard; A comprehensive system for facial animation of generic 3D head models driven by speech; Springer; EURASIP Journal on Audio, Speech and Music Processing; 2013; 5; 2-2013; 1-37  
dc.identifier.issn
1687-4722  
dc.identifier.uri
http://hdl.handle.net/11336/1438  
dc.description.abstract
A comprehensive system for facial animation of generic 3D head models driven by speech is presented in this article. In the training stage, audio-visual information is extracted from audio-visual training data, and then used to compute the parameters of a single joint audio-visual hidden Markov model (AV-HMM). In contrast to most of the methods in the literature, the proposed approach does not require segmentation/classification processing stages of the audio-visual data, avoiding the error propagation related to these procedures. The trained AV-HMM provides a compact representation of the audio-visual data, without the need for phoneme (word) segmentation, which makes it adaptable to different languages. Visual features are estimated from the speech signal based on the inversion of the AV-HMM. The estimated visual speech features are used to animate a simple face model. The animation of a more complex head model is then obtained by automatically mapping the deformation of the simple model to it, using a small number of control points for the interpolation. The proposed algorithm allows the animation of 3D head models of arbitrary complexity through a simple setup procedure. The resulting animation is evaluated in terms of intelligibility of visual speech through perceptual tests, showing promising performance. The computational complexity of the proposed system is analyzed, showing the feasibility of its real-time implementation.  
dc.format
application/pdf  
dc.language.iso
eng  
dc.publisher
Springer  
dc.rights
info:eu-repo/semantics/openAccess  
dc.rights.uri
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/  
dc.subject
Facial Animation  
dc.subject
Hidden Markov Models  
dc.subject
Audio Visual Speech Processing  
dc.subject.classification
Other Electrical Engineering, Electronic Engineering and Information Engineering  
dc.subject.classification
Electrical Engineering, Electronic Engineering and Information Engineering  
dc.subject.classification
ENGINEERING AND TECHNOLOGY  
dc.subject.classification
Automatic Control and Robotics  
dc.subject.classification
Electrical Engineering, Electronic Engineering and Information Engineering  
dc.subject.classification
ENGINEERING AND TECHNOLOGY  
dc.title
A comprehensive system for facial animation of generic 3D head models driven by speech  
dc.type
info:eu-repo/semantics/article  
dc.type
info:ar-repo/semantics/artículo  
dc.type
info:eu-repo/semantics/publishedVersion  
dc.date.updated
2016-03-30 10:35:44.97925-03  
dc.journal.volume
2013  
dc.journal.number
5  
dc.journal.pagination
1-37  
dc.journal.pais
Germany  
dc.journal.ciudad
Berlin  
dc.description.fil
Fil: Terissi, Lucas Daniel. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico - CONICET - Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y Sistemas; Argentina;  
dc.description.fil
Fil: Cerda, Mauricio. Universidad Austral de Chile; Chile;  
dc.description.fil
Fil: Gómez, Juan Carlos. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico - CONICET - Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y Sistemas; Argentina;  
dc.description.fil
Fil: Hitschfeld-Kahler, Nancy. Universidad de Chile. Departamento de Ciencias de la Computación; Chile;  
dc.description.fil
Fil: Girau, Bernard. Loria - INRIA Nancy Grand Est. Cortex Team. Vandoeuvre-lès-Nancy; France;  
dc.journal.title
EURASIP Journal on Audio, Speech and Music Processing  
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/url/http://asmp.eurasipjournals.com/content/2013/1/5