Repositorio Institucional
Repositorio Institucional
CONICET Digital
  • Inicio
  • EXPLORAR
    • AUTORES
    • DISCIPLINAS
    • COMUNIDADES
  • Estadísticas
  • Novedades
    • Noticias
    • Boletines
  • Ayuda
    • General
    • Datos de investigación
  • Acerca de
    • CONICET Digital
    • Equipo
    • Red Federal
  • Contacto
JavaScript is disabled for your browser. Some features of this site may not work without it.
  • INFORMACIÓN GENERAL
  • RESUMEN
  • ESTADISTICAS
 
Capítulo de Libro

Random Forest for Audio-Visual Speech Recognition

Título del libro: Horizons in Computer Science Research

Sad, Gonzalo DanielIcon ; Terissi, Lucas DanielIcon ; Gómez, Juan Carlos
Otros responsables: Clary, Thomas S.
Fecha de publicación: 2020
Editorial: Nova Science Publishers
ISBN: 978-1-53618-359-7
Idioma: Inglés
Clasificación temática:
Otras Ciencias de la Computación e Información

Resumen

In this chapter, a review of different methods for Audio-Visual Speech Recognition (AVSR) using Random Forest, is presented. First, a strategy based on the combination of Wavelet multiresolution analysis and Random Forest is proposed. The temporal evolution of the input speech data is represented by a set of wavelet-based features. Then, a Random Forest classifier is employed to carry out the speech recognition task. Second, a novel scheme based on the combination in a cascade wise of two classifiers, Random Forest and Complementary Random Forest, is proposed. Different from Random Forest which is trained with instances of each particular class, the Complementary Random Forest is trained with instances of all the remaining classes. The performance of the proposed speech recognition methods is evaluated in different scenarios, namely, considering only acoustic information, only visual information (lip-reading), and fused audiovisual information. These evaluations are carried out over three different audiovisual databases, two of them public ones and the remaining one compiled by the authors of this chapter. Experimental results show that a good performance is achieved with the proposed methods over the three databases and for the different kinds of input information being considered.
Palabras clave: SPEECH RECOGNITION , AUDIO-VISUAL , RANDOM FOREST
Ver el registro completo
 
Archivos asociados
Tamaño: 382.0Kb
Formato: PDF
.
Solicitar
Licencia
info:eu-repo/semantics/restrictedAccess Excepto donde se diga explícitamente, este item se publica bajo la siguiente descripción: Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Unported (CC BY-NC-SA 2.5)
Identificadores
URI: http://hdl.handle.net/11336/201513
URL: https://novapublishers.com/shop/horizons-in-computer-science-research-volume-19/
Colecciones
Capítulos de libros(CIFASIS)
Capítulos de libros de CENTRO INT.FRANCO ARG.D/CS D/L/INF.Y SISTEM.
Citación
Sad, Gonzalo Daniel; Terissi, Lucas Daniel; Gómez, Juan Carlos; Random Forest for Audio-Visual Speech Recognition; Nova Science Publishers; 19; 2020; 175-221
Compartir

Enviar por e-mail
Separar cada destinatario (hasta 5) con punto y coma.
  • Facebook
  • X Conicet Digital
  • Instagram
  • YouTube
  • Sound Cloud
  • LinkedIn

Los contenidos del CONICET están licenciados bajo Creative Commons Reconocimiento 2.5 Argentina License

https://www.conicet.gov.ar/ - CONICET

Inicio

Explorar

  • Autores
  • Disciplinas
  • Comunidades

Estadísticas

Novedades

  • Noticias
  • Boletines

Ayuda

Acerca de

  • CONICET Digital
  • Equipo
  • Red Federal

Contacto

Godoy Cruz 2290 (C1425FQB) CABA – República Argentina – Tel: +5411 4899-5400 repositorio@conicet.gov.ar
TÉRMINOS Y CONDICIONES