Artículo
Information flow in Deep Restricted Boltzmann Machines: An analysis of mutual information between inputs and outputs
Fecha de publicación:
10/2022
Editorial:
Elsevier Science
Revista:
Neurocomputing
ISSN:
0925-2312
Idioma:
Inglés
Tipo de recurso:
Artículo publicado
Clasificación temática:
Resumen
Empirical evidence suggests the existence of an entangled relationship between the information flow from inputs features to hidden representations of a deep neural network and its ability to generalize from training samples to unobserved data. For instance, regularization techniques often used to control statistical generalization, are expected to impact this information flow. In this work, we study MI (mutual information) between inputs and representation outputs, and its relationship with various regularization methods commonly used in Restricted Boltzmann Machines (RBM) and their generalizations: Deep Belief Networks and Deep Boltzmann Machines. Our theoretical findings show the existence of fundamental connections between the hyperparameters associated with the regularization and the MI, including relevant practical ingredients such as: network dimension, matrix norms and dropout probability, which are well-known to influence the generalization ability of the network. These results are experimentally corroborated on various visual datasets.
Archivos asociados
Licencia
Identificadores
Colecciones
Articulos(CSC)
Articulos de CENTRO DE SIMULACION COMPUTACIONAL P/APLIC. TECNOLOGICAS
Articulos de CENTRO DE SIMULACION COMPUTACIONAL P/APLIC. TECNOLOGICAS
Citación
Vera, Matías Alejandro; Rey Vega, Leonardo Javier; Piantanida, Pablo; Information flow in Deep Restricted Boltzmann Machines: An analysis of mutual information between inputs and outputs; Elsevier Science; Neurocomputing; 507; 10-2022; 235-246
Compartir
Altmétricas