Article
How Many Clusters: A Validation Index for Arbitrary-Shaped Clusters
Publication date:
04/2013
Publisher:
IEEE Computer Society
Journal:
IEEE/ACM Transactions on Computational Biology and Bioinformatics
ISSN:
1545-5963
Language:
English
Resource type:
Published article
Abstract
Clustering validation indexes are intended to assess the goodness of clustering results. Many methods used to estimate the number of clusters rely on a validation index as a key element to find the correct answer. This paper presents a new validation index based on graph concepts, which has been designed to find arbitrary-shaped clusters by exploiting the spatial layout of the patterns and their clustering label. This new clustering index is combined with a solid statistical detection framework, the Gap Statistic. The resulting method is able to find the right number of arbitrary-shaped clusters in diverse situations, as we show with examples where this information is available. A comparison with several relevant validation methods is carried out using artificial and gene expression datasets. The results are very encouraging, showing that the underlying structure in the data can be more accurately detected with the new clustering index. Our gene expression data results also indicate that this new index is stable under perturbation of the input data.
Keywords:
CLUSTERING, GENOMIC DATA, VALIDATION INDEX
Collections
Articles (CIFASIS)
Articles of CENTRO INT.FRANCO ARG.D/CS D/L/INF.Y SISTEM.
Citation
Baya, Ariel Emilio; Granitto, Pablo Miguel; How Many Clusters: A Validation Index for Arbitrary-Shaped Clusters; IEEE Computer Society; IEEE/ACM Transactions on Computational Biology and Bioinformatics; 10; 2; 4-2013; 401-414