Artículo
High Class-Imbalance in pre-miRNA Prediction: A Novel Approach Based on deepSOM
Fecha de publicación:
11/2017
Editorial:
IEEE Computer Society
Revista:
Ieee-acm Transactions On Computational Biology And Bioinformatics
ISSN:
1545-5963
Idioma:
Inglés
Tipo de recurso:
Artículo publicado
Clasificación temática:
Resumen
The computational prediction of novel microRNA within a full genome involves identifying sequences having the highest chance of being a miRNA precursor (pre-miRNA). These sequences are usually named candidates to miRNA. The well-known pre-miRNAs are usually only a few in comparison to the hundreds of thousands of potential candidates to miRNA that have to be analyzed, which makes this task a high classimbalance classification problem. The classical way of approaching it has been training a binary classifier in a supervised manner, using well-known pre-miRNAs as positive class and artificially defining the negative class. However, although the selection of positive labeled examples is straightforward, it is very difficult to build a set of negative examples in order to obtain a good set of training samples for a supervised method. In this work, we propose a novel and effective way of approaching this problem using machine learning, without the definition of negative examples. The proposal is based on clustering unlabeled sequences of a genome together with well-known miRNA precursors for the organism under study, which allows for the quick identification of the best candidates to miRNA as those sequences clustered with known precursors. Furthermore, we propose a deep model to overcome the problem of having very few positive class labels. They are always maintained in the deep levels as positive class while less likely pre-miRNA sequences are filtered level after level. Our approach has been compared with other methods for pre-miRNAs prediction in several species, showing effective predictivity of novel miRNAs. Additionally, we will show that our approach has a lower training time and allows for a better graphical navegability and interpretation of the results. A web-demo interface to try deepSOM is available at http://fich.unl.edu.ar/sinc/web-demo/deepsom/.
Palabras clave:
Genomica
,
Micrornas
Archivos asociados
Licencia
Identificadores
Colecciones
Articulos(IMPAM)
Articulos de INSTITUTO DE INVESTIGACIONES EN MICROBIOLOGIA Y PARASITOLOGIA MEDICA
Articulos de INSTITUTO DE INVESTIGACIONES EN MICROBIOLOGIA Y PARASITOLOGIA MEDICA
Articulos(SINC(I))
Articulos de INST. DE INVESTIGACION EN SEÑALES, SISTEMAS E INTELIGENCIA COMPUTACIONAL
Articulos de INST. DE INVESTIGACION EN SEÑALES, SISTEMAS E INTELIGENCIA COMPUTACIONAL
Citación
Stegmayer, Georgina; Yones, Cristian Ariel; Kamenetzky, Laura; Milone, Diego Humberto; High Class-Imbalance in pre-miRNA Prediction: A Novel Approach Based on deepSOM; IEEE Computer Society; Ieee-acm Transactions On Computational Biology And Bioinformatics; 14; 6; 11-2017; 1316-1326
Compartir
Altmétricas