Artículo
Quality flaw prediction in spanish Wikipedia: A case of study with verifiability flaws
Ferretti, Edgardo; Cagnina, Leticia Cecilia
; Paiz, Viviana; Delle Donne, Sebastián; Zacagnini, Rodrigo; Errecalde, Marcelo Luis
Fecha de publicación:
11/2018
Editorial:
Pergamon-Elsevier Science Ltd
Revista:
Information Processing & Management
ISSN:
0306-4573
e-ISSN:
1873-5371
Idioma:
Inglés
Tipo de recurso:
Artículo publicado
Clasificación temática:
Resumen
In this work, we present the first quality flaw prediction study for articles containing the two most frequent verifiability flaws in Spanish Wikipedia: articles which do not cite any references or sources at all (denominated Unreferenced) and articles that need additional citations for verification (so-called Refimprove). Based on the underlying characteristics of each flaw, different state-of-the-art approaches were evaluated. For articles not citing any references, a well-established rule-based approach was evaluated and interesting findings show that some of them suffer from Refimprove flaw instead. Likewise, for articles that need additional citations for verification, the well-known PU learning and one-class classification approaches were evaluated. Besides, new methods were compared and a new feature was also proposed to model this latter flaw. The results showed that new methods such as under-bagged decision trees with sum or majority voting rules, biased-SVM, and centroid-based balanced SVM, perform best in comparison with the ones previously published.
Archivos asociados
Licencia
Identificadores
Colecciones
Articulos(CCT - SAN LUIS)
Articulos de CTRO.CIENTIFICO TECNOL.CONICET - SAN LUIS
Articulos de CTRO.CIENTIFICO TECNOL.CONICET - SAN LUIS
Citación
Ferretti, Edgardo; Cagnina, Leticia Cecilia; Paiz, Viviana; Delle Donne, Sebastián; Zacagnini, Rodrigo; et al.; Quality flaw prediction in spanish Wikipedia: A case of study with verifiability flaws; Pergamon-Elsevier Science Ltd; Information Processing & Management; 54; 6; 11-2018; 1169-1181
Compartir
Altmétricas