Artículo
Cross Domain Author Profiling Task in Spanish Language: An Experimental Study
Garciarena Ucelay, María José; Villegas, María Paula
; Cagnina, Leticia Cecilia
; Errecalde, Marcelo Luis
Fecha de publicación:
11/2015
Editorial:
Universidad Nacional de La Plata. Facultad de Informática
Revista:
Journal of Computer Science and Technology
ISSN:
1666-6046
e-ISSN:
1666-6038
Idioma:
Inglés
Tipo de recurso:
Artículo publicado
Clasificación temática:
Resumen
Author Profiling is the task of predicting characteristics of the author of a text, such as age, gender, personality, native language, etc. This is a task of growing importance due to the potential applications in security, crime detection and marketing, among others. An interesting point is to study the robustness of a classifier when it is trained with a data set and tested with others containing different characteristics. Commonly this is called cross domain experimentation. Although different cross domain studies have been done for data sets in English language, for Spanish it has recently begun. In this context, this work presents a study of cross domain classification for the author profiling task in Spanish. The experimental results showed that using corpora with different levels of formality we can obtain robust classifiers for the author profiling task in Spanish language.
Archivos asociados
Licencia
Identificadores
Colecciones
Articulos(CCT - SAN LUIS)
Articulos de CTRO.CIENTIFICO TECNOL.CONICET - SAN LUIS
Articulos de CTRO.CIENTIFICO TECNOL.CONICET - SAN LUIS
Citación
Garciarena Ucelay, María José; Villegas, María Paula; Cagnina, Leticia Cecilia; Errecalde, Marcelo Luis; Cross Domain Author Profiling Task in Spanish Language: An Experimental Study; Universidad Nacional de La Plata. Facultad de Informática; Journal of Computer Science and Technology; 15; 2; 11-2015; 122-128
Compartir