Artículo
A semi-supervised incremental algorithm to automatically formulate topical queries
Fecha de publicación:
05/2009
Editorial:
Elsevier Science Inc
Revista:
Information Sciences
ISSN:
0020-0255
Idioma:
Inglés
Tipo de recurso:
Artículo publicado
Clasificación temática:
Resumen
The quality of the material collected by a context-based Web search systems is highly dependant on the vocabulary used to generate the search queries. This paper proposes to apply a semi-supervised algorithm to incrementally learn terms that can help bridge the terminology gap existing between the user's information needs and the relevant documents' vocabulary. The learning strategy uses an incrementally-retrieved, topic-dependent selection of Web documents for term-weight reinforcement reflecting the aptness of the terms in describing and discriminating the topic of the user context. The new algorithm learns new descriptors by searching for terms that tend to occur often in relevant documents, and learns good discriminators by identifying terms that tend to occur only in the context of the given topic. The enriched vocabulary allows the formulation of search queries that are more effective than those queries generated directly using terms from the initial topic description. An evaluation on a large collection of topics using a standard and two ad-hoc performance evaluation metrics suggests that the proposed technique is superior to a baseline and other existing query reformulation techniques.
Palabras clave:
Context
,
Query Formulation
,
Topical Queries
,
Web Search
Archivos asociados
Licencia
Identificadores
Colecciones
Articulos(CCT - BAHIA BLANCA)
Articulos de CTRO.CIENTIFICO TECNOL.CONICET - BAHIA BLANCA
Articulos de CTRO.CIENTIFICO TECNOL.CONICET - BAHIA BLANCA
Citación
Lorenzetti, Carlos Martin; Maguitman, Ana Gabriela; A semi-supervised incremental algorithm to automatically formulate topical queries; Elsevier Science Inc; Information Sciences; 179; 12; 5-2009; 1881-1892
Compartir
Altmétricas