
dc.contributor.author
Grigera, Julián  
dc.contributor.author
Gardey, Juan Cruz  
dc.contributor.author
Garrido, Alejandra  
dc.contributor.author
Rossi, Gustavo Héctor  
dc.contributor.other
Domínguez Mayo, Francisco José  
dc.contributor.other
Marchiori, Massimo  
dc.contributor.other
Filipe, Joaquim  
dc.date.available
2025-02-25T15:03:20Z  
dc.date.issued
2021  
dc.identifier.citation
A Scoring Map Algorithm for Automatically Detecting Structural Similarity of DOM Elements; 17th International Conference on Web Information Systems and Technologies; Setúbal; Portugal; 2021; 174-185  
dc.identifier.isbn
978-989-758-536-4  
dc.identifier.uri
http://hdl.handle.net/11336/255175  
dc.description.abstract
Most documents in the WWW are generated from templates that represent user interface (UI) elements, and later filled with contents. In the field of information extraction, many approaches emerged to analyze the documents' structure, obtain similar features amongst them, and generate wrappers that are used to extract the raw contents from such documents. Therefore, most techniques documented in the literature are optimized to compare full documents, but there are other fields of applicability that require analyzing structural similarity on smaller UI components, like web augmentation or transcoding. In this paper we present two flexible algorithms to measure similarity between DOM Elements by using a mixed approach that considers both elements' location and inner structure. The proposed algorithms were used in the context of two projects: an approach for automatic usability refactoring, and a web accessibility helper. We also present a wrapper induction technique based on such algorithms. Additionally, we present a precision & recall evaluation of our algorithms as compared with other known approaches, applied to DOM elements of different sizes, but smaller than full scaled documents. The proposed algorithms run in linear time, so they are faster than most approaches that analyze structural similarity.  
dc.format
application/pdf  
dc.language.iso
eng  
dc.publisher
SciTePress  
dc.rights
info:eu-repo/semantics/openAccess  
dc.rights.uri
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/  
dc.subject
INFORMATION EXTRACTION  
dc.subject
WEB ADAPTATION  
dc.subject
REFACTORING FOR USABILITY  
dc.subject.classification
Computer Science  
dc.subject.classification
Computer and Information Sciences  
dc.subject.classification
NATURAL AND EXACT SCIENCES  
dc.title
A Scoring Map Algorithm for Automatically Detecting Structural Similarity of DOM Elements  
dc.type
info:eu-repo/semantics/publishedVersion  
dc.type
info:eu-repo/semantics/conferenceObject  
dc.type
info:ar-repo/semantics/documento de conferencia  
dc.date.updated
2022-11-09T16:53:09Z  
dc.journal.pagination
174-185  
dc.journal.pais
Portugal  
dc.journal.ciudad
Setúbal  
dc.description.fil
Fil: Grigera, Julián. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata; Argentina. Universidad Nacional de La Plata. Facultad de Informática. Laboratorio de Investigación y Formación en Informática Avanzada; Argentina  
dc.description.fil
Fil: Gardey, Juan Cruz. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata; Argentina. Universidad Nacional de La Plata. Facultad de Informática. Laboratorio de Investigación y Formación en Informática Avanzada; Argentina  
dc.description.fil
Fil: Garrido, Alejandra. Universidad Nacional de La Plata. Facultad de Informática. Laboratorio de Investigación y Formación en Informática Avanzada; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata; Argentina  
dc.description.fil
Fil: Rossi, Gustavo Héctor. Universidad Nacional de La Plata. Facultad de Informática. Laboratorio de Investigación y Formación en Informática Avanzada; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata; Argentina  
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/url/https://www.scitepress.org/Papers/2021/107163/107163.pdf  
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/url/https://dblp.org/rec/conf/webist/2021.html  
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/url/https://webist.scitevents.org/?y=2021  
dc.conicet.rol
Author  
dc.conicet.rol
Author  
dc.conicet.rol
Author  
dc.conicet.rol
Author  
dc.coverage
International  
dc.type.subtype
Conference  
dc.description.nombreEvento
17th International Conference on Web Information Systems and Technologies  
dc.date.evento
2021-10-26  
dc.description.ciudadEvento
Setúbal  
dc.description.paisEvento
Portugal  
dc.type.publicacion
Book  
dc.description.institucionOrganizadora
Polytechnic Institute of Setúbal  
dc.source.libro
Proceedings of the 17th International Conference on Web Information Systems and Technologies  
dc.date.eventoHasta
2021-10-28  
dc.type
Conference