dc.contributor.author
Grigera, Julián

dc.contributor.author
Gardey, Juan Cruz

dc.contributor.author
Garrido, Alejandra

dc.contributor.author
Rossi, Gustavo Héctor

dc.contributor.other
Domínguez Mayo, Francisco José
dc.contributor.other
Marchiori, Massimo
dc.contributor.other
Filipe, Joaquim
dc.date.available
2025-02-25T15:03:20Z
dc.date.issued
2021
dc.identifier.citation
A Scoring Map Algorithm for Automatically Detecting Structural Similarity of DOM Elements; 17th International Conference on Web Information Systems and Technologies; Setúbal; Portugal; 2021; 174-185
dc.identifier.isbn
978-989-758-536-4
dc.identifier.uri
http://hdl.handle.net/11336/255175
dc.description.abstract
Most documents in the WWW are generated from templates that represent user interface (UI) elements, and later filled with contents. In the field of information extraction, many approaches emerged to analyze the documents' structure, obtain similar features amongst them, and generate wrappers that are used to extract the raw contents from such documents. Therefore, most techniques documented in the literature are optimized to compare full documents, but there are other fields of applicability that require analyzing structural similarity on smaller UI components, like web augmentation or transcoding. In this paper we present two flexible algorithms to measure similarity between DOM Elements by using a mixed approach that considers both elements' location and inner structure. The proposed algorithms were used in the context of two projects: an approach for automatic usability refactoring, and a web accessibility helper. We also present a wrapper induction technique based on such algorithms. Additionally, we present a precision & recall evaluation of our algorithms as compared with other known approaches, applied to DOM elements of different sizes, but smaller than full scaled documents. The proposed algorithms run in linear time, so they are faster than most approaches that analyze structural similarity.
dc.format
application/pdf
dc.language.iso
eng
dc.publisher
SciTePress
dc.rights
info:eu-repo/semantics/openAccess
dc.rights.uri
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
dc.subject
INFORMATION EXTRACTION
dc.subject
WEB ADAPTATION
dc.subject
REFACTORING FOR USABILITY
dc.subject.classification
Computer Science

dc.subject.classification
Computer and Information Sciences

dc.subject.classification
NATURAL AND EXACT SCIENCES

dc.title
A Scoring Map Algorithm for Automatically Detecting Structural Similarity of DOM Elements
dc.type
info:eu-repo/semantics/publishedVersion
dc.type
info:eu-repo/semantics/conferenceObject
dc.type
info:ar-repo/semantics/documento de conferencia
dc.date.updated
2022-11-09T16:53:09Z
dc.journal.pagination
174-185
dc.journal.pais
Portugal

dc.journal.ciudad
Setúbal
dc.description.fil
Fil: Grigera, Julián. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata; Argentina. Universidad Nacional de La Plata. Facultad de Informática. Laboratorio de Investigación y Formación en Informática Avanzada; Argentina
dc.description.fil
Fil: Gardey, Juan Cruz. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata; Argentina. Universidad Nacional de La Plata. Facultad de Informática. Laboratorio de Investigación y Formación en Informática Avanzada; Argentina
dc.description.fil
Fil: Garrido, Alejandra. Universidad Nacional de La Plata. Facultad de Informática. Laboratorio de Investigación y Formación en Informática Avanzada; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata; Argentina
dc.description.fil
Fil: Rossi, Gustavo Héctor. Universidad Nacional de La Plata. Facultad de Informática. Laboratorio de Investigación y Formación en Informática Avanzada; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata; Argentina
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/url/https://www.scitepress.org/Papers/2021/107163/107163.pdf
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/url/https://dblp.org/rec/conf/webist/2021.html
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/url/https://webist.scitevents.org/?y=2021
dc.conicet.rol
Author

dc.conicet.rol
Author

dc.conicet.rol
Author

dc.conicet.rol
Author

dc.coverage
International
dc.type.subtype
Conference
dc.description.nombreEvento
17th International Conference on Web Information Systems and Technologies
dc.date.evento
2021-10-26
dc.description.ciudadEvento
Setúbal
dc.description.paisEvento
Portugal

dc.type.publicacion
Book
dc.description.institucionOrganizadora
Polytechnic Institute of Setubal
dc.source.libro
Proceedings of the 17th International Conference on Web Information Systems and Technologies
dc.date.eventoHasta
2021-10-28
dc.type
Conference