Mostrar el registro sencillo del ítem
dc.contributor.author
Da Rocha Araujo, Leonardo Henrique
dc.contributor.author
Rodríguez, Guillermo Horacio
dc.contributor.author
Vidal, Santiago Agustín
dc.contributor.author
Marcos, Claudia Andrea
dc.contributor.author
Pereira Dos Santos, Rodrigo
dc.date.available
2023-10-12T19:57:19Z
dc.date.issued
2022-02
dc.identifier.citation
Da Rocha Araujo, Leonardo Henrique; Rodríguez, Guillermo Horacio; Vidal, Santiago Agustín; Marcos, Claudia Andrea; Pereira Dos Santos, Rodrigo; Empirical Analysis on OpenAPI Topic Exploration and Discovery to Support the Developer Community; Slovak Acad Sciences Inst Informatics; Computing And Informatics; 40; 6; 2-2022; 1345-1369
dc.identifier.issn
1335-9150
dc.identifier.uri
http://hdl.handle.net/11336/215064
dc.description.abstract
OpenAPI has become a dominant standard for documentation in the service-oriented software industry. OpenAPI is used in many analysis and reengineering approaches for RESTful service and microservice-based systems. An OpenAPI document has several components that are usually filled by humans using natural language (e.g. description of a certain functionality). Thus, subjectivity may lead to inconsistencies and ambiguities. Understanding what an API does is a challenging question. As a consequence, this issue could hinder developers from identifying the functionality of APIs, after reading all its components. Along this line, we argue that developers will be provided with supportive tools to find those APIs that better suit their needs. In this paper, we propose a step towards creating these kinds of tools by empirically analyzing a set of 2 000 OpenAPI documents with the goal of extracting the main topics of an API using three topic modeling algorithms. To address this issue, we focus on three tasks: i) determine which component of an OpenAPI document provides the most meaningful information, ii) compare three state-of-the-art topic modeling algorithms, and iii) determine the optimal number of topics to represent an API. Our findings show that the best results could be obtained from the Description component by using the Non-negative Matrix Factorization (NMF) or Latent Semantic Indexing (LSI) algorithms. To help developers find services in the OpenAPI directory, we also propose a prototype tool to explore the OpenAPI documents and analyze extracted topics to assess if the APIs meet developers needs.
dc.format
application/pdf
dc.language.iso
eng
dc.publisher
Slovak Acad Sciences Inst Informatics
dc.rights
info:eu-repo/semantics/openAccess
dc.rights.uri
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
dc.subject
APIS
dc.subject
DOCUMENTATION
dc.subject
OPENAPI
dc.subject
RESTFUL WEB SERVICES
dc.subject
TOPIC COHERENCE
dc.subject
TOPIC MODELING
dc.subject.classification
Ciencias de la Computación
dc.subject.classification
Ciencias de la Computación e Información
dc.subject.classification
CIENCIAS NATURALES Y EXACTAS
dc.title
Empirical Analysis on OpenAPI Topic Exploration and Discovery to Support the Developer Community
dc.type
info:eu-repo/semantics/article
dc.type
info:ar-repo/semantics/artículo
dc.type
info:eu-repo/semantics/publishedVersion
dc.date.updated
2023-07-07T22:25:03Z
dc.journal.volume
40
dc.journal.number
6
dc.journal.pagination
1345-1369
dc.journal.pais
Eslovaquia
dc.journal.ciudad
Bratislava
dc.description.fil
Fil: Da Rocha Araujo, Leonardo Henrique. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Tandil. Instituto Superior de Ingeniería del Software. Universidad Nacional del Centro de la Provincia de Buenos Aires. Instituto Superior de Ingeniería del Software; Argentina
dc.description.fil
Fil: Rodríguez, Guillermo Horacio. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Tandil. Instituto Superior de Ingeniería del Software. Universidad Nacional del Centro de la Provincia de Buenos Aires. Instituto Superior de Ingeniería del Software; Argentina
dc.description.fil
Fil: Vidal, Santiago Agustín. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Tandil. Instituto Superior de Ingeniería del Software. Universidad Nacional del Centro de la Provincia de Buenos Aires. Instituto Superior de Ingeniería del Software; Argentina
dc.description.fil
Fil: Marcos, Claudia Andrea. Universidad Nacional del Centro de la Provincia de Buenos Aires. Facultad de Ciencias Exactas. Instituto de Sistemas Tandil; Argentina
dc.description.fil
Fil: Pereira Dos Santos, Rodrigo. Universidade Federal do Estado do Rio de Janeiro; Brasil
dc.journal.title
Computing And Informatics
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/url/https://www.cai.sk/ojs/index.php/cai/article/view/2021_6_1345
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/doi/https://doi.org/10.31577/cai_2021_6_1345
Archivos asociados