Mostrar el registro sencillo del ítem
dc.contributor.author
Uieda, Leonardo
dc.contributor.author
Soler, Santiago Rubén
dc.contributor.author
Rampin, Rémi
dc.contributor.author
Kemenade, Hugo van
dc.contributor.author
Turk, Matthew
dc.contributor.author
Shapero, Daniel
dc.contributor.author
Banihirwe, Anderson
dc.contributor.author
Leeman, John
dc.date.available
2022-05-06T13:43:43Z
dc.date.issued
2020-01
dc.identifier.citation
Uieda, Leonardo; Soler, Santiago Rubén; Rampin, Rémi; Kemenade, Hugo van; Turk, Matthew; et al.; Pooch: A friend to fetch your data files; Journal of Open Source Software; Journal of Open Source Software; 5; 45; 1-2020; 1-3
dc.identifier.issn
2475-9066
dc.identifier.uri
http://hdl.handle.net/11336/156774
dc.description.abstract
Scientific software is usually created to acquire, analyze, model, and visualize data. As such, many software libraries include sample datasets in their distributions for use in documentation, tests, benchmarks, and workshops. A common approach is to include smaller datasets in the GitHub repository directly and package them with the source and binary distributions (e.g., scikit-learn (Pedregosa et al., 2011) and scikit-image (Van der Walt et al., 2014) do this). As data files increase in size, it becomes unfeasible to store them in GitHub repositories. Thus, larger datasets require writing code to download the files from a remote server to the user’s computer. The same problem is faced by scientists using version control to manage their research projects. While downloading a data file over HTTPS can be done easily with modern Python libraries, it is not trivial to manage a set of files, keep them updated, and check for corruption. For example, scikit-learn (Pedregosa et al., 2011), Cartopy (Met Office, n.d.), and PyVista (Sullivan & Kaszynski, 2019) all include code dedicated to this particular task. Instead of scientists and library authors recreating the same code, it would be best to have a minimalistic and easy to set up tool for fetching and maintaining data files.
dc.format
application/pdf
dc.language.iso
eng
dc.publisher
Journal of Open Source Software
dc.rights
info:eu-repo/semantics/openAccess
dc.rights.uri
https://creativecommons.org/licenses/by/2.5/ar/
dc.subject
OPEN SOURCE
dc.subject
PYTHON
dc.subject
DATA
dc.subject
JOSS
dc.subject.classification
Otras Ciencias de la Computación e Información
dc.subject.classification
Ciencias de la Computación e Información
dc.subject.classification
CIENCIAS NATURALES Y EXACTAS
dc.title
Pooch: A friend to fetch your data files
dc.type
info:eu-repo/semantics/article
dc.type
info:ar-repo/semantics/artículo
dc.type
info:eu-repo/semantics/publishedVersion
dc.date.updated
2022-05-02T16:31:56Z
dc.journal.volume
5
dc.journal.number
45
dc.journal.pagination
1-3
dc.journal.pais
Estados Unidos
dc.description.fil
Fil: Uieda, Leonardo. University of Liverpool; Reino Unido
dc.description.fil
Fil: Soler, Santiago Rubén. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - San Juan; Argentina. Universidad Nacional de San Juan. Facultad de Ciencias Exactas, Físicas y Naturales. Instituto Geofísico Sismológico Volponi; Argentina
dc.description.fil
Fil: Rampin, Rémi. University of New York; Estados Unidos
dc.description.fil
Fil: Kemenade, Hugo van. No especifíca;
dc.description.fil
Fil: Turk, Matthew. University of Illinois. Urbana - Champaign; Estados Unidos
dc.description.fil
Fil: Shapero, Daniel. University of Washington; Estados Unidos
dc.description.fil
Fil: Banihirwe, Anderson. National Center for Atmospheric Research; Estados Unidos
dc.description.fil
Fil: Leeman, John. Leeman Geophysical; Estados Unidos
dc.journal.title
Journal of Open Source Software
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/url/https://joss.theoj.org/papers/10.21105/joss.01943
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/doi/http://dx.doi.org/10.21105/joss.01943
Archivos asociados