Mostrar el registro sencillo del ítem

dc.contributor.author
Ribeiro, Bruno R.  
dc.contributor.author
Velazco, Santiago José Elías  
dc.contributor.author
Guidoni Martins, Karlo  
dc.contributor.author
Tessarolo, Geiziane  
dc.contributor.author
Jardim, Lucas  
dc.contributor.author
Bachman, Steven P.  
dc.contributor.author
Loyola, Rafael  
dc.date.available
2023-09-27T15:34:37Z  
dc.date.issued
2022-04  
dc.identifier.citation
Ribeiro, Bruno R.; Velazco, Santiago José Elías; Guidoni Martins, Karlo; Tessarolo, Geiziane; Jardim, Lucas; et al.; bdc: A toolkit for standardizing, integrating and cleaning biodiversity data; John Wiley & Sons; Methods in Ecology and Evolution; 13; 2; 4-2022; 1421-1428  
dc.identifier.uri
http://hdl.handle.net/11336/213254  
dc.description.abstract
The increase in online and openly accessible biodiversity databases provides a vast and invaluable resource to support research and policy. However, without scrutiny, errors in primary species occurrence data can lead to erroneous results and misleading information.Here, we introduce the Biodiversity Data Cleaning (bdc), an R package to address quality issues and improve the fitness-for-use of biodiversity datasets. The bdc package brings together several aspects of biodiversity data cleaning in one place. It is organized in thematic modules related to different biodiversity dimensions, including (a) Merge datasets: standardization and integration of different datasets; (b) Pre-filter: flagging and removal of invalid or non-interpretable information, followed by data amendments; (c) Taxonomy: cleaning, parsing and harmonization of scientific names from several taxonomic groups against taxonomic databases locally stored through the application of exact and partial matching algorithms; (d) Space: flagging of erroneous, suspect and low-precision geographic coordinates; and (e) Time: flagging and, whenever possible, correction of inconsistent collection date. In addition, the package contains features to visualize, document and report data quality?which is essential for making data quality assessment transparent and reproducible. The modules illustrated, and functions within, were linked to form a proposed reproducible workflow that can also integrate functions from other R packages.We demonstrated the bdc package´s applicability in cleaning more than 30 million occurrence records for terrestrial plant species in Brazil. We found that around one-fifth of the original datasets hold the standard quality requirements.Compared to other available R packages, the main strengths of the bdc package are that it brings together available tools?and a series of new ones?to assess the quality of different dimensions of biodiversity data into a single and flexible toolkit. The functions can be applied to many taxonomic groups, datasets (including regional or local repositories), countries, or world-wide. We hope the bdc package can facilitate the data cleaning process and catalyse improvements to allow the wise and efficient use of primary biodiversity data.  
dc.format
application/pdf  
dc.language.iso
eng  
dc.publisher
John Wiley & Sons  
dc.rights
info:eu-repo/semantics/restrictedAccess  
dc.rights.uri
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/  
dc.subject
big data  
dc.subject
biodiversity  
dc.subject
data cleaning  
dc.subject
data quality  
dc.subject
fitness-for-use  
dc.subject
GBIF  
dc.subject
plants  
dc.subject
taxonomy  
dc.subject.classification
Otras Ciencias Biológicas  
dc.subject.classification
Ciencias Biológicas  
dc.subject.classification
CIENCIAS NATURALES Y EXACTAS  
dc.title
bdc: A toolkit for standardizing, integrating and cleaning biodiversity data  
dc.type
info:eu-repo/semantics/article  
dc.type
info:ar-repo/semantics/artículo  
dc.type
info:eu-repo/semantics/publishedVersion  
dc.date.updated
2023-07-07T22:04:14Z  
dc.identifier.eissn
2041-210X  
dc.journal.volume
13  
dc.journal.number
2  
dc.journal.pagination
1421-1428  
dc.journal.pais
Estados Unidos  
dc.journal.ciudad
Nueva York  
dc.description.fil
Fil: Ribeiro, Bruno R.. Universidade Federal de Goiás; Brasil  
dc.description.fil
Fil: Velazco, Santiago José Elías. Universidade Federal da Integração Latino-Americana; Brasil. University of California; Estados Unidos. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Nordeste. Instituto de Biología Subtropical. Instituto de Biología Subtropical - Nodo Puerto Iguazú | Universidad Nacional de Misiones. Instituto de Biología Subtropical. Instituto de Biología Subtropical - Nodo Puerto Iguazú; Argentina  
dc.description.fil
Fil: Guidoni Martins, Karlo. Universidade Federal de Goiás; Brasil  
dc.description.fil
Fil: Tessarolo, Geiziane. Universidade Federal de Goiás; Brasil  
dc.description.fil
Fil: Jardim, Lucas. Universidade Federal de Goiás; Brasil  
dc.description.fil
Fil: Bachman, Steven P.. Royal Botanic Gardens; Reino Unido  
dc.description.fil
Fil: Loyola, Rafael. Universidade Federal de Goiás; Brasil. International Institute for Sustainability; Brasil  
dc.journal.title
Methods in Ecology and Evolution  
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/url/https://onlinelibrary.wiley.com/doi/10.1111/2041-210X.13868  
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/doi/https://doi.org/10.1111/2041-210X.13868