Repositorio Institucional
Repositorio Institucional
CONICET Digital
  • Inicio
  • EXPLORAR
    • AUTORES
    • DISCIPLINAS
    • COMUNIDADES
  • Estadísticas
  • Novedades
    • Noticias
    • Boletines
  • Ayuda
    • General
    • Datos de investigación
  • Acerca de
    • CONICET Digital
    • Equipo
    • Red Federal
  • Contacto
JavaScript is disabled for your browser. Some features of this site may not work without it.
  • INFORMACIÓN GENERAL
  • RESUMEN
  • ESTADISTICAS
 
Artículo

Preserving accuracy in GenBank

Bidartondo, Martin I.; Bruns, Thomas D.; Blackwell, Meredith; Edwards, Ivan; Taylor, Andy F. S.; Bianchinotti, Maria VirginiaIcon ; Padamsee, Mahajabeen; Callac, Philippe; Lima, Nelson; White, Merlin M.; Barreau Daly, Camila; Juncai, M. A.; Buyck, Bart; Rabeler, Richard K.; Liles, Mark R.; Estes, Dwayne; Carter, Richard; Herr Jr., J. M.; Chandler, Gregory; Kerekes, Jennifer; Cruse Sanders, Jennifer; Galán Marquez, R.; Horak, Egon; Fitzsimons, Michael; Döering, Heidi; Yao, Su; Hynson, Nicole; Ryberg, Martin; Arnold, A. E.; Hughes, Karen
Fecha de publicación: 21/03/2008
Editorial: American Association for the Advancement of Science
Revista: Science
ISSN: 0036-8075
Idioma: Inglés
Tipo de recurso: Artículo publicado
Clasificación temática:
Otras Ciencias Biológicas

Resumen

GenBank, the public repository for nucleotide and protein sequences, is a critical resource for molecular biology, evolutionary biology, and ecology. While some attention has been drawn to sequence errors, common annotation errors also reduce the value of this database. In fact, for organisms such as fungi, which are notoriously difficult to identify, up to 20% of DNA sequence records may have erroneous lineage designations in GenBank. Gene function annotation in protein sequence databases is similarly error-prone. Because identity and function of new sequences are often determined by bioinformatic analyses, both types of errors are propagated into new accessions, leading to long-term degradation of the quality of the database. Currently, primary sequence data are annotated by the authors of those data, and can only be reannotated by the same authors. This is inefficient and unsustainable over the long term as authors eventually leave the field. Although it is possible to link third-party databases to GenBank records, this is a short-term solution that has little guarantee of permanence. Similarly, the current third-party annotation option in GenBank (TPA) complicates rather than solves the problem by creating an identical record with a new annotation, while leaving the original record unflagged and unlinked to the new record. Since the origin of public zoological and botanical specimen collections, an open system of cumulative annotation has evolved, whereby the original name is retained, but additional opinion is directly appended and used for filing and retrieval. This was needed as new specimens and analyses allowed for reevaluation of older specimens and the original depositors became unavailable. The time has come for the public sequence database to incorporate a community-curated, cumulative annotation process that allows third parties to improve the annotations of sequences when warranted by published peer-reviewed analyses.
Palabras clave: Its , Taxonomy , Ecology , Bioinformatics
Ver el registro completo
 
Archivos asociados
Thumbnail
 
Tamaño: 141.9Kb
Formato: PDF
.
Descargar
Licencia
info:eu-repo/semantics/openAccess Excepto donde se diga explícitamente, este item se publica bajo la siguiente descripción: Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Unported (CC BY-NC-SA 2.5)
Identificadores
URI: http://hdl.handle.net/11336/45720
URL: http://science.sciencemag.org/content/319/5870/1616.1
DOI: http://dx.doi.org/10.1126/science.319.5870.1616a
Colecciones
Articulos(CERZOS)
Articulos de CENTRO REC.NAT.RENOVABLES DE ZONA SEMIARIDA(I)
Citación
Bidartondo, Martin I.; Bruns, Thomas D.; Blackwell, Meredith; Edwards, Ivan; Taylor, Andy F. S.; et al.; Preserving accuracy in GenBank; American Association for the Advancement of Science; Science; 319; 5870; 21-3-2008; 1616
Compartir
Altmétricas
 

Enviar por e-mail
Separar cada destinatario (hasta 5) con punto y coma.
  • Facebook
  • X Conicet Digital
  • Instagram
  • YouTube
  • Sound Cloud
  • LinkedIn

Los contenidos del CONICET están licenciados bajo Creative Commons Reconocimiento 2.5 Argentina License

https://www.conicet.gov.ar/ - CONICET

Inicio

Explorar

  • Autores
  • Disciplinas
  • Comunidades

Estadísticas

Novedades

  • Noticias
  • Boletines

Ayuda

Acerca de

  • CONICET Digital
  • Equipo
  • Red Federal

Contacto

Godoy Cruz 2290 (C1425FQB) CABA – República Argentina – Tel: +5411 4899-5400 repositorio@conicet.gov.ar
TÉRMINOS Y CONDICIONES