dc.contributor.author
Turjanski, Pablo Guillermo
dc.contributor.author
Ferreiro, Diego
dc.date.available
2019-12-16T20:27:39Z
dc.date.issued
2018-12
dc.identifier.citation
Turjanski, Pablo Guillermo; Ferreiro, Diego; On the Natural Structure of Amino Acid Patterns in Families of Protein Sequences; American Chemical Society; Journal of Physical Chemistry B; 122; 49; 12-2018; 11295-11301
dc.identifier.issn
1520-6106
dc.identifier.uri
http://hdl.handle.net/11336/92321
dc.description.abstract
All known terrestrial proteins are coded as continuous strings of ≈20 amino acids. The patterns formed by the repetitions of elements in groups of finite sequences describe the natural architectures of protein families. We present a method to search for patterns and groupings of patterns in protein sequences using a mathematically precise definition for “repetition”, an efficient algorithmic implementation, and a robust scoring system with no adjustable parameters. We show that the sequence patterns can be well-separated into disjoint classes according to their recurrence in nested structures. The statistics of the occurrences of patterns indicate that short repetitions are sufficient to account for the differences between natural families and randomized groups of sequences by more than 10 standard deviations, while contiguous sequence patterns shorter than 5 residues are effectively random in their occurrences. A small subset of patterns is sufficient to account for a robust “familiarity” definition between arbitrary sets of sequences.
dc.format
application/pdf
dc.language.iso
eng
dc.publisher
American Chemical Society
dc.rights
info:eu-repo/semantics/openAccess
dc.rights.uri
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
dc.subject
PROTEIN STRUCTURE
dc.subject
MAXIMAL REPEATS
dc.subject
FAMILIARITY
dc.subject.classification
Biochemistry and Molecular Biology
dc.subject.classification
Biological Sciences
dc.subject.classification
NATURAL AND EXACT SCIENCES
dc.title
On the Natural Structure of Amino Acid Patterns in Families of Protein Sequences
dc.type
info:eu-repo/semantics/article
dc.type
info:ar-repo/semantics/artículo
dc.type
info:eu-repo/semantics/publishedVersion
dc.date.updated
2019-10-24T19:06:14Z
dc.journal.volume
122
dc.journal.number
49
dc.journal.pagination
11295-11301
dc.journal.pais
United States
dc.description.fil
Fil: Turjanski, Pablo Guillermo. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina. Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales. Departamento de Computación; Argentina
dc.description.fil
Fil: Ferreiro, Diego. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Ciudad Universitaria. Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales. Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales. Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales; Argentina. Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales. Departamento de Química Biológica; Argentina
dc.journal.title
Journal of Physical Chemistry B
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/url/https://pubs.acs.org/doi/10.1021/acs.jpcb.8b07206
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/doi/http://dx.doi.org/10.1021/acs.jpcb.8b07206