Mostrar el registro sencillo del ítem
dc.contributor.author
Marchi, Jacopo
dc.contributor.author
Galpern, Ezequiel Alejandro
dc.contributor.author
Espada, Rocio
dc.contributor.author
Ferreiro, Diego
dc.contributor.author
Walczak, Aleksandra M.
dc.contributor.author
Mora, Thierry
dc.date.available
2021-01-25T11:45:08Z
dc.date.issued
2019-08
dc.identifier.citation
Marchi, Jacopo; Galpern, Ezequiel Alejandro; Espada, Rocio; Ferreiro, Diego; Walczak, Aleksandra M.; et al.; Size and structure of the sequence space of repeat proteins; Public Library of Science; Plos Computational Biology; 15; 8; 8-2019; 1-23
dc.identifier.issn
1553-734X
dc.identifier.uri
http://hdl.handle.net/11336/123547
dc.description.abstract
The coding space of protein sequences is shaped by evolutionary constraints set by requirements of function and stability. We show that the coding space of a given protein family-the total number of sequences in that family-can be estimated using models of maximum entropy trained on multiple sequence alignments of naturally occuring amino acid sequences. We analyzed and calculated the size of three abundant repeat proteins families, whose members are large proteins made of many repetitions of conserved portions of *30 amino acids. While amino acid conservation at each position of the alignment explains most of the reduction of diversity relative to completely random sequences, we found that correlations between amino acid usage at different positions significantly impact that diversity. We quantified the impact of different types of correlations, functional and evolutionary, on sequence diversity. Analysis of the detailed structure of the coding space of the families revealed a rugged landscape, with many local energy minima of varying sizes with a hierarchical structure, reminiscent of fustrated energy landscapes of spin glass in physics. This clustered structure indicates a multiplicity of subtypes within each family, and suggests new strategies for protein design.
dc.format
application/pdf
dc.language.iso
eng
dc.publisher
Public Library of Science
dc.rights
info:eu-repo/semantics/openAccess
dc.rights.uri
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
dc.subject
protein folding
dc.subject
protein design
dc.subject
protein evolution
dc.subject.classification
Biología
dc.subject.classification
Ciencias Biológicas
dc.subject.classification
CIENCIAS NATURALES Y EXACTAS
dc.title
Size and structure of the sequence space of repeat proteins
dc.type
info:eu-repo/semantics/article
dc.type
info:ar-repo/semantics/artículo
dc.type
info:eu-repo/semantics/publishedVersion
dc.date.updated
2020-12-01T16:26:20Z
dc.journal.volume
15
dc.journal.number
8
dc.journal.pagination
1-23
dc.journal.pais
Estados Unidos
dc.description.fil
Fil: Marchi, Jacopo. Ecole Normale Supérieure; Francia
dc.description.fil
Fil: Galpern, Ezequiel Alejandro. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Ciudad Universitaria. Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales. Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales. Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales; Argentina
dc.description.fil
Fil: Espada, Rocio. PSL University; Francia
dc.description.fil
Fil: Ferreiro, Diego. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Ciudad Universitaria. Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales. Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales. Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales; Argentina
dc.description.fil
Fil: Walczak, Aleksandra M.. Ecole Normale Supérieure; Francia
dc.description.fil
Fil: Mora, Thierry. Ecole Normale Supérieure; Francia
dc.journal.title
Plos Computational Biology
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/doi/http://dx.doi.org/10.1371/journal.pcbi.1007282
Archivos asociados