Mostrar el registro sencillo del ítem

dc.contributor.author
Chacoma, Andrés Alberto  
dc.contributor.author
Zanette, Damian Horacio  
dc.date.available
2021-07-27T14:46:41Z  
dc.date.issued
2020-03  
dc.identifier.citation
Chacoma, Andrés Alberto; Zanette, Damian Horacio; Heaps' Law and Heaps functions in tagged texts: Evidences of their linguistic relevance; Royal Society; Royal Society Open Science; 7; 3; 3-2020; 1-15  
dc.identifier.uri
http://hdl.handle.net/11336/137044  
dc.description.abstract
We study the relationship between vocabulary size and text length in a corpus of 75 literary works in English, authored by six writers, distinguishing between the contributions of three grammatical classes (or 'tags,' namely, nouns, verbs and others), and analyse the progressive appearance of new words of each tag along each individual text. We find that, as prescribed by Heaps' Law, vocabulary sizes and text lengths follow a well-defined power-law relation. Meanwhile, the appearance of new words in each text does not obey a power law, and is on the whole well described by the average of random shufflings of the text. Deviations from this average, however, are statistically significant and show systematic trends across the corpus. Specifically, we find that the appearance of new words along each text is predominantly retarded with respect to the average of random shufflings. Moreover, different tags add systematically distinct contributions to this tendency, with verbs and others being respectively more and less retarded than the mean trend, and nouns following instead the overall mean. These statistical systematicities are likely to point to the existence of linguistically relevant information stored in the different variants of Heaps' Law, a feature that is still in need of extensive assessment.  
dc.format
application/pdf  
dc.language.iso
eng  
dc.publisher
Royal Society  
dc.rights
info:eu-repo/semantics/openAccess  
dc.rights.uri
https://creativecommons.org/licenses/by/2.5/ar/  
dc.subject
GRAMMATICAL CLASSES  
dc.subject
HEAPS' LAW  
dc.subject
LANGUAGE REGULARITIES  
dc.subject
STATISTICAL ANOMALIES  
dc.subject
TAGGED TEXTS  
dc.subject.classification
Otras Ciencias Físicas  
dc.subject.classification
Ciencias Físicas  
dc.subject.classification
CIENCIAS NATURALES Y EXACTAS  
dc.title
Heaps' Law and Heaps functions in tagged texts: Evidences of their linguistic relevance  
dc.type
info:eu-repo/semantics/article  
dc.type
info:ar-repo/semantics/artículo  
dc.type
info:eu-repo/semantics/publishedVersion  
dc.date.updated
2021-07-01T15:21:28Z  
dc.identifier.eissn
2054-5703  
dc.journal.volume
7  
dc.journal.number
3  
dc.journal.pagination
1-15  
dc.journal.pais
Reino Unido  
dc.journal.ciudad
Londres  
dc.description.fil
Fil: Chacoma, Andrés Alberto. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Córdoba. Instituto de Física Enrique Gaviola. Universidad Nacional de Córdoba. Instituto de Física Enrique Gaviola; Argentina  
dc.description.fil
Fil: Zanette, Damian Horacio. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Patagonia Norte; Argentina. Comisión Nacional de Energía Atómica. Gerencia del Área Investigaciones y Aplicaciones no Nucleares; Argentina  
dc.journal.title
Royal Society Open Science  
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/url/https://royalsocietypublishing.org/doi/10.1098/rsos.200008  
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/doi/http://dx.doi.org/10.1098/rsos.200008