Scale and time dependence of serial correlations in word-length time series of written texts

Author

Listed:
  • Rodriguez, E.
  • Aguilar-Cornejo, M.
  • Femat, R.
  • Alvarez-Ramirez, J.

Abstract

This work considered the quantitative analysis of large written texts. Each text was converted into a time series by taking the sequence of word lengths, and detrended fluctuation analysis (DFA) was used to characterize long-range serial correlations in that series. The DFA was implemented within a rolling-window framework to estimate how the correlation strength, quantified by the scaling exponent, varies along the text. In addition, a filtering derivative was used to compute the dependence of the scaling exponent on the scale. The analysis was applied to three famous English-language literary narratives: Alice in Wonderland (by Lewis Carroll), Dracula (by Bram Stoker) and Sense and Sensibility (by Jane Austen). The results showed that high correlations appear at scales of about 50–200 words, suggesting that the text is most strongly coherent at these scales. The scaling exponent was not constant along the text; it showed important variations with apparently cyclical behavior. The variations in the scaling exponent were found to coincide with changes in narrative units (e.g., chapters). This suggests that the scaling exponent obtained from the DFA can detect changes in narrative structure as expressed by the usage of words of different lengths.
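
The pipeline described in the abstract (word-length series, DFA, rolling window) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the detrending order, the tokenization rule, the window size, or the set of scales, so the first-order (linear) detrending, the regular expression, and every function and parameter name below are assumptions made for illustration. The filtering derivative used for the scale-dependent exponent is not reproduced here.

```python
import math
import re


def word_lengths(text):
    """Word-length series; the tokenization rule is an assumption."""
    return [len(w) for w in re.findall(r"[A-Za-z']+", text)]


def _linear_fit(xs, ys):
    """Ordinary least-squares line; returns (slope, intercept)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx


def fluctuation(series, scale):
    """Fluctuation F(n) for one box size n = scale (scale >= 2)."""
    mean = sum(series) / len(series)
    # Profile: cumulative sum of the mean-centred series.
    profile, total = [], 0.0
    for x in series:
        total += x - mean
        profile.append(total)
    n_boxes = len(profile) // scale
    xs = list(range(scale))
    sq_sum = 0.0
    for b in range(n_boxes):
        box = profile[b * scale:(b + 1) * scale]
        # Remove the local linear trend inside each non-overlapping box.
        slope, intercept = _linear_fit(xs, box)
        sq_sum += sum((y - (slope * x + intercept)) ** 2
                      for x, y in zip(xs, box))
    return math.sqrt(sq_sum / (n_boxes * scale))


def dfa_exponent(series, scales):
    """Scaling exponent: slope of log F(n) versus log n."""
    log_n = [math.log(n) for n in scales]
    log_f = [math.log(fluctuation(series, n)) for n in scales]
    slope, _ = _linear_fit(log_n, log_f)
    return slope


def rolling_exponent(series, window, step, scales):
    """(start index, exponent) pairs for windows sliding along the text."""
    return [
        (start, dfa_exponent(series[start:start + window], scales))
        for start in range(0, len(series) - window + 1, step)
    ]
```

For example, `rolling_exponent(word_lengths(text), window=2000, step=200, scales=[50, 100, 200])` would trace the exponent along a text, where the window, step, and scales are illustrative values only. An exponent near 0.5 indicates an uncorrelated series, and values above 0.5 indicate persistent long-range correlations.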

Suggested Citation

  • Rodriguez, E. & Aguilar-Cornejo, M. & Femat, R. & Alvarez-Ramirez, J., 2014. "Scale and time dependence of serial correlations in word-length time series of written texts," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 414(C), pages 378-386.
  • Handle: RePEc:eee:phsmap:v:414:y:2014:i:c:p:378-386
    DOI: 10.1016/j.physa.2014.07.063

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437114006475
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

