IDEAS home Printed from https://ideas.repec.org/p/crs/wpaper/2017-33.html
   My bibliography  Save this paper

Bayesian Hierarchical Finite Mixture Models of Reading Times: A Case Study

Author

Listed:
  • Shravan Vasishth

    (University of Potsdam)

  • Bruno Nicenboim

    (University of Potsdam)

  • Nicolas Chopin

    (CREST; ENSAE)

  • Robin Ryder

    (CNRS; Université Paris-Dauphine; PSL)

Abstract

This theoretical note presents a case study demonstrating the importance of Bayesian hierarchical mixture models as a modelling tool for evaluating the predictions of competing theories of cognitive processes. This note also contributes to improving current practices in data analysis in the psychological sciences. As a case study, we revisit two published data sets from psycholinguistics. In sentence comprehension, it is widely assumed that the distance between linguistic co-dependents affects the latency of dependency resolution: the longer the distance, the longer the time taken to complete the dependency (e.g., Gibson 2000). An alternative theory, direct access (McElree, 2000), assumes that retrieval times are a mixture of two distributions (Nicenboim & Vasishth, 2017): one distribution represents successful retrievals and the other represents an initial failure to retrieve the correct dependent, followed by a reanalysis (McElree, 1993) that leads to successful retrieval. Here, dependency distance has the effect that in long-distance conditions the proportion of reanalyses is higher (due to similarity-based interference). We implement both theories as Bayesian hierarchical models and show that the direct-access model fits the Chinese relative clause reading time data better than the dependency-distance account. This work makes several novel contributions. First, we demonstrate how the researcher can reason about the underlying generative process of their data, thereby expressing the underlying cognitive process as a statistical model. Second, we show how models that have been developed in an exploratory manner to represent different underlying generative processes can be compared in terms of their predictive performance, using both K-fold cross validation on existing data, and using completely new data. Finally, we show how the models can be evaluated using simulated data; this is a method that is standardly used in Bayesian statistics, but remains unutilized in data analysis within the psychological sciences.

Suggested Citation

  • Shravan Vasishth & Bruno Nicenboim & Nicolas Chopin & Robin Ryder, 2017. "Bayesian Hierarchical Finite Mixture Models of Reading Times: A Case Study," Working Papers 2017-33, Center for Research in Economics and Statistics.
  • Handle: RePEc:crs:wpaper:2017-33
    as

    Download full text from publisher

    File URL: http://crest.science/RePEc/wpstorage/2017-33.pdf
    File Function: CREST working paper version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Jeffrey Rouder, 2005. "Are unshifted distributional models appropriate for response time?," Psychometrika, Springer;The Psychometric Society, vol. 70(2), pages 377-381, June.
    2. Shravan Vasishth & Zhong Chen & Qiang Li & Gueilan Guo, 2013. "Processing Chinese Relative Clauses: Evidence for the Subject-Relative Advantage," PLOS ONE, Public Library of Science, vol. 8(10), pages 1-15, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mohsen Soltanifar & Chel Hee Lee, 2023. "SimSST: An R Statistical Software Package to Simulate Stop Signal Task Data," Mathematics, MDPI, vol. 11(3), pages 1-15, January.
    2. Jochen Ranger & Jörg-Tobias Kuhn & José-Luis Gaviria, 2015. "A Race Model for Responses and Response Times in Tests," Psychometrika, Springer;The Psychometric Society, vol. 80(3), pages 791-810, September.
    3. Jeffrey Rouder & Jordan Province & Richard Morey & Pablo Gomez & Andrew Heathcote, 2015. "The Lognormal Race: A Cognitive-Process Model of Choice and Latency with Desirable Psychometric Properties," Psychometrika, Springer;The Psychometric Society, vol. 80(2), pages 491-513, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:crs:wpaper:2017-33. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Secretariat General (email available below). General contact details of provider: https://edirc.repec.org/data/crestfr.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.