IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v13y2025i15p2374-d1709143.html
   My bibliography  Save this article

A Statistical Framework for Modeling Behavioral Engagement via Topic and Psycholinguistic Features: Evidence from High-Dimensional Text Data

Author

Listed:
  • Dan Li

    (College of Economics and Management, Taiyuan University of Technology, Taiyuan 030024, China)

  • Yi Zhang

    (College of Economics and Management, Taiyuan University of Technology, Taiyuan 030024, China
    School of Business Liu Guojun School of Management, Changzhou University, Changzhou 213159, China)

Abstract

This study investigates how topic-specific expression by women delivery riders on digital platforms predicts their community engagement, emphasizing the mediating role of self-disclosure and the moderating influence of cognitive and emotional language features. Using unsupervised topic modeling (Top2Vec, Topical Vectors via Embeddings and Clustering) and psycholinguistic analysis (LIWC, Linguistic Inquiry and Word Count), the paper extracted eleven thematic clusters and quantified self-disclosure intensity, cognitive complexity, and emotional polarity. A moderated mediation model was constructed to estimate the indirect and conditional effects of topic probability on engagement behaviors (likes, comments, and views) via self-disclosure. The results reveal that self-disclosure significantly mediates the influence of topical content on engagement, with emotional negativity amplifying and cognitive complexity selectively enhancing this pathway. Indirect effects differ across topics, highlighting the heterogeneous behavioral salience of expressive themes. The findings support a statistically grounded, semantically interpretable framework for predicting user behavior in high-dimensional text environments. This approach offers practical implications for optimizing algorithmic content ranking and fostering equitable visibility for marginalized digital labor groups.

Suggested Citation

  • Dan Li & Yi Zhang, 2025. "A Statistical Framework for Modeling Behavioral Engagement via Topic and Psycholinguistic Features: Evidence from High-Dimensional Text Data," Mathematics, MDPI, vol. 13(15), pages 1-34, July.
  • Handle: RePEc:gam:jmathe:v:13:y:2025:i:15:p:2374-:d:1709143
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/13/15/2374/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/13/15/2374/
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:13:y:2025:i:15:p:2374-:d:1709143. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.