IDEAS home Printed from https://ideas.repec.org/a/bla/jinfst/v72y2021i12p1511-1527.html
   My bibliography  Save this article

Genre analysis of movies using a topic model of plot summaries

Author

Listed:
  • Paul Matthews
  • Kathrina Glitre

Abstract

Genre plays an important role in the description, navigation, and discovery of movies, but it is rarely studied at large scale using quantitative methods. This allows an analysis of how genre labels are applied, how genres are composed and how these ingredients change, and how genres compare. We apply unsupervised topic modeling to a large collection of textual movie summaries and then use the model's topic proportions to investigate key questions in genre, including recognizability, mapping, canonicity, and change over time. We find that many genres can be quite easily predicted by their lexical signatures and this defines their position on the genre landscape. We find significant genre composition changes between periods for westerns, science fiction and road movies, reflecting changes in production and consumption values. We show that in terms of canonicity, canonical examples are often at the high end of the topic distribution profile for the genre rather than central as might be predicted by categorization theory.

Suggested Citation

  • Paul Matthews & Kathrina Glitre, 2021. "Genre analysis of movies using a topic model of plot summaries," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 72(12), pages 1511-1527, December.
  • Handle: RePEc:bla:jinfst:v:72:y:2021:i:12:p:1511-1527
    DOI: 10.1002/asi.24525
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/asi.24525
    Download Restriction: no

    File URL: https://libkey.io/10.1002/asi.24525?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Margaret Roberts & Brandon Stewart & Tingley, Dustin & Edoardo Airoldi, 2013. "The structural topic model and applied social science," Working Paper 132666, Harvard University OpenScholar.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mohamed M. Mostafa, 2023. "A one-hundred-year structural topic modeling analysis of the knowledge structure of international management research," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(4), pages 3905-3935, August.
    2. Cruciani, Caterina & Santagiustina, Carlo R.M.A., 2023. "The present and future of sustainability disclosure in equity investment funds’ pre-contractual documents: Mapping ESG discourse through STM," Finance Research Letters, Elsevier, vol. 58(PA).
    3. Celso Brunetti & Marc Joëts & Valérie Mignon, 2023. "Reasons Behind Words: OPEC Narratives and the Oil Market," Working Papers hal-04196053, HAL.
    4. Zhuanlan Sun & C. Clark Cao & Sheng Liu & Yiwei Li & Chao Ma, 2024. "Behavioral consequences of second-person pronouns in written communications between authors and reviewers of scientific papers," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    5. Beatrice Ferrario & Stefanie Stantcheva, 2022. "Eliciting People's First-Order Concerns: Text Analysis of Open-Ended Survey Questions," AEA Papers and Proceedings, American Economic Association, vol. 112, pages 163-169, May.
    6. Desiraju, Kanti & Mishra, Arindra Nath & Sengupta, Pooja, 2024. "Customer perceptions on open banking apps: Insights using Structural topic modeling," Journal of Retailing and Consumer Services, Elsevier, vol. 81(C).
    7. van Loon, Austin, 2022. "Three Families of Automated Text Analysis," SocArXiv htnej, Center for Open Science.
    8. R. Maria del Rio-Chanona & Alejandro Hermida-Carrillo & Melody Sepahpour-Fard & Luning Sun & Renata Topinkova & Ljubica Nedelkoska, 2022. "Mental health concerns prelude the Great Resignation: Evidence from Social Media," Papers 2208.07926, arXiv.org.
    9. Keith Carlson & Michael A. Livermore & Daniel N. Rockmore, 2020. "The Problem of Data Bias in the Pool of Published U.S. Appellate Court Opinions," Journal of Empirical Legal Studies, John Wiley & Sons, vol. 17(2), pages 224-261, June.
    10. Candelon, Bertrand & Joëts, Marc & Mignon, Valérie, 2024. "What makes econometric ideas popular: The role of connectivity," Research Policy, Elsevier, vol. 53(7).
    11. Stef M Shuster & Celeste Campos-Castillo & Navid Madani & Kenneth Joseph, 2024. "Who supports Bernie? Analyzing identity and ideological variation on Twitter during the 2020 democratic primaries," PLOS ONE, Public Library of Science, vol. 19(4), pages 1-23, April.
    12. Laura K. Nelson & Derek Burk & Marcel Knudsen & Leslie McCall, 2021. "The Future of Coding: A Comparison of Hand-Coding and Three Types of Computer-Assisted Text Analysis Methods," Sociological Methods & Research, , vol. 50(1), pages 202-237, February.
    13. Rachael M. Moyer, 2022. "Images of controversy: Examining cognition of hydraulic fracturing among policy elites and the general public," Review of Policy Research, Policy Studies Organization, vol. 39(4), pages 441-467, July.
    14. Arold, B. W. & Ash, E. & MacLeod, W. B. & Naidu, S., 2025. "Worker Rights in Collective Bargaining," Cambridge Working Papers in Economics 2517, Faculty of Economics, University of Cambridge.
    15. Sergio Davalos & Ehsan H. Feroz, 2022. "A textual analysis of the US Securities and Exchange Commission's accounting and auditing enforcement releases relating to the Sarbanes–Oxley Act," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 29(1), pages 19-40, January.
    16. Wen Shi & Diyi Liu & Jing Yang & Jing Zhang & Sanmei Wen & Jing Su, 2020. "Social Bots’ Sentiment Engagement in Health Emergencies: A Topic-Based Analysis of the COVID-19 Pandemic Discussions on Twitter," IJERPH, MDPI, vol. 17(22), pages 1-18, November.
    17. Minkyu Kim & Chankook Park, 2021. "Academic Topics Related to Household Energy Consumption Using the Future Sign Detection Technique," Energies, MDPI, vol. 14(24), pages 1-24, December.
    18. David Ardia & Keven Bluteau & Mohammad‐Abbas Meghani, 2024. "Thirty years of academic finance," Journal of Economic Surveys, Wiley Blackwell, vol. 38(3), pages 1008-1042, July.
    19. Villa-Cox, Gonzalo & Bucaram-Villacís, Santiago J. & Goethals, Peter & Speelman, Stijn, 2023. "Mapping linkages between ecosystem services and agricultural and landscape management using structural topic modeling of scientific literature," Ecosystem Services, Elsevier, vol. 61(C).
    20. Kanyao Han & Rezvaneh Rezapour & Katia Nakamura & Dikshya Devkota & Daniel C. Miller & Jana Diesner, 2023. "An expert‐in‐the‐loop method for domain‐specific document categorization based on small training data," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 74(6), pages 669-684, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jinfst:v:72:y:2021:i:12:p:1511-1527. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.asis.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.