IDEAS home Printed from https://ideas.repec.org/a/bla/jinfst/v72y2021i12p1511-1527.html
   My bibliography  Save this article

Genre analysis of movies using a topic model of plot summaries

Author

Listed:
  • Paul Matthews
  • Kathrina Glitre

Abstract

Genre plays an important role in the description, navigation, and discovery of movies, but it is rarely studied at large scale using quantitative methods. This allows an analysis of how genre labels are applied, how genres are composed and how these ingredients change, and how genres compare. We apply unsupervised topic modeling to a large collection of textual movie summaries and then use the model's topic proportions to investigate key questions in genre, including recognizability, mapping, canonicity, and change over time. We find that many genres can be quite easily predicted by their lexical signatures and this defines their position on the genre landscape. We find significant genre composition changes between periods for westerns, science fiction and road movies, reflecting changes in production and consumption values. We show that in terms of canonicity, canonical examples are often at the high end of the topic distribution profile for the genre rather than central as might be predicted by categorization theory.

Suggested Citation

  • Paul Matthews & Kathrina Glitre, 2021. "Genre analysis of movies using a topic model of plot summaries," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 72(12), pages 1511-1527, December.
  • Handle: RePEc:bla:jinfst:v:72:y:2021:i:12:p:1511-1527
    DOI: 10.1002/asi.24525
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/asi.24525
    Download Restriction: no

    File URL: https://libkey.io/10.1002/asi.24525?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Margaret Roberts & Brandon Stewart & Tingley, Dustin & Edoardo Airoldi, 2013. "The structural topic model and applied social science," Working Paper 132666, Harvard University OpenScholar.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mohamed M. Mostafa, 2023. "A one-hundred-year structural topic modeling analysis of the knowledge structure of international management research," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(4), pages 3905-3935, August.
    2. Celso Brunetti & Marc Joëts & Valérie Mignon, 2023. "Reasons Behind Words: OPEC Narratives and the Oil Market," Working Papers 2023-19, CEPII research center.
    3. García, Diego & Hu, Xiaowen & Rohrer, Maximilian, 2023. "The colour of finance words," Journal of Financial Economics, Elsevier, vol. 147(3), pages 525-549.
    4. Bai, Xiwen & Zhang, Xiunian & Li, Kevin X. & Zhou, Yaoming & Yuen, Kum Fai, 2021. "Research topics and trends in the maritime transport: A structural topic model," Transport Policy, Elsevier, vol. 102(C), pages 11-24.
    5. Wang, Jason & Weiss, Robert E., 2022. "Local and global topics in text modeling of web pages nested in web sites," Computational Statistics & Data Analysis, Elsevier, vol. 173(C).
    6. Beatrice Ferrario & Stefanie Stantcheva, 2022. "Eliciting People's First-Order Concerns: Text Analysis of Open-Ended Survey Questions," AEA Papers and Proceedings, American Economic Association, vol. 112, pages 163-169, May.
    7. Nuccio Ludovico & Federica Dessi & Marino Bonaiuto, 2020. "Stakeholders Mapping for Sustainable Biofuels: An Innovative Procedure Based on Computational Text Analysis and Social Network Analysis," Sustainability, MDPI, vol. 12(24), pages 1-22, December.
    8. Ben Cormier & Mark S. Manger, 2022. "Power, ideas, and World Bank conditionality," The Review of International Organizations, Springer, vol. 17(3), pages 397-425, July.
    9. David Ardia & Keven Bluteau & Mohammad Abbas Meghani, 2021. "Thirty Years of Academic Finance," Papers 2112.14902, arXiv.org, revised Aug 2022.
    10. van Loon, Austin, 2022. "Three Families of Automated Text Analysis," SocArXiv htnej, Center for Open Science.
    11. Kang, Inje & Yang, Jiseong & Lee, Wonjae & Seo, Eun-Yeong & Lee, Duk Hee, 2023. "Delineating development trends of nanotechnology in the semiconductor industry: Focusing on the relationship between science and technology by employing structural topic model," Technology in Society, Elsevier, vol. 74(C).
    12. R. Maria del Rio-Chanona & Alejandro Hermida-Carrillo & Melody Sepahpour-Fard & Luning Sun & Renata Topinkova & Ljubica Nedelkoska, 2022. "Mental health concerns prelude the Great Resignation: Evidence from Social Media," Papers 2208.07926, arXiv.org.
    13. Keith Carlson & Michael A. Livermore & Daniel N. Rockmore, 2020. "The Problem of Data Bias in the Pool of Published U.S. Appellate Court Opinions," Journal of Empirical Legal Studies, John Wiley & Sons, vol. 17(2), pages 224-261, June.
    14. Diaf, Sami, 2022. "Policy preference at central banks: Quantifying monetary policy signals using keyword topic models," WiSo-HH Working Paper Series 69, University of Hamburg, Faculty of Business, Economics and Social Sciences, WISO Research Laboratory.
    15. Larsen, Vegard H. & Thorsrud, Leif A., 2019. "The value of news for economic developments," Journal of Econometrics, Elsevier, vol. 210(1), pages 203-218.
    16. Laura K. Nelson & Derek Burk & Marcel Knudsen & Leslie McCall, 2021. "The Future of Coding: A Comparison of Hand-Coding and Three Types of Computer-Assisted Text Analysis Methods," Sociological Methods & Research, , vol. 50(1), pages 202-237, February.
    17. Rachael M. Moyer, 2022. "Images of controversy: Examining cognition of hydraulic fracturing among policy elites and the general public," Review of Policy Research, Policy Studies Organization, vol. 39(4), pages 441-467, July.
    18. Matthew Gentzkow & Bryan T. Kelly & Matt Taddy, 2017. "Text as Data," NBER Working Papers 23276, National Bureau of Economic Research, Inc.
    19. Sergio Davalos & Ehsan H. Feroz, 2022. "A textual analysis of the US Securities and Exchange Commission's accounting and auditing enforcement releases relating to the Sarbanes–Oxley Act," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 29(1), pages 19-40, January.
    20. Rose, Rodrigo L. & Puranik, Tejas G. & Mavris, Dimitri N. & Rao, Arjun H., 2022. "Application of structural topic modeling to aviation safety data," Reliability Engineering and System Safety, Elsevier, vol. 224(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jinfst:v:72:y:2021:i:12:p:1511-1527. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.asis.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.