IDEAS home Printed from https://ideas.repec.org/a/eee/intfor/v31y2015i2p364-390.html
   My bibliography  Save this article

Box office forecasting using machine learning algorithms based on SNS data

Author

Listed:
  • Kim, Taegu
  • Hong, Jungsik
  • Kang, Pilsung

Abstract

We propose a novel approach to the box office forecasting of motion pictures using social network service (SNS) data and machine learning-based algorithms. We begin by providing a comprehensive survey of the forecasting algorithms and explanatory variables used in the motion picture domain. Because of the importance of forecasting in early periods, we develop three sequential forecasting models for predicting the non-cumulative and cumulative box office earnings: (1) prior to, (2) a week after, and (3) two weeks after release. The numbers of SNS mentions and their weekly trends are used as input variables in addition to the screening-related information. A genetic algorithm is adopted for determining significant input variables, whereas three machine learning-based nonlinear regression algorithms and their combinations are employed for building forecasting models. Experimental results show that the utilization of SNS data, machine learning-based algorithms and their combination made noticeable improvements to the forecasting accuracies of all the three models.

Suggested Citation

  • Kim, Taegu & Hong, Jungsik & Kang, Pilsung, 2015. "Box office forecasting using machine learning algorithms based on SNS data," International Journal of Forecasting, Elsevier, vol. 31(2), pages 364-390.
  • Handle: RePEc:eee:intfor:v:31:y:2015:i:2:p:364-390
    DOI: 10.1016/j.ijforecast.2014.05.006
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0169207014000971
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ijforecast.2014.05.006?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Anita Elberse & Jehoshua Eliashberg, 2003. "Demand and Supply Dynamics for Sequentially Released Products in International Markets: The Case of Motion Pictures," Marketing Science, INFORMS, vol. 22(3), pages 329-354.
    2. Márton Mestyán & Taha Yasseri & János Kertész, 2013. "Early Prediction of Movie Box Office Success Based on Wikipedia Activity Big Data," PLOS ONE, Public Library of Science, vol. 8(8), pages 1-8, August.
    3. Ravid, S Abraham, 1999. "Information, Blockbusters, and Stars: A Study of the Film Industry," The Journal of Business, University of Chicago Press, vol. 72(4), pages 463-492, October.
    4. James Jianxin Gong & Wim A. Van der Stede & S. Mark Young, 2011. "Real Options in the Motion Picture Industry: Evidence from Film Marketing and Sequels," Contemporary Accounting Research, John Wiley & Sons, vol. 28(5), pages 1438-1466, December.
    5. Francis Lee, 2009. "Cultural discount of cinematic achievement: the academy awards and U.S. movies’ East Asian box office," Journal of Cultural Economics, Springer;The Association for Cultural Economics International, vol. 33(4), pages 239-263, November.
    6. Ramya Neelamegham & Pradeep Chintagunta, 1999. "A Bayesian Model to Forecast New Product Performance in Domestic and International Markets," Marketing Science, INFORMS, vol. 18(2), pages 115-136.
    7. Duan, Wenjing & Gu, Bin & Whinston, Andrew B., 2008. "The dynamics of online word-of-mouth and product sales—An empirical investigation of the movie industry," Journal of Retailing, Elsevier, vol. 84(2), pages 233-242.
    8. Stephanie Brewer & Jason Kelley & James Jozefowicz, 2009. "A blueprint for success in the US film industry," Applied Economics, Taylor & Francis Journals, vol. 41(5), pages 589-606.
    9. Rogers, Everett M, 1976. "New Product Adoption and Diffusion," Journal of Consumer Research, Journal of Consumer Research Inc., vol. 2(4), pages 290-301, March.
    10. Byeng-Hee Chang & Eyun-Jung Ki, 2005. "Devising a Practical Model for Predicting Theatrical Movie Success: Focusing on the Experience Good Property," Journal of Media Economics, Taylor & Francis Journals, vol. 18(4), pages 247-269.
    11. Marshall, Pablo & Dockendorff, Monika & Ibáñez, Soledad, 2013. "A forecasting system for movie attendance," Journal of Business Research, Elsevier, vol. 66(10), pages 1800-1806.
    12. Dan Lovallo & Carmina Clarke & Colin Camerer, 2012. "Robust analogizing and the outside view: two empirical tests of case‐based decision making," Strategic Management Journal, Wiley Blackwell, vol. 33(5), pages 496-512, May.
    13. Andrew Ainslie & Xavier Drèze & Fred Zufryden, 2005. "Modeling Movie Life Cycles and Market Share," Marketing Science, INFORMS, vol. 24(3), pages 508-517, November.
    14. Jonsson, Bo, 1994. "Prediction with a linear regression model and errors in a regressor," International Journal of Forecasting, Elsevier, vol. 10(4), pages 549-555, December.
    15. Jehoshua Eliashberg & Jedid-Jah Jonker & Mohanbir S. Sawhney & Berend Wierenga, 2000. "MOVIEMOD: An Implementable Decision-Support System for Prerelease Market Evaluation of Motion Pictures," Marketing Science, INFORMS, vol. 19(3), pages 226-243, January.
    16. Goia, Aldo & May, Caterina & Fusai, Gianluca, 2010. "Functional clustering and linear regression for peak load forecasting," International Journal of Forecasting, Elsevier, vol. 26(4), pages 700-711, October.
    17. Chakravarty, Anindita & Liu, Yong & Mazumdar, Tridib, 2010. "The Differential Effects of Online Word-of-Mouth and Critics' Reviews on Pre-release Movie Evaluation," Journal of Interactive Marketing, Elsevier, vol. 24(3), pages 185-197.
    18. Arthur De Vany & W. Walls, 1999. "Uncertainty in the Movie Industry: Does Star Power Reduce the Terror of the Box Office?," Journal of Cultural Economics, Springer;The Association for Cultural Economics International, vol. 23(4), pages 285-318, November.
    19. Jehoshua Eliashberg & Mohanbir S. Sawhney, 1994. "Modeling Goes to Hollywood: Predicting Individual Differences in Movie Enjoyment," Management Science, INFORMS, vol. 40(9), pages 1151-1173, September.
    20. Mohanbir S. Sawhney & Jehoshua Eliashberg, 1996. "A Parsimonious Model for Forecasting Gross Box-Office Revenues of Motion Pictures," Marketing Science, INFORMS, vol. 15(2), pages 113-131.
    21. Pradeep K. Chintagunta & Shyam Gopinath & Sriram Venkataraman, 2010. "The Effects of Online User Reviews on Movie Box Office Performance: Accounting for Sequential Rollout and Aggregation Across Local Markets," Marketing Science, INFORMS, vol. 29(5), pages 944-957, 09-10.
    22. Zhang, Jie & Thomas, Lyn C., 2012. "Comparisons of linear regression and survival analysis using single and mixture distributions approaches in modelling LGD," International Journal of Forecasting, Elsevier, vol. 28(1), pages 204-215.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Steven F. Lehrer & Tian Xie, 2022. "The Bigger Picture: Combining Econometrics with Analytics Improves Forecasts of Movie Success," Management Science, INFORMS, vol. 68(1), pages 189-210, January.
    2. Nasios, Ioannis & Vogklis, Konstantinos, 2022. "Blending gradient boosted trees and neural networks for point and probabilistic forecasting of hierarchical time series," International Journal of Forecasting, Elsevier, vol. 38(4), pages 1448-1459.
    3. Schaer, Oliver & Kourentzes, Nikolaos & Fildes, Robert, 2019. "Demand forecasting with user-generated online information," International Journal of Forecasting, Elsevier, vol. 35(1), pages 197-212.
    4. Oliver Schaer & Nikolaos Kourentzes & Robert Fildes, 2022. "Predictive competitive intelligence with prerelease online search traffic," Production and Operations Management, Production and Operations Management Society, vol. 31(10), pages 3823-3839, October.
    5. Green, Lawrence & Sung, Ming-Chien & Ma, Tiejun & Johnson, Johnnie E. V., 2019. "To what extent can new web-based technology improve forecasts? Assessing the economic value of information derived from Virtual Globes and its rate of diffusion in a financial market," European Journal of Operational Research, Elsevier, vol. 278(1), pages 226-239.
    6. Blazquez, Desamparados & Domenech, Josep, 2018. "Big Data sources and methods for social and economic analyses," Technological Forecasting and Social Change, Elsevier, vol. 130(C), pages 99-113.
    7. Yi Liao & Yuxuan Peng & Songlin Shi & Victor Shi & Xiaohong Yu, 2022. "Early box office prediction in China’s film market based on a stacking fusion model," Annals of Operations Research, Springer, vol. 308(1), pages 321-338, January.
    8. Ioannis Nasios & Konstantinos Vogklis, 2023. "Blending gradient boosted trees and neural networks for point and probabilistic forecasting of hierarchical time series," Papers 2310.13029, arXiv.org.
    9. Daekook Kang, 2021. "Box-office forecasting in Korea using search trend data: a modified generalized Bass diffusion model," Electronic Commerce Research, Springer, vol. 21(1), pages 41-72, March.
    10. Jordi McKenzie, 2023. "The economics of movies (revisited): A survey of recent literature," Journal of Economic Surveys, Wiley Blackwell, vol. 37(2), pages 480-525, April.
    11. Sangjae Lee & Joon Yeon Choeh, 2020. "Movie Production Efficiency Moderating between Online Word-of-Mouth and Subsequent Box Office Revenue," Sustainability, MDPI, vol. 12(16), pages 1-18, August.
    12. An, Yongdae & An, Jinwon & Cho, Sungzoon, 2021. "Artificial intelligence-based predictions of movie audiences on opening Saturday," International Journal of Forecasting, Elsevier, vol. 37(1), pages 274-288.
    13. Ronny Behrens & Natasha Zhang Foutz & Michael Franklin & Jannis Funk & Fernanda Gutierrez-Navratil & Julian Hofmann & Ulrike Leibfried, 2021. "Leveraging analytics to produce compelling and profitable film content," Journal of Cultural Economics, Springer;The Association for Cultural Economics International, vol. 45(2), pages 171-211, June.
    14. Sheng, Jie & Amankwah-Amoah, Joseph & Wang, Xiaojun, 2019. "Technology in the 21st century: New challenges and opportunities," Technological Forecasting and Social Change, Elsevier, vol. 143(C), pages 321-335.
    15. Evgeny A. Antipov & Elena B. Pokryshevskaya, 2017. "Are box office revenues equally unpredictable for all movies? Evidence from a Random forest-based model," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 16(3), pages 295-307, June.
    16. Jong-Min Kim & Leixin Xia & Iksuk Kim & Seungjoo Lee & Keon-Hyung Lee, 2020. "Finding Nemo: Predicting Movie Performances by Machine Learning Methods," JRFM, MDPI, vol. 13(5), pages 1-12, May.
    17. Sangjae Lee & Joon Yeon Choeh, 2020. "The impact of online review helpfulness and word of mouth communication on box office performance predictions," Palgrave Communications, Palgrave Macmillan, vol. 7(1), pages 1-12, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Daekook Kang, 2021. "Box-office forecasting in Korea using search trend data: a modified generalized Bass diffusion model," Electronic Commerce Research, Springer, vol. 21(1), pages 41-72, March.
    2. Jordi McKenzie, 2023. "The economics of movies (revisited): A survey of recent literature," Journal of Economic Surveys, Wiley Blackwell, vol. 37(2), pages 480-525, April.
    3. Jason M. T. Roos & Ron Shachar, 2014. "When Kerry Met Sally: Politics and Perceptions in the Demand for Movies," Management Science, INFORMS, vol. 60(7), pages 1617-1631, July.
    4. Bae, Giwoong & Kim, Hye-jin, 2019. "The impact of movie titles on box office success," Journal of Business Research, Elsevier, vol. 103(C), pages 100-109.
    5. Gazley, Aaron & Clark, Gemma & Sinha, Ashish, 2011. "Understanding preferences for motion pictures," Journal of Business Research, Elsevier, vol. 64(8), pages 854-861, August.
    6. Darren Filson & James H. Havlicek, 2018. "The performance of global film franchises: installment effects and extension decisions," Journal of Cultural Economics, Springer;The Association for Cultural Economics International, vol. 42(3), pages 447-467, August.
    7. Hofmann, Julian & Clement, Michel & Völckner, Franziska & Hennig-Thurau, Thorsten, 2017. "Empirical generalizations on the impact of stars on the economic success of movies," International Journal of Research in Marketing, Elsevier, vol. 34(2), pages 442-461.
    8. Fei Peng & Lili Kang & Sajid Anwar & Xue Li, 2019. "Star power and box office revenues: evidence from China," Journal of Cultural Economics, Springer;The Association for Cultural Economics International, vol. 43(2), pages 247-278, June.
    9. Natalia Gmerek, 2015. "The determinants of Polish movies’ box office performance in Poland," Journal of Marketing and Consumer Behaviour in Emerging Markets, University of Warsaw, Faculty of Management, vol. 1(1), pages 15-35.
    10. Jehoshua Eliashberg & Anita Elberse & Mark A.A.M. Leenders, 2006. "The Motion Picture Industry: Critical Issues in Practice, Current Research, and New Research Directions," Marketing Science, INFORMS, vol. 25(6), pages 638-661, 11-12.
    11. Divakaran, Pradeep Kumar Ponnamma & Palmer, Adrian & Søndergaard, Helle Alsted & Matkovskyy, Roman, 2017. "Pre-launch Prediction of Market Performance for Short Lifecycle Products Using Online Community Data," Journal of Interactive Marketing, Elsevier, vol. 38(C), pages 12-28.
    12. Moez Hababou & Nawel Amrouche & Kamel Jedidi, 2016. "Measuring Economic Efficiency in the Motion Picture Industry: a Data Envelopment Analysis Approach," Customer Needs and Solutions, Springer;Institute for Sustainable Innovation and Growth (iSIG), vol. 3(3), pages 144-158, December.
    13. Gaenssle Sophia & Budzinski Oliver & Astakhova Daria, 2018. "Conquering the Box Office: Factors Influencing Success of International Movies in Russia," Review of Network Economics, De Gruyter, vol. 17(4), pages 245-266, December.
    14. Steven F. Lehrer & Tian Xie, 2022. "The Bigger Picture: Combining Econometrics with Analytics Improves Forecasts of Movie Success," Management Science, INFORMS, vol. 68(1), pages 189-210, January.
    15. Ana Suárez-Vázquez, 2011. "Critic power or star power? The influence of hallmarks of quality of motion pictures: an experimental approach," Journal of Cultural Economics, Springer;The Association for Cultural Economics International, vol. 35(2), pages 119-135, May.
    16. Suman Basuroy & S. Abraham Ravid & Richard T. Gretz & B. J. Allen, 2020. "Is everybody an expert? An investigation into the impact of professional versus user reviews on movie revenues," Journal of Cultural Economics, Springer;The Association for Cultural Economics International, vol. 44(1), pages 57-96, March.
    17. Brinja Meiseberg & Thomas Ehrmann, 2013. "Diversity in teams and the success of cultural products," Journal of Cultural Economics, Springer;The Association for Cultural Economics International, vol. 37(1), pages 61-86, February.
    18. Mun, Mak Kit & Chong, Choo Wei, 2018. "Forecasting Movie Demand Using Total and Split Exponential Smoothing," Jurnal Ekonomi Malaysia, Faculty of Economics and Business, Universiti Kebangsaan Malaysia, vol. 52(2), pages 81-94.
    19. Allègre L. Hadida & Joseph Lampel & W. David Walls & Amit Joshi, 2021. "Hollywood studio filmmaking in the age of Netflix: a tale of two institutional logics," Journal of Cultural Economics, Springer;The Association for Cultural Economics International, vol. 45(2), pages 213-238, June.
    20. Karniouchina, Ekaterina V., 2011. "Impact of star and movie buzz on motion picture distribution and box office revenue," International Journal of Research in Marketing, Elsevier, vol. 28(1), pages 62-74.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:intfor:v:31:y:2015:i:2:p:364-390. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/ijforecast .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.