Algorithm Supported Induction for Building Theory: How Can We Use Prediction Models to Theorize?

Authors

  • Yash Raj Shrestha

    (Department of Management, Technology, and Economics, ETH Zürich, Zurich CH 8092, Switzerland)

  • Vivianna Fang He

    (Management Department, École Supérieure des Sciences Economiques et Commerciales (ESSEC) Business School, 95021 Cergy-Pontoise Cedex, France)

  • Phanish Puranam

    (Strategy Department, INSEAD, Singapore, Singapore 138676)

  • Georg von Krogh

    (Department of Management, Technology, and Economics, ETH Zürich, Zurich CH 8092, Switzerland)

Abstract

Across many fields of social science, machine learning (ML) algorithms are rapidly advancing research as tools to support traditional hypothesis-testing research (e.g., through data reduction and automation of data coding, or for improving matching on observable features of a phenomenon or constructing instrumental variables). In this paper, we argue that researchers have yet to recognize the value of ML techniques for theory building from data. This may be in part because of scholars’ inherent distaste for the predictions without explanations that ML algorithms are known to produce. However, precisely because of this property, we argue that ML techniques can be very useful in theory construction during a key step of inductive theorizing: pattern detection. ML can facilitate algorithm-supported induction, yielding conclusions about patterns in data that are likely to be robustly replicable by other analysts and in other samples from the same population. These patterns can then be used as inputs to abductive reasoning for building or developing theories that explain them. We propose that algorithm-supported induction is valuable for researchers interested in using quantitative data to both develop and test theories in a transparent and reproducible manner, and we illustrate our arguments using simulations.
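
The abstract does not reproduce the paper's simulations, so the following is only a minimal sketch of the general idea of pattern detection checked on a second sample. Everything in it is an assumption made for illustration: the inverted-U data-generating process, the sample sizes, the seeds, and the two toy learners (a straight line and a binned-mean predictor) are hypothetical and are not the authors' method.

```python
import random
import statistics


def simulate(n, seed):
    """Draw n (x, y) pairs from a hypothetical inverted-U process plus noise."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x = rng.uniform(-1.0, 1.0)
        y = 1.0 - x * x + rng.gauss(0.0, 0.1)
        data.append((x, y))
    return data


def fit_linear(train):
    """Ordinary least squares with a single predictor."""
    xs = [x for x, _ in train]
    ys = [y for _, y in train]
    mean_x = statistics.fmean(xs)
    mean_y = statistics.fmean(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in train)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return lambda x: intercept + slope * x


def fit_binned(train, bins=8):
    """Flexible learner: mean of y within equal-width bins of x on [-1, 1]."""
    def index(x):
        return min(bins - 1, max(0, int((x + 1.0) / 2.0 * bins)))

    sums = [0.0] * bins
    counts = [0] * bins
    for x, y in train:
        sums[index(x)] += y
        counts[index(x)] += 1
    overall = statistics.fmean([y for _, y in train])
    # Empty bins fall back to the overall mean of y.
    means = [s / c if c else overall for s, c in zip(sums, counts)]
    return lambda x: means[index(x)]


def mse(model, data):
    """Mean squared prediction error of a fitted model on a data set."""
    return statistics.fmean([(y - model(x)) ** 2 for x, y in data])


# Fit on one sample, then check the detected pattern on a fresh sample
# drawn from the same (assumed) population.
train = simulate(500, seed=1)
holdout = simulate(500, seed=2)

linear = fit_linear(train)
binned = fit_binned(train)
linear_mse = mse(linear, holdout)
binned_mse = mse(binned, holdout)

print(f"holdout MSE, linear model: {linear_mse:.4f}")
print(f"holdout MSE, binned model: {binned_mse:.4f}")
```

If the flexible learner predicts the fresh sample clearly better than the straight line, that gap is a pattern (here, curvature) that held up out of sample; explaining why the curvature exists is left to the abductive theorizing step the abstract describes.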

Suggested Citation

  • Yash Raj Shrestha & Vivianna Fang He & Phanish Puranam & Georg von Krogh, 2021. "Algorithm Supported Induction for Building Theory: How Can We Use Prediction Models to Theorize?," Organization Science, INFORMS, vol. 32(3), pages 856-880, May.
  • Handle: RePEc:inm:ororsc:v:32:y:2021:i:3:p:856-880
    DOI: 10.1287/orsc.2020.1382

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/orsc.2020.1382
    Download Restriction: no


    Citations

    Cited by:

    1. Chengyu Liu & Yan Li & Mingjie Fang & Feng Liu, 2023. "Using machine learning to explore the determinants of service satisfaction with online healthcare platforms during the COVID-19 pandemic," Service Business, Springer;Pan-Pacific Business Association, vol. 17(2), pages 449-476, June.
    2. Schade, Philipp & Schuhmacher, Monika C., 2023. "Predicting entrepreneurial activity using machine learning," Journal of Business Venturing Insights, Elsevier, vol. 19(C).
    3. Islam, Towhidul & Meade, Nigel & Carson, Richard T. & Louviere, Jordan J. & Wang, Juan, 2022. "The usefulness of socio-demographic variables in predicting purchase decisions: Evidence from machine learning procedures," Journal of Business Research, Elsevier, vol. 151(C), pages 324-338.
    4. Dahlander, Linus & Beretta, Michela & Thomas, Arne & Kazemi, Shahab & Fenger, Morten H.J. & Frederiksen, Lars, 2023. "Weeding out or picking winners in open innovation? Factors driving multi-stage crowd selection on LEGO ideas," Research Policy, Elsevier, vol. 52(10).
    5. Stefano Cabras & J. D. Tena, 2023. "Implicit institutional incentives and individual decisions: Causal inference with deep learning models," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 44(6), pages 3739-3754, September.
    6. Constance E. Helfat & Aseem Kaul & David J. Ketchen & Jay B. Barney & Olivier Chatain & Harbir Singh, 2023. "Renewing the resource‐based view: New contexts, new concepts, and new methods," Strategic Management Journal, Wiley Blackwell, vol. 44(6), pages 1357-1390, June.
    7. Prothit Sen & Phanish Puranam, 2022. "Do Alliance portfolios encourage or impede new business practice adoption? Theory and evidence from the private equity industry," Strategic Management Journal, Wiley Blackwell, vol. 43(11), pages 2279-2312, November.
    8. Liu, Feng & Long, Xiao & Dong, Lin & Fang, Mingjie, 2023. "What makes you entrepreneurial? Using machine learning to investigate the determinants of entrepreneurship in China," China Economic Review, Elsevier, vol. 81(C).
    9. Milan Miric & Nan Jia & Kenneth G. Huang, 2023. "Using supervised machine learning for large‐scale classification in management research: The case for identifying artificial intelligence patents," Strategic Management Journal, Wiley Blackwell, vol. 44(2), pages 491-519, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. McKenzie, David & Sansone, Dario, 2017. "Man vs. Machine in Predicting Successful Entrepreneurs: Evidence from a Business Plan Competition in Nigeria," CEPR Discussion Papers 12523, C.E.P.R. Discussion Papers.
    2. Michael J. Weir & Thomas W. Sproul, 2019. "Identifying Drivers of Genetically Modified Seafood Demand: Evidence from a Choice Experiment," Sustainability, MDPI, vol. 11(14), pages 1-21, July.
    3. Byron Botha & Rulof Burger & Kevin Kotzé & Neil Rankin & Daan Steenkamp, 2023. "Big data forecasting of South African inflation," Empirical Economics, Springer, vol. 65(1), pages 149-188, July.
    4. Markku Maula & Wouter Stam, 2020. "Enhancing Rigor in Quantitative Entrepreneurship Research," Entrepreneurship Theory and Practice, vol. 44(6), pages 1059-1090, November.
    5. Michael C. Knaus & Michael Lechner & Anthony Strittmatter, 2022. "Heterogeneous Employment Effects of Job Search Programs: A Machine Learning Approach," Journal of Human Resources, University of Wisconsin Press, vol. 57(2), pages 597-636.
    6. Sophie-Charlotte Klose & Johannes Lederer, 2020. "A Pipeline for Variable Selection and False Discovery Rate Control With an Application in Labor Economics," Papers 2006.12296, arXiv.org, revised Jun 2020.
    7. Naguib, Costanza, 2019. "Estimating the Heterogeneous Impact of the Free Movement of Persons on Relative Wage Mobility," Economics Working Paper Series 1903, University of St. Gallen, School of Economics and Political Science.
    8. Erik Heilmann & Janosch Henze & Heike Wetzel, 2021. "Machine learning in energy forecasts with an application to high frequency electricity consumption data," MAGKS Papers on Economics 202135, Philipps-Universität Marburg, Faculty of Business Administration and Economics, Department of Economics (Volkswirtschaftliche Abteilung).
    9. Giovanni Di Franco & Michele Santurro, 2021. "Machine learning, artificial neural networks and social research," Quality & Quantity: International Journal of Methodology, Springer, vol. 55(3), pages 1007-1025, June.
    10. de Lucio, Juan, 2021. "Estimación adelantada del crecimiento regional mediante redes neuronales LSTM," INVESTIGACIONES REGIONALES - Journal of REGIONAL RESEARCH, Asociación Española de Ciencia Regional, issue 49, pages 45-64.
    11. Filmer,Deon P. & Nahata,Vatsal & Sabarwal,Shwetlena, 2021. "Preparation, Practice, and Beliefs : A Machine Learning Approach to Understanding Teacher Effectiveness," Policy Research Working Paper Series 9847, The World Bank.
    12. Rubesam, Alexandre, 2022. "Machine learning portfolios with equal risk contributions: Evidence from the Brazilian market," Emerging Markets Review, Elsevier, vol. 51(PB).
    13. Chakraborty, Chiranjit & Joseph, Andreas, 2017. "Machine learning at central banks," Bank of England working papers 674, Bank of England.
    14. Francesco Decarolis & Cristina Giorgiantonio, 2020. "Corruption red flags in public procurement: new evidence from Italian calls for tenders," Questioni di Economia e Finanza (Occasional Papers) 544, Bank of Italy, Economic Research and International Relations Area.
    15. Chen, Ya & Tsionas, Mike G. & Zelenyuk, Valentin, 2021. "LASSO+DEA for small and big wide data," Omega, Elsevier, vol. 102(C).
    16. Andini, Monica & Boldrini, Michela & Ciani, Emanuele & de Blasio, Guido & D'Ignazio, Alessio & Paladini, Andrea, 2022. "Machine learning in the service of policy targeting: The case of public credit guarantees," Journal of Economic Behavior & Organization, Elsevier, vol. 198(C), pages 434-475.
    17. Emanuel Kohlscheen, 2022. "Quantifying the Role of Interest Rates, the Dollar and Covid in Oil Prices," Papers 2208.14254, arXiv.org, revised Oct 2022.
    18. Alpino, Matteo & Hauge, Karen Evelyn & Kotsadam, Andreas & Markussen, Simen, 2022. "Effects of dialogue meetings on sickness absence—Evidence from a large field experiment," Journal of Health Economics, Elsevier, vol. 83(C).
    19. Max Vilgalys, 2023. "A Machine Learning Approach to Measuring Climate Adaptation," Papers 2302.01236, arXiv.org.
    20. de Blasio, Guido & D'Ignazio, Alessio & Letta, Marco, 2022. "Gotham city. Predicting ‘corrupted’ municipalities with machine learning," Technological Forecasting and Social Change, Elsevier, vol. 184(C).
