IDEAS home Printed from https://ideas.repec.org/a/inm/ormnsc/v66y2020i4p1605-1626.html
   My bibliography  Save this article

Information Aggregation and P-Hacking

Author

Listed:
  • Oleg Rytchkov

    (Fox School of Business, Temple University, Philadelphia, Pennsylvania 19122)

  • Xun Zhong

    (Gabelli School of Business, Fordham University, New York, New York 10023)

Abstract

This paper studies the interplay between information aggregation and p-hacking in the context of predicting stock returns. The standard information-aggregation techniques exacerbate p-hacking by increasing the probability of the type I error. We propose an aggregation technique that is a simple modification of three-pass regression filter/partial least squares regression with an opposite property: the predictability tests applied to the combined predictor become more conservative in the presence of p-hacking. Using simulations, we quantify the advantages of our approach relative to the standard information-aggregation techniques. We also apply our aggregation technique to three sets of return predictors proposed in the literature and find that the forecasting ability of combined predictors in two cases cannot be explained by p-hacking.

Suggested Citation

  • Oleg Rytchkov & Xun Zhong, 2020. "Information Aggregation and P-Hacking," Management Science, INFORMS, vol. 66(4), pages 1605-1626, April.
  • Handle: RePEc:inm:ormnsc:v:66:y:2020:i:4:p:1605-1626
    DOI: 10.1287/mnsc.2018.3259
    as

    Download full text from publisher

    File URL: https://doi.org/10.1287/mnsc.2018.3259
    Download Restriction: no

    File URL: https://libkey.io/10.1287/mnsc.2018.3259?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Bajgrowicz, Pierre & Scaillet, Olivier, 2012. "Technical trading revisited: False discoveries, persistence tests, and transaction costs," Journal of Financial Economics, Elsevier, vol. 106(3), pages 473-491.
    2. Kelly, Bryan T. & Pruitt, Seth & Su, Yinan, 2019. "Characteristics are covariances: A unified model of risk and return," Journal of Financial Economics, Elsevier, vol. 134(3), pages 501-524.
    3. Ivo Welch & Amit Goyal, 2008. "A Comprehensive Look at The Empirical Performance of Equity Premium Prediction," Review of Financial Studies, Society for Financial Studies, vol. 21(4), pages 1455-1508, July.
    4. Joseph P. Romano & Michael Wolf, 2005. "Stepwise Multiple Testing as Formalized Data Snooping," Econometrica, Econometric Society, vol. 73(4), pages 1237-1282, July.
    5. Forni, Mario & Hallin, Marc & Lippi, Marco & Reichlin, Lucrezia, 2003. "Do financial variables help forecasting inflation and real activity in the euro area?," Journal of Monetary Economics, Elsevier, vol. 50(6), pages 1243-1255, September.
    6. Lewellen, Jonathan, 2015. "The Cross-section of Expected Stock Returns," Critical Finance Review, now publishers, vol. 4(1), pages 1-44, June.
    7. Malcolm Baker & Jeffrey Wurgler, 2000. "The Equity Share in New Issues and Aggregate Stock Returns," Journal of Finance, American Finance Association, vol. 55(5), pages 2219-2257, October.
    8. Ferson, Wayne E. & Sarkissian, Sergei & Simin, Timothy, 1999. "The alpha factor asset pricing model: A parable," Journal of Financial Markets, Elsevier, vol. 2(1), pages 49-68, February.
    9. Clark, Todd E. & McCracken, Michael W., 2001. "Tests of equal forecast accuracy and encompassing for nested models," Journal of Econometrics, Elsevier, vol. 105(1), pages 85-110, November.
    10. Juhani T Linnainmaa & Michael R Roberts, 2018. "The History of the Cross-Section of Stock Returns," Review of Financial Studies, Society for Financial Studies, vol. 31(7), pages 2606-2649.
    11. Hansen, Peter Reinhard, 2005. "A Test for Superior Predictive Ability," Journal of Business & Economic Statistics, American Statistical Association, vol. 23, pages 365-380, October.
    12. Ryan Sullivan & Allan Timmermann & Halbert White, 1999. "Data‐Snooping, Technical Trading Rule Performance, and the Bootstrap," Journal of Finance, American Finance Association, vol. 54(5), pages 1647-1691, October.
    13. Sullivan, Ryan & Timmermann, Allan & White, Halbert, 2001. "Dangers of data mining: The case of calendar effects in stock returns," Journal of Econometrics, Elsevier, vol. 105(1), pages 249-286, November.
    14. Chen, Long, 2009. "On the reversal of return and dividend growth predictability: A tale of two periods," Journal of Financial Economics, Elsevier, vol. 92(1), pages 128-151, April.
    15. Laurent Barras & Olivier Scaillet & Russ Wermers, 2010. "False Discoveries in Mutual Fund Performance: Measuring Luck in Estimated Alphas," Journal of Finance, American Finance Association, vol. 65(1), pages 179-216, February.
    16. Dashan Huang & Fuwei Jiang & Jun Tu & Guofu Zhou, 2015. "Investor Sentiment Aligned: A Powerful Predictor of Stock Returns," Review of Financial Studies, Society for Financial Studies, vol. 28(3), pages 791-837.
    17. Kozak, Serhiy & Nagel, Stefan & Santosh, Shrihari, 2020. "Shrinking the cross-section," Journal of Financial Economics, Elsevier, vol. 135(2), pages 271-292.
    18. Kothari, S. P. & Shanken, Jay, 1997. "Book-to-market, dividend yield, and expected market returns: A time-series analysis," Journal of Financial Economics, Elsevier, vol. 44(2), pages 169-203, May.
    19. Boivin, Jean & Ng, Serena, 2006. "Are more data always better for factor analysis?," Journal of Econometrics, Elsevier, vol. 132(1), pages 169-194, May.
    20. John H. Cochrane, 2011. "Presidential Address: Discount Rates," Journal of Finance, American Finance Association, vol. 66(4), pages 1047-1108, August.
    21. Stock, James H. & Watson, Mark W., 1999. "Forecasting inflation," Journal of Monetary Economics, Elsevier, vol. 44(2), pages 293-335, October.
    22. Wayne E. Ferson & Sergei Sarkissian & Timothy T. Simin, 2003. "Spurious Regressions in Financial Economics?," Journal of Finance, American Finance Association, vol. 58(4), pages 1393-1413, August.
    23. Eugene F. Fama & Kenneth R. French, 2008. "Dissecting Anomalies," Journal of Finance, American Finance Association, vol. 63(4), pages 1653-1678, August.
    24. Mark J Ready, 2002. "Profits from Technical Trading Rules," Financial Management, Financial Management Association, vol. 31(3), Fall.
    25. Lo, Andrew W & MacKinlay, A Craig, 1990. "Data-Snooping Biases in Tests of Financial Asset Pricing Models," Review of Financial Studies, Society for Financial Studies, vol. 3(3), pages 431-467.
    26. Campbell, John Y., 1987. "Stock returns and the term structure," Journal of Financial Economics, Elsevier, vol. 18(2), pages 373-399, June.
    27. Bai, Jushan & Ng, Serena, 2008. "Forecasting economic time series using targeted predictors," Journal of Econometrics, Elsevier, vol. 146(2), pages 304-317, October.
    28. Ludvigson, Sydney C. & Ng, Serena, 2007. "The empirical risk-return relation: A factor analysis approach," Journal of Financial Economics, Elsevier, vol. 83(1), pages 171-222, January.
    29. Lev, B & Thiagarajan, Sr, 1993. "Fundamental Information Analysis," Journal of Accounting Research, Wiley Blackwell, vol. 31(2), pages 190-215.
    30. Campbell R. Harvey, 2017. "Presidential Address: The Scientific Outlook in Financial Economics," Journal of Finance, American Finance Association, vol. 72(4), pages 1399-1440, August.
    31. Clark, Todd E. & West, Kenneth D., 2007. "Approximately normal tests for equal predictive accuracy in nested models," Journal of Econometrics, Elsevier, vol. 138(1), pages 291-311, May.
    32. Joachim Freyberger & Andreas Neuhierl & Michael Weber, 2020. "Dissecting Characteristics Nonparametrically," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2326-2377.
    33. Wayne E. Ferson & Sergei Sarkissian & Timothy T. Simin, 2003. "Spurious Regressions in Financial Economics?," Journal of Finance, American Finance Association, vol. 58(4), pages 1393-1414, August.
    34. Foster, F Douglas & Smith, Tom & Whaley, Robert E, 1997. "Assessing Goodness-of-Fit of Asset Pricing Models: The Distribution of the Maximal R-Squared," Journal of Finance, American Finance Association, vol. 52(2), pages 591-607, June.
    35. Christopher J. Neely & David E. Rapach & Jun Tu & Guofu Zhou, 2014. "Forecasting the Equity Risk Premium: The Role of Technical Indicators," Management Science, INFORMS, vol. 60(7), pages 1772-1791, July.
    36. Jeremiah Green & John R. M. Hand & X. Frank Zhang, 2017. "The Characteristics that Provide Independent Information about Average U.S. Monthly Stock Returns," Review of Financial Studies, Society for Financial Studies, vol. 30(12), pages 4389-4436.
    37. Xuemin (Sterling) Yan & Lingling Zheng, 2017. "Fundamental Analysis and the Cross-Section of Stock Returns: A Data-Mining Approach," Review of Financial Studies, Society for Financial Studies, vol. 30(4), pages 1382-1423.
    38. Kelly, Bryan & Pruitt, Seth, 2015. "The three-pass regression filter: A new approach to forecasting using many predictors," Journal of Econometrics, Elsevier, vol. 186(2), pages 294-316.
    39. Louis K. C. Chan & Stephen G. Dimmock & Josef Lakonishok, 2009. "Benchmarking Money Manager Performance: Issues and Evidence," The Review of Financial Studies, Society for Financial Studies, vol. 22(11), pages 4553-4599, November.
    40. Piotroski, JD, 2000. "Value investing: The use of historical financial statement information to separate winners from losers," Journal of Accounting Research, Wiley Blackwell, vol. 38, pages 1-41.
    41. Bryan Kelly & Seth Pruitt, 2013. "Market Expectations in the Cross-Section of Present Values," Journal of Finance, American Finance Association, vol. 68(5), pages 1721-1756, October.
    42. Andreas Neuhierl & Bernd Schlusche, 2011. "Data Snooping and Market-Timing Rule Performance," Journal of Financial Econometrics, Oxford University Press, vol. 9(3), pages 550-587, Summer.
    43. Fama, Eugene F. & French, Kenneth R., 1989. "Business conditions and expected returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 25(1), pages 23-49, November.
    44. Han, Yufeng & Zhou, Guofu & Zhu, Yingzi, 2016. "A trend factor: Any economic gains from using information over investment horizons?," Journal of Financial Economics, Elsevier, vol. 122(2), pages 352-375.
    45. Ou, Jane A. & Penman, Stephen H., 1989. "Financial statement analysis and the prediction of stock returns," Journal of Accounting and Economics, Elsevier, vol. 11(4), pages 295-329, November.
    46. Hanna, J. Douglas & Ready, Mark J., 2005. "Profitable predictability in the cross section of stock returns," Journal of Financial Economics, Elsevier, vol. 78(3), pages 463-505, December.
    47. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    48. Nathaniel Light & Denys Maslov & Oleg Rytchkov, 2017. "Aggregation of Information About the Cross Section of Stock Returns: A Latent Variable Approach," Review of Financial Studies, Society for Financial Studies, vol. 30(4), pages 1339-1381.
    49. Robert F. Stambaugh & Jianfeng Yu & Yu Yuan, 2015. "Arbitrage Asymmetry and the Idiosyncratic Volatility Puzzle," Journal of Finance, American Finance Association, vol. 70(5), pages 1903-1948, October.
    50. Cao, Jie & Han, Bing, 2016. "Idiosyncratic risk, costly arbitrage, and the cross-section of stock returns," Journal of Banking & Finance, Elsevier, vol. 73(C), pages 1-15.
    51. Rapach, David & Zhou, Guofu, 2013. "Forecasting Stock Returns," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 328-383, Elsevier.
    52. David E. Rapach & Jack K. Strauss & Guofu Zhou, 2010. "Out-of-Sample Equity Premium Prediction: Combination Forecasts and Links to the Real Economy," Review of Financial Studies, Society for Financial Studies, vol. 23(2), pages 821-862, February.
    53. G. Elliott & C. Granger & A. Timmermann (ed.), 2013. "Handbook of Economic Forecasting," Handbook of Economic Forecasting, Elsevier, edition 1, volume 2, number 2.
    54. Pontiff, Jeffrey & Schall, Lawrence D., 1998. "Book-to-market ratios as predictors of market returns," Journal of Financial Economics, Elsevier, vol. 49(2), pages 141-160, August.
    55. Valentin Haddad & Serhiy Kozak & Shrihari Santosh, 2017. "Predicting Relative Returns," NBER Working Papers 23886, National Bureau of Economic Research, Inc.
    56. Sean P. Grover & Michael W. McCracken, 2014. "Factor-based prediction of industry-wide bank stress," Review, Federal Reserve Bank of St. Louis, vol. 96(2), pages 173-194.
    57. Megan L Head & Luke Holman & Rob Lanfear & Andrew T Kahn & Michael D Jennions, 2015. "The Extent and Consequences of P-Hacking in Science," PLOS Biology, Public Library of Science, vol. 13(3), pages 1-15, March.
    58. G. Elliott & C. Granger & A. Timmermann (ed.), 2006. "Handbook of Economic Forecasting," Handbook of Economic Forecasting, Elsevier, edition 1, volume 1, number 1.
    59. Halbert White, 2000. "A Reality Check for Data Snooping," Econometrica, Econometric Society, vol. 68(5), pages 1097-1126, September.
    60. Dichtl, Hubert & Drobetz, Wolfgang & Neuhierl, Andreas & Wendt, Viktoria-Sophie, 2021. "Data snooping in equity premium prediction," International Journal of Forecasting, Elsevier, vol. 37(1), pages 72-94.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Guillaume Coqueret, 2023. "Forking paths in financial economics," Papers 2401.08606, arXiv.org.
    2. Cakici, Nusret & Zaremba, Adam & Bianchi, Robert J. & Pham, Nga, 2021. "False discoveries in the anomaly research: New insights from the Stock Exchange of Melbourne (1927–1987)," Pacific-Basin Finance Journal, Elsevier, vol. 70(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rapach, David & Zhou, Guofu, 2013. "Forecasting Stock Returns," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 328-383, Elsevier.
    2. Christopher J. Neely & David E. Rapach & Jun Tu & Guofu Zhou, 2014. "Forecasting the Equity Risk Premium: The Role of Technical Indicators," Management Science, INFORMS, vol. 60(7), pages 1772-1791, July.
    3. Dichtl, Hubert & Drobetz, Wolfgang & Neuhierl, Andreas & Wendt, Viktoria-Sophie, 2021. "Data snooping in equity premium prediction," International Journal of Forecasting, Elsevier, vol. 37(1), pages 72-94.
    4. Chinco, Alex & Neuhierl, Andreas & Weber, Michael, 2021. "Estimating the anomaly base rate," Journal of Financial Economics, Elsevier, vol. 140(1), pages 101-126.
    5. Lin, Qi, 2018. "Technical analysis and stock return predictability: An aligned approach," Journal of Financial Markets, Elsevier, vol. 38(C), pages 103-123.
    6. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    7. Xi Dong & Yan Li & David E. Rapach & Guofu Zhou, 2022. "Anomalies and the Expected Market Return," Journal of Finance, American Finance Association, vol. 77(1), pages 639-681, February.
    8. Jiang, Fuwei & Lee, Joshua & Martin, Xiumin & Zhou, Guofu, 2019. "Manager sentiment and stock returns," Journal of Financial Economics, Elsevier, vol. 132(1), pages 126-149.
    9. Söhnke M. Bartram & Harald Lohre & Peter F. Pope & Ananthalakshmi Ranganathan, 2021. "Navigating the factor zoo around the world: an institutional investor perspective," Journal of Business Economics, Springer, vol. 91(5), pages 655-703, July.
    10. Stephen A. Gorman & Frank J. Fabozzi, 2021. "The ABC’s of the alternative risk premium: academic roots," Journal of Asset Management, Palgrave Macmillan, vol. 22(6), pages 405-436, October.
    11. Gonçalo Faria & Fabio Verona, 2016. "Forecasting the equity risk premium with frequency-decomposed predictors," Working Papers de Economia (Economics Working Papers) 06, Católica Porto Business School, Universidade Católica Portuguesa.
    12. Gonçalo Faria & Fabio Verona, 2016. "Forecasting the equity risk premium with frequency-decomposed predictors," Working Papers de Economia (Economics Working Papers) 06, Católica Porto Business School, Universidade Católica Portuguesa.
    13. Mykola Babiak & Jozef Barunik, 2020. "Deep Learning, Predictability, and Optimal Portfolio Returns," CERGE-EI Working Papers wp677, The Center for Economic Research and Graduate Education - Economics Institute, Prague.
    14. repec:zbw:bofrdp:2017_001 is not listed on IDEAS
    15. Lin, Qi & Lin, Xi, 2021. "Cash conversion cycle and aggregate stock returns," Journal of Financial Markets, Elsevier, vol. 52(C).
    16. Dominik Wolff & Ulrich Neugebauer, 2019. "Tree-based machine learning approaches for equity market predictions," Journal of Asset Management, Palgrave Macmillan, vol. 20(4), pages 273-288, July.
    17. Gupta, Rangan & Hammoudeh, Shawkat & Modise, Mampho P. & Nguyen, Duc Khuong, 2014. "Can economic uncertainty, financial stress and consumer sentiments predict U.S. equity premium?," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 33(C), pages 367-378.
    18. Hoang, Khoa & Cannavan, Damien & Huang, Ronghong & Peng, Xiaowen, 2021. "Predicting stock returns with implied cost of capital: A partial least squares approach," Journal of Financial Markets, Elsevier, vol. 53(C).
    19. Guofu Zhou, 2018. "Measuring Investor Sentiment," Annual Review of Financial Economics, Annual Reviews, vol. 10(1), pages 239-259, November.
    20. Tsiakas, Ilias & Li, Jiahan & Zhang, Haibin, 2020. "Equity premium prediction and the state of the economy," Journal of Empirical Finance, Elsevier, vol. 58(C), pages 75-95.
    21. Yu-Chin Hsu & Hsiou-Wei Lin & Kendro Vincent, 2017. "Do Cross-Sectional Stock Return Predictors Pass the Test without Data-Snooping Bias?," IEAS Working Paper : academic research 17-A003, Institute of Economics, Academia Sinica, Taipei, Taiwan.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormnsc:v:66:y:2020:i:4:p:1605-1626. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.