
Opening the Black Box: Bootstrapping Sensitivity Measures in Neural Networks for Interpretable Machine Learning

Author

Listed:
  • Michele La Rocca

    (Department of Economics and Statistics, University of Salerno, 84084 Fisciano, Italy)

  • Cira Perna

    (Department of Economics and Statistics, University of Salerno, 84084 Fisciano, Italy)

Abstract

Artificial neural networks are powerful tools for data analysis, particularly in the context of highly nonlinear regression models. However, their usefulness is severely limited by their black-box nature, which makes the fitted model difficult to interpret. To partially address this issue, the paper focuses on the important problem of feature selection. It proposes and discusses a statistical test procedure for selecting the set of input variables that are relevant to the model, while taking into account the multiple-testing nature of the problem. The approach falls within the general framework of sensitivity analysis and uses the conditional expectation of functions of the partial derivatives of the output with respect to the inputs as a sensitivity measure. The proposed procedure makes extensive use of the bootstrap to approximate the distribution of the test statistic under the null, while controlling the familywise error rate to correct for the data snooping that arises from multiple testing. In particular, a pair bootstrap scheme is implemented in order to obtain consistent results under misspecified statistical models, a typical characteristic of neural networks. Numerical examples and a Monte Carlo simulation were carried out to verify the ability of the proposed test procedure to correctly identify the set of relevant features.
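The main ingredients of the procedure can be illustrated with a minimal numpy sketch. This is not the authors' implementation: it fits a one-hidden-layer tanh network by plain gradient descent, uses the average squared partial derivative of the output with respect to each input as the sensitivity measure, and applies a pair bootstrap (resampling whole (x, y) pairs and refitting) to gauge the measure's sampling variability. The studentized test statistics and the stepdown rule that control the familywise error rate in the paper are omitted here for brevity; the simulated data, network size, and learning rate are arbitrary choices for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulated regression: y depends on x1 and x2 only; x3 is irrelevant.
n, p = 300, 3
X = rng.uniform(-1, 1, size=(n, p))
y = np.sin(np.pi * X[:, 0]) + X[:, 1] ** 2 + 0.1 * rng.standard_normal(n)

H = 8  # number of hidden units

def fit_net(X, y, epochs=2000, lr=0.05):
    """Train a one-hidden-layer tanh network by full-batch gradient descent."""
    n, p = X.shape
    r = np.random.default_rng(1)
    W1 = r.normal(scale=0.5, size=(H, p)); b1 = np.zeros(H)
    w2 = r.normal(scale=0.5, size=H);      b2 = 0.0
    for _ in range(epochs):
        A = np.tanh(X @ W1.T + b1)              # hidden activations, (n, H)
        e = A @ w2 + b2 - y                     # residuals, (n,)
        dZ = np.outer(e, w2) * (1 - A**2) / n   # backprop through tanh, (n, H)
        w2 -= lr * (A.T @ e / n); b2 -= lr * e.mean()
        W1 -= lr * (dZ.T @ X);    b1 -= lr * dZ.sum(axis=0)
    return W1, b1, w2, b2

def sensitivity(X, params):
    """Average squared partial derivative of the network output wrt each input."""
    W1, b1, w2, b2 = params
    A = np.tanh(X @ W1.T + b1)
    # d f / d x_j = sum_h w2_h * (1 - A_h^2) * W1_{hj}
    D = (w2 * (1 - A**2)) @ W1                  # derivatives, (n, p)
    return (D ** 2).mean(axis=0)

params = fit_net(X, y)
s_hat = sensitivity(X, params)

# Pair bootstrap: resample (x_i, y_i) pairs jointly, refit, recompute the measure.
B = 20  # small for speed; the paper uses far more replicates
boot = np.empty((B, p))
for b in range(B):
    idx = rng.integers(0, n, size=n)
    pb = fit_net(X[idx], y[idx], epochs=500)
    boot[b] = sensitivity(X[idx], pb)

se = boot.std(axis=0, ddof=1)                   # bootstrap standard errors
print("sensitivity:", np.round(s_hat, 3))
print("bootstrap s.e.:", np.round(se, 3))
```

Because pairs are resampled jointly, the scheme does not assume the network is a correctly specified model for the regression function, which is the reason the paper adopts it over residual-based resampling.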

Suggested Citation

  • Michele La Rocca & Cira Perna, 2022. "Opening the Black Box: Bootstrapping Sensitivity Measures in Neural Networks for Interpretable Machine Learning," Stats, MDPI, vol. 5(2), pages 1-18, April.
  • Handle: RePEc:gam:jstats:v:5:y:2022:i:2:p:26-457:d:801953

    Download full text from publisher

    File URL: https://www.mdpi.com/2571-905X/5/2/26/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2571-905X/5/2/26/
    Download Restriction: no

    References listed on IDEAS

    1. Joseph P. Romano & Michael Wolf, 2005. "Stepwise Multiple Testing as Formalized Data Snooping," Econometrica, Econometric Society, vol. 73(4), pages 1237-1282, July.
    2. La Rocca, Michele & Perna, Cira, 2005. "Variable selection in neural network regression models with dependent data: a subsampling approach," Computational Statistics & Data Analysis, Elsevier, vol. 48(2), pages 415-429, February.
    3. Joseph P. Romano & Michael Wolf, 2005. "Exact and Approximate Stepdown Methods for Multiple Hypothesis Testing," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 94-108, March.
    4. Goncalves, Silvia & Kilian, Lutz, 2004. "Bootstrapping autoregressions with conditional heteroskedasticity of unknown form," Journal of Econometrics, Elsevier, vol. 123(1), pages 89-120, November.
    5. Romano, Joseph P. & Shaikh, Azeem M. & Wolf, Michael, 2008. "Formalized Data Snooping Based On Generalized Error Rates," Econometric Theory, Cambridge University Press, vol. 24(2), pages 404-447, April.
    6. Halbert White, 2000. "A Reality Check for Data Snooping," Econometrica, Econometric Society, vol. 68(5), pages 1097-1126, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zeng-Hua Lu, 2019. "Extended MinP Tests of Multiple Hypotheses," Papers 1911.04696, arXiv.org.
    2. Adriano Koshiyama & Nick Firoozye, 2019. "Avoiding Backtesting Overfitting by Covariance-Penalties: an empirical investigation of the ordinary and total least squares cases," Papers 1905.05023, arXiv.org.
    3. Christopher J. Bennett, 2009. "p-Value Adjustments for Asymptotic Control of the Generalized Familywise Error Rate," Vanderbilt University Department of Economics Working Papers 0905, Vanderbilt University Department of Economics.
    4. Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-dimensional econometrics and regularized GMM," CeMMAP working papers CWP35/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    5. Bajgrowicz, Pierre & Scaillet, Olivier, 2012. "Technical trading revisited: False discoveries, persistence tests, and transaction costs," Journal of Financial Economics, Elsevier, vol. 106(3), pages 473-491.
6. Alvaro Escribano & Genaro Sucarrat, 2011. "Automated model selection in finance: General-to-specific modelling of the mean and volatility specifications," Working Papers 2011-09, Instituto Madrileño de Estudios Avanzados (IMDEA) Ciencias Sociales.
    7. Romano, Joseph P. & Shaikh, Azeem M. & Wolf, Michael, 2008. "Formalized Data Snooping Based On Generalized Error Rates," Econometric Theory, Cambridge University Press, vol. 24(2), pages 404-447, April.
    8. John A. List & Azeem M. Shaikh & Yang Xu, 2019. "Multiple hypothesis testing in experimental economics," Experimental Economics, Springer;Economic Science Association, vol. 22(4), pages 773-793, December.
    9. Georgios Sermpinis & Arman Hassanniakalager & Charalampos Stasinakis & Ioannis Psaradellis, 2018. "Technical Analysis and Discrete False Discovery Rate: Evidence from MSCI Indices," Papers 1811.06766, arXiv.org, revised Jun 2019.
    10. Gabriel Frahm & Tobias Wickern & Christof Wiechers, 2012. "Multiple tests for the performance of different investment strategies," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 96(3), pages 343-383, July.
    11. Joseph P. Romano & Azeem M. Shaikh & Michael Wolf, 2010. "Hypothesis Testing in Econometrics," Annual Review of Economics, Annual Reviews, vol. 2(1), pages 75-104, September.
    12. Kuang, P. & Schröder, M. & Wang, Q., 2014. "Illusory profitability of technical analysis in emerging foreign exchange markets," International Journal of Forecasting, Elsevier, vol. 30(2), pages 192-205.
    13. Nik Tuzov & Frederi Viens, 2011. "Mutual fund performance: false discoveries, bias, and power," Annals of Finance, Springer, vol. 7(2), pages 137-169, May.
    14. Smeekes, S., 2011. "Bootstrap sequential tests to determine the stationary units in a panel," Research Memorandum 003, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
    15. Sucarrat, Genaro & Escribano, Álvaro, 2009. "Automated financial multi-path GETS modelling," UC3M Working papers. Economics we093620, Universidad Carlos III de Madrid. Departamento de Economía.
    16. Romano, Joseph P. & Wolf, Michael, 2016. "Efficient computation of adjusted p-values for resampling-based stepdown multiple testing," Statistics & Probability Letters, Elsevier, vol. 113(C), pages 38-40.
    17. Hassanniakalager, Arman & Sermpinis, Georgios & Stasinakis, Charalampos, 2021. "Trading the foreign exchange market with technical analysis and Bayesian Statistics," Journal of Empirical Finance, Elsevier, vol. 63(C), pages 230-251.
    18. Stephen A. Gorman & Frank J. Fabozzi, 2021. "The ABC’s of the alternative risk premium: academic roots," Journal of Asset Management, Palgrave Macmillan, vol. 22(6), pages 405-436, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jstats:v:5:y:2022:i:2:p:26-457:d:801953. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to register here. This allows you to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form.

    If you know of missing items citing this one, you can help us create those links by adding the relevant references in the same way as above for each referring item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.