pystacked: Stacking generalization and machine learning in Stata

Author

Achim Ahrens
Christian B. Hansen
Mark E. Schaffer

Abstract

pystacked implements stacked generalization (Wolpert, 1992) for regression and binary classification via Python's scikit-learn. Stacking combines multiple supervised machine learners (the "base" or "level-0" learners) into a single learner. The currently supported base learners include regularized regression, random forest, gradient boosted trees, support vector machines, and feed-forward neural nets (multi-layer perceptron). pystacked can also be used as a "regular" machine learning program to fit a single base learner and thus provides an easy-to-use API for scikit-learn's machine learning algorithms.
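The stacking idea described in the abstract can be sketched directly in scikit-learn. The example below is an illustration only, not pystacked's own code or syntax: the synthetic data, the choice of three base learners, their settings, and the non-negative least-squares final learner are all assumptions made for this sketch.

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import (
    GradientBoostingRegressor,
    RandomForestRegressor,
    StackingRegressor,
)
from sklearn.linear_model import LassoCV, LinearRegression
from sklearn.model_selection import train_test_split

# Synthetic regression data (assumption: stands in for a real dataset).
X, y = make_regression(n_samples=300, n_features=10, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Level-0 ("base") learners: regularized regression, random forest,
# and gradient boosted trees.
base_learners = [
    ("lasso", LassoCV(cv=5)),
    ("rf", RandomForestRegressor(n_estimators=50, random_state=0)),
    ("gb", GradientBoostingRegressor(n_estimators=50, random_state=0)),
]

# Final learner: least squares with non-negative coefficients, fitted on the
# cross-validated predictions of the base learners (assumed choice for this
# sketch).
stack = StackingRegressor(
    estimators=base_learners,
    final_estimator=LinearRegression(positive=True, fit_intercept=False),
    cv=5,
)
stack.fit(X_train, y_train)

# One stacking weight per base learner, plus out-of-sample fit.
weights = stack.final_estimator_.coef_
r2 = stack.score(X_test, y_test)
for (name, _), w in zip(base_learners, weights):
    print(f"{name}: weight = {w:.3f}")
print(f"out-of-sample R^2 = {r2:.3f}")
```

Passing a single entry in `base_learners` reduces the sketch to fitting one learner, which mirrors the "regular" single-learner use mentioned in the abstract.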

Suggested Citation

Achim Ahrens & Christian B. Hansen & Mark E. Schaffer, 2022. "pystacked: Stacking generalization and machine learning in Stata," Papers 2208.10896, arXiv.org, revised Mar 2023.

Handle: RePEc:arx:papers:2208.10896

Other versions of this item:

Achim Ahrens & Christian B. Hansen & Mark E. Schaffer, 2023. "pystacked: Stacking generalization and machine learning in Stata," Stata Journal, StataCorp LLC, vol. 23(4), pages 909-931, December.

Christian B. Hansen & Mark E. Schaffer & Achim Ahrens, 2022. "pystacked: Stacking generalization and machine learning in Stata," Swiss Stata Conference 2022 01, Stata Users Group.

References listed on IDEAS

Achim Ahrens & Christian B. Hansen & Mark E. Schaffer, 2020. "lassopack: Model selection and prediction with regularized regression in Stata," Stata Journal, StataCorp LLC, vol. 20(1), pages 176-235, March.
- Ahrens, Achim & Hansen, Christian B. & Schaffer, Mark E, 2019. "lassopack: Model Selection and Prediction with Regularized Regression in Stata," IZA Discussion Papers 12081, Institute of Labor Economics (IZA).
- Achim Ahrens & Christian B. Hansen & Mark E. Schaffer, 2019. "lassopack: Model selection and prediction with regularized regression in Stata," Papers 1901.05397, arXiv.org.
Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2017. "Double/Debiased Machine Learning for Treatment and Structural Parameters," NBER Working Papers 23564, National Bureau of Economic Research, Inc.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey & James Robins, 2017. "Double/debiased machine learning for treatment and structural parameters," CeMMAP working papers CWP28/17, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey & James Robins, 2017. "Double/debiased machine learning for treatment and structural parameters," CeMMAP working papers 28/17, Institute for Fiscal Studies.
Vanya Van Belle & Ben Van Calster & Sabine Van Huffel & Johan A K Suykens & Paulo Lisboa, 2016. "Explaining Support Vector Machines: A Color Based Nomogram," PLOS ONE, Public Library of Science, vol. 11(10), pages 1-33, October.
Nick Guenther & Matthias Schonlau, 2016. "Support vector machines," Stata Journal, StataCorp LLC, vol. 16(4), pages 917-937, December.
Giovanni Cerulli, 2022. "Machine learning using Stata/Python," Stata Journal, StataCorp LLC, vol. 22(4), pages 772-810, December.
- Giovanni Cerulli, 2021. "Machine learning using Stata/Python," 2021 Stata Conference 25, Stata Users Group.
- Giovanni Cerulli, 2022. "Machine learning using Stata/Python," Italian Stata Users' Group Meetings 2022 02, Stata Users Group.
Susan Athey & Guido W. Imbens, 2019. "Machine Learning Methods That Economists Should Know About," Annual Review of Economics, Annual Reviews, vol. 11(1), pages 685-725, August.
- Athey, Susan & Imbens, Guido W., 2019. "Machine Learning Methods Economists Should Know About," Research Papers 3776, Stanford University, Graduate School of Business.
- Susan Athey & Guido Imbens, 2019. "Machine Learning Methods Economists Should Know About," Papers 1903.10075, arXiv.org.
Kelley Pace, R. & Barry, Ronald, 1997. "Sparse spatial autoregressions," Statistics & Probability Letters, Elsevier, vol. 33(3), pages 291-297, May.
Matthias Schonlau & Rosie Yuyan Zou, 2020. "The random forest algorithm for statistical learning," Stata Journal, StataCorp LLC, vol. 20(1), pages 3-29, March.

Citations

Citations are extracted by the CitEc Project.

Cited by:

Zuchuat, Jeremy & Lalive, Rafael & Osikominu, Aderonke & Pesaresi, Lorenzo & Zweimüller, Josef, 2023. "Duration Dependence in Finding a Job: Applications, Interviews, and Job Offers," IZA Discussion Papers 16602, Institute of Labor Economics (IZA).
- Zuchuat, Jeremy & Lalive, Rafael & Osikominu, Aderonke & Pesaresi, Lorenzo & Zweimüller, Josef, 2023. "Duration Dependence in Finding a Job: Applications, Interviews, and Job Offers," CEPR Discussion Papers 18600, C.E.P.R. Discussion Papers.
- Rafael Lalive & Aderonke Osikominu & Lorenzo Pesaresi & Jeremy Zuchuat & Josef Zweimueller, 2025. "Duration Dependence in Finding a Job: Applications, Interviews, and Job Offers," RFBerlin Discussion Paper Series 2515, ROCKWOOL Foundation Berlin (RFBerlin).
Nicolas Apfel & Holger Breinlich & Nick Green & Dennis Novy & J. M. C. Santos Silva & Tom Zylkin, 2025. "Out-of-sample gravity predictions and trade policy counterfactuals," Papers 2509.11271, arXiv.org, revised Sep 2025.
Achim Ahrens & Christian B. Hansen & Mark E. Schaffer & Thomas Wiemann, 2025. "Model Averaging and Double Machine Learning," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 40(3), pages 249-269, April.
- Ahrens, Achim & Hansen, Christian B. & Schaffer, Mark E & Wiemann, Thomas, 2024. "Model Averaging and Double Machine Learning," IZA Discussion Papers 16714, Institute of Labor Economics (IZA).
- Achim Ahrens & Christian B. Hansen & Mark E. Schaffer & Thomas Wiemann, 2024. "Model Averaging and Double Machine Learning," Papers 2401.01645, arXiv.org, revised Sep 2024.
Marcos Delprato, 2025. "Identifying the post-pandemic determinants of low performing students in Latin America through interpretable Machine Learning SHAP Values-Insights from PISA 2022," Papers 2509.24508, arXiv.org.
Bonaccolto-Töpfer, Marina & Satlukal, Sascha, 2024. "Gender differences in reservation wages: New evidence for Germany," Labour Economics, Elsevier, vol. 91(C).
Philipp Bach & Oliver Schacht & Victor Chernozhukov & Sven Klaassen & Martin Spindler, 2024. "Hyperparameter Tuning for Causal Inference with Double Machine Learning: A Simulation Study," Papers 2402.04674, arXiv.org.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Dario Sansone & Anna Zhu, 2023. "Using Machine Learning to Create an Early Warning System for Welfare Recipients," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 85(5), pages 959-992, October.
- Dario Sansone & Anna Zhu, 2020. "Using Machine Learning to Create an Early Warning System for Welfare Recipients," Papers 2011.12057, arXiv.org, revised May 2021.
- Sansone, Dario & Zhu, Anna, 2021. "Using Machine Learning to Create an Early Warning System for Welfare Recipients," IZA Discussion Papers 14377, Institute of Labor Economics (IZA).
Yu, Baojun & Li, Changming & Mirza, Nawazish & Umar, Muhammad, 2022. "Forecasting credit ratings of decarbonized firms: Comparative assessment of machine learning models," Technological Forecasting and Social Change, Elsevier, vol. 174(C).
Kyle Colangelo & Ying-Ying Lee, 2019. "Double debiased machine learning nonparametric inference with continuous treatments," CeMMAP working papers CWP72/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Combes, Pierre-Philippe & Gobillon, Laurent & Zylberberg, Yanos, 2022. "Urban economics in a historical perspective: Recovering data with machine learning," Regional Science and Urban Economics, Elsevier, vol. 94(C).
- Gobillon, Laurent & Combes, Pierre-Philippe & Zylberberg, Yanos, 2020. "Urban economics in a historical perspective: Recovering data with machine learning," CEPR Discussion Papers 15308, C.E.P.R. Discussion Papers.
- Pierre-Philippe Combes & Laurent Gobillon & Yanos Zylberberg, 2022. "Urban Economics in a Historical Perspective: Recovering Data with Machine Learning," PSE-Ecole d'économie de Paris (Postprint) halshs-03673240, HAL.
- Pierre-Philippe Combes & Laurent Gobillon & Yanos Zylberberg, 2021. "Urban economics in a historical perspective: Recovering data with machine learning," Working Papers halshs-03231786, HAL.
- Pierre-Philippe Combes & Laurent Gobillon & Yanos Zylberberg, 2022. "Urban Economics in a Historical Perspective: Recovering Data with Machine Learning," Post-Print halshs-03673240, HAL.
- Combes, Pierre-Philippe & Gobillon, Laurent & Zylberberg, Yanos, 2021. "Urban Economics in a Historical Perspective: Recovering Data with Machine Learning," IZA Discussion Papers 14392, Institute of Labor Economics (IZA).
- Pierre-Philippe Combes & Laurent Gobillon & Yanos Zylberberg, 2021. "Urban economics in a historical perspective: Recovering data with machine learning," PSE Working Papers halshs-03231786, HAL.
- Pierre-Philippe Combes & Laurent Gobillon & Yanos Zylberberg, 2022. "Urban Economics in a Historical Perspective: Recovering Data with Machine Learning," Sciences Po Economics Publications (main) halshs-03673240, HAL.
Kyle Colangelo & Ying-Ying Lee, 2019. "Double debiased machine learning nonparametric inference with continuous treatments," CeMMAP working papers CWP54/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Daniel Goller, 2023. "Analysing a built-in advantage in asymmetric darts contests using causal machine learning," Annals of Operations Research, Springer, vol. 325(1), pages 649-679, June.
- Goller, Daniel, 2020. "Analysing a built-in advantage in asymmetric darts contests using causal machine learning," Economics Working Paper Series 2013, University of St. Gallen, School of Economics and Political Science.
- Daniel Goller, 2020. "Analysing a built-in advantage in asymmetric darts contests using causal machine learning," Papers 2008.07165, arXiv.org.
Yiyi Huo & Yingying Fan & Fang Han, 2023. "On the adaptation of causal forests to manifold data," Papers 2311.16486, arXiv.org, revised Dec 2023.
Michael Lechner, 2023. "Causal Machine Learning and its use for public policy," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 159(1), pages 1-15, December.
Zhang, Han, 2021. "How Using Machine Learning Classification as a Variable in Regression Leads to Attenuation Bias and What to Do About It," SocArXiv 453jk, Center for Open Science.
Giacomo De Giorgi & Costanza Naguib, 2022. "Life after Default: Credit Hardship and its Effects," Diskussionsschriften dp2206, Universitaet Bern, Departement Volkswirtschaft.
Mark Kattenberg & Bas Scheer & Jurre Thiel, 2023. "Causal forests with fixed effects for treatment effect heterogeneity in difference-in-differences," CPB Discussion Paper 452, CPB Netherlands Bureau for Economic Policy Analysis.
Michael C Knaus, 2022. "Double machine learning-based programme evaluation under unconfoundedness [Econometric methods for program evaluation]," The Econometrics Journal, Royal Economic Society, vol. 25(3), pages 602-627.
- Knaus, Michael C., 2020. "Double Machine Learning based Program Evaluation under Unconfoundedness," Economics Working Paper Series 2004, University of St. Gallen, School of Economics and Political Science.
- Knaus, Michael C., 2020. "Double Machine Learning Based Program Evaluation under Unconfoundedness," IZA Discussion Papers 13051, Institute of Labor Economics (IZA).
- Michael C. Knaus, 2020. "Double Machine Learning based Program Evaluation under Unconfoundedness," Papers 2003.03191, arXiv.org, revised Jun 2022.
Kyle Colangelo & Ying-Ying Lee, 2020. "Double Debiased Machine Learning Nonparametric Inference with Continuous Treatments," Papers 2004.03036, arXiv.org, revised Sep 2023.
Aysegül Kayaoglu & Ghassan Baliki & Tilman Brück & Melodie Al Daccache & Dorothee Weiffen, 2023. "How to conduct impact evaluations in humanitarian and conflict settings," HiCN Working Papers 387, Households in Conflict Network.
Bas Bosma & Arjen Witteloostuijn, 2024. "Machine learning in international business," Journal of International Business Studies, Palgrave Macmillan;Academy of International Business, vol. 55(6), pages 676-702, August.
Huber, Martin & Meier, Jonas & Wallimann, Hannes, 2022. "Business analytics meets artificial intelligence: Assessing the demand effects of discounts on Swiss train tickets," Transportation Research Part B: Methodological, Elsevier, vol. 163(C), pages 22-39.
- Martin Huber & Jonas Meier & Hannes Wallimann, 2021. "Business analytics meets artificial intelligence: Assessing the demand effects of discounts on Swiss train tickets," Papers 2105.01426, arXiv.org, revised Jun 2022.
Goller, Daniel & Lechner, Michael & Moczall, Andreas & Wolff, Joachim, 2020. "Does the estimation of the propensity score by machine learning improve matching estimation? The case of Germany's programmes for long term unemployed," Labour Economics, Elsevier, vol. 65(C).
- Goller, Daniel & Lechner, Michael & Moczall, Andreas & Wolff, Joachim, 2019. "Does the estimation of the propensity score by machine learning improve matching estimation? The case of Germany’s programmes for long term unemployed," Economics Working Paper Series 1910, University of St. Gallen, School of Economics and Political Science.
- Goller, Daniel & Lechner, Michael & Moczall, Andreas & Wolff, Joachim, 2019. "Does the Estimation of the Propensity Score by Machine Learning Improve Matching Estimation? The Case of Germany's Programmes for Long Term Unemployed," IZA Discussion Papers 12526, Institute of Labor Economics (IZA).
- Goller, Daniel & Lechner, Michael & Moczall, Andreas & Wolff, Joachim, 2020. "Does the estimation of the propensity score by machine learning improve matching estimation? : The case of Germany's programmes for long term unemployed," IAB-Discussion Paper 202005, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].
Augustine Denteh & Helge Liebert, 2022. "Who Increases Emergency Department Use? New Insights from the Oregon Health Insurance Experiment," Papers 2201.07072, arXiv.org, revised Apr 2023.
- Augustine Denteh & Helge Liebert, 2022. "Who Increases Emergency Department Use? New Insights from the Oregon Health Insurance Experiment," CESifo Working Paper Series 9664, CESifo.
- Denteh, Augustine & Liebert, Helge, 2022. "Who Increases Emergency Department Use? New Insights from the Oregon Health Insurance Experiment," IZA Discussion Papers 15192, Institute of Labor Economics (IZA).
- Augustine Denteh & Helge Liebert, 2022. "Who Increases Emergency Department Use? New Insights from the Oregon Health Insurance Experiment," Working Papers 2201, Tulane University, Department of Economics.
Lamperti, Fabio, 2024. "Unlocking machine learning for social sciences: The case for identifying Industry 4.0 adoption across business restructuring events," Technological Forecasting and Social Change, Elsevier, vol. 207(C).
Falco J. Bargagli Stoffi & Kenneth De Beckker & Joana E. Maldonado & Kristof De Witte, 2021. "Assessing Sensitivity of Machine Learning Predictions. A Novel Toolbox with an Application to Financial Literacy," Papers 2102.04382, arXiv.org.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2022-09-19 (Big Data)
NEP-CMP-2022-09-19 (Computational Economics)
