IDEAS home Printed from https://ideas.repec.org/a/kap/compec/v66y2025i3d10.1007_s10614-024-10798-9.html
   My bibliography  Save this article

Random Forests with Economic Roots: Explaining Machine Learning in Hedonic Imputation

Author

Listed:
  • Shipei Zeng

    (Shenzhen Research Institute of Big Data)

  • Deyu Rao

    (Hong Kong University of Science and Technology)

Abstract

The increasing integration of machine learning techniques in hedonic imputation has significantly improved predictive capabilities for both price prediction and price index computation. Nevertheless, the lack of transparency, particularly regarding model explainability, remains a notable challenge. This paper undertakes a comprehensive investigation through simulation and empirical analysis, focusing specifically on linear hedonic regression and tree-based machine learning methods, with the aim of elucidating the economic rationale behind machine learning predictions. Our comparative analysis highlights the superior performance of random forests over traditional hedonic regression models. This enhanced performance is critically dependent on assumptions about consumer utility functional forms: tree-based algorithms outperform linear hedonic models most significantly when assuming limited substitutability between product attributes, a characteristic that aligns with the algorithm’s design. The core contribution of this research is the development of a novel model explainability framework that bridges the gap between economics and machine learning. By shedding light on the economic rationale behind machine learning outcomes, we address the growing demand for interpretable machine learning approaches within the economics community. This framework not only enhances the transparency of machine learning models but also provides valuable insights for their application in real-world economic scenarios.

Suggested Citation

  • Shipei Zeng & Deyu Rao, 2025. "Random Forests with Economic Roots: Explaining Machine Learning in Hedonic Imputation," Computational Economics, Springer;Society for Computational Economics, vol. 66(3), pages 2457-2481, September.
  • Handle: RePEc:kap:compec:v:66:y:2025:i:3:d:10.1007_s10614-024-10798-9
    DOI: 10.1007/s10614-024-10798-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10614-024-10798-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10614-024-10798-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Robert J. Hill & Alicia N. Rambaldi & Michael Scholz, 2021. "Higher frequency hedonic property price indices: a state-space approach," Empirical Economics, Springer, vol. 61(1), pages 417-441, July.
    2. Silver, Mick & Heravi, Saeed, 2007. "The Difference Between Hedonic Imputation Indexes and Time Dummy Hedonic Indexes," Journal of Business & Economic Statistics, American Statistical Association, vol. 25, pages 239-246, April.
    3. Potrawa, Tomasz & Tetereva, Anastasija, 2022. "How much is the view from the window worth? Machine learning-driven hedonic pricing model of the real estate market," Journal of Business Research, Elsevier, vol. 144(C), pages 50-65.
    4. Yasushi Kondo & Myoung-jae Lee, 2003. "Hedonic price index estimation under mean-independence of time dummies from quality characteristics," Econometrics Journal, Royal Economic Society, vol. 6(1), pages 28-45, June.
    5. Kevin Fox, 2000. "Information-rich expressions for model selection criteria," Applied Economics Letters, Taylor & Francis Journals, vol. 7(1), pages 59-62.
    6. Felix Lorenz & Jonas Willwersch & Marcelo Cajias & Franz Fuerst, 2023. "Interpretable machine learning for real estate market analysis," Real Estate Economics, American Real Estate and Urban Economics Association, vol. 51(5), pages 1178-1208, September.
    7. Robert J. Hill & Daniel Melser, 2008. "Hedonic Imputation And The Price Index Problem: An Application To Housing," Economic Inquiry, Western Economic Association International, vol. 46(4), pages 593-609, October.
    8. Giudici, Paolo & Gramegna, Alex & Raffinetti, Emanuela, 2023. "Machine Learning Classification Model Comparison," Socio-Economic Planning Sciences, Elsevier, vol. 87(PB).
    9. Robert J. Hill & Michael Scholz, 2018. "Can Geospatial Data Improve House Price Indexes? A Hedonic Imputation Approach with Splines," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 64(4), pages 737-756, December.
    10. Alicia N. Rambaldi & Cameron S. Fletcher, 2014. "Hedonic Imputed Property Price Indexes: The Effects of Econometric Modeling Choices," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 60(S2), pages 423-448, November.
    11. Miller, Steve & Startz, Richard, 2019. "Feasible generalized least squares using support vector regression," Economics Letters, Elsevier, vol. 175(C), pages 28-31.
    12. Patrick Bajari & Denis Nekipelov & Stephen P. Ryan & Miaoyu Yang, 2015. "Machine Learning Methods for Demand Estimation," American Economic Review, American Economic Association, vol. 105(5), pages 481-485, May.
    13. repec:bla:revinw:v:60:y:2014:i::p:s423-s448 is not listed on IDEAS
    14. Amemiya, Takeshi, 1980. "Selection of Regressors," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 21(2), pages 331-354, June.
    15. Hu, Lirong & He, Shenjing & Han, Zixuan & Xiao, He & Su, Shiliang & Weng, Min & Cai, Zhongliang, 2019. "Monitoring housing rental prices based on social media:An integrated approach of machine-learning algorithms and hedonic modeling to inform equitable housing policies," Land Use Policy, Elsevier, vol. 82(C), pages 657-673.
    16. W. Erwin Diewert & Kevin J. Fox, 2022. "Substitution Bias in Multilateral Methods for CPI Construction," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(1), pages 355-369, January.
    17. Hill Robert J. & Scholz Michael & Shimizu Chihiro & Steurer Miriam, 2022. "Rolling-Time-Dummy House Price Indexes: Window Length, Linking and Options for Dealing with Low Transaction Volume," Journal of Official Statistics, Sciendo, vol. 38(1), pages 127-151, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Robert J. Hill & Alicia N. Rambaldi & Michael Scholz, 2021. "Higher frequency hedonic property price indices: a state-space approach," Empirical Economics, Springer, vol. 61(1), pages 417-441, July.
    2. Robert J. Hill & Alicia N. Rambaldi, 2022. "Hedonic Models and House Price Index Numbers," Springer Books, in: Duangkamon Chotikapanich & Alicia N. Rambaldi & Nicholas Rohde (ed.), Advances in Economic Measurement, chapter 0, pages 413-444, Springer.
    3. Mick Silver, 2016. "How to Better Measure Hedonic Residential Property Price Indexes," IMF Working Papers 2016/213, International Monetary Fund.
    4. Iqbal A. Syed & Jan De Haan, 2017. "Age, Time, Vintage, And Price Indexes: Measuring The Depreciation Pattern Of Houses," Economic Inquiry, Western Economic Association International, vol. 55(1), pages 580-600, January.
    5. Silver Mick, 2022. "Econometric Issues in Hedonic Property Price Indices: Some Practical Help," Journal of Official Statistics, Sciendo, vol. 38(1), pages 153-186, March.
    6. Daniel Melser & Robert J. Hill, 2019. "Residential Real Estate, Risk, Return and Diversification: Some Empirical Evidence," The Journal of Real Estate Finance and Economics, Springer, vol. 59(1), pages 111-146, July.
    7. repec:grz:wpaper:2014-05 is not listed on IDEAS
    8. Hill, Robert J. & Trojanek, Radoslaw, 2022. "An evaluation of competing methods for constructing house price indexes: The case of Warsaw," Land Use Policy, Elsevier, vol. 120(C).
    9. Robert J. Hill & Norbert Pfeifer & Miriam Steurer & Radoslaw Trojanek, 2024. "Warning: Some transaction prices can be detrimental to your house price index," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 70(2), pages 320-344, June.
    10. Ryan Greenaway-McGrevy & Kade Sorensen, 2021. "A spatial model averaging approach to measuring house prices," Journal of Spatial Econometrics, Springer, vol. 2(1), pages 1-32, December.
    11. Robert J. Hill & Miriam Steurer & Sofie R. Waltl, 2025. "Owner‐Occupied Housing, Inflation, and Monetary Policy," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 57(2-3), pages 583-614, March.
    12. Robert J. Hill & Norbert Pfeifer & Miriam Steurer, 2020. "The Airbnb Rent-Premium and the Crowding-Out of Long-Term Rentals," Graz Economics Papers 2020-06, University of Graz, Department of Economics.
    13. Esmeralda A. Ramalho & Joaquim J.S. Ramalho, 2014. "Convenient links for the estimation of hedonic price indexes: the case of unique, infrequently traded assets," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 68(2), pages 91-117, May.
    14. Melser, Daniel, 2017. "Disaggregated property price appreciation: The mixed repeat sales model," Regional Science and Urban Economics, Elsevier, vol. 66(C), pages 108-118.
    15. Robert J. Hill & Daniel Melser, 2007. "Comparing House Prices Across Regions and Time: An Hedonic Approach," Discussion Papers 2007-33, School of Economics, The University of New South Wales.
    16. Alicia N. Rambaldi & D.S. Prasada Rao, 2013. "Econometric Modeling and Estimation of Theoretically Consistent Housing Price Indexes," CEPA Working Papers Series WP042013, School of Economics, University of Queensland, Australia.
    17. Alicia N. Rambaldi & Cameron S. Fletcher, 2014. "Hedonic Imputed Property Price Indexes: The Effects of Econometric Modeling Choices," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 60(S2), pages 423-448, November.
    18. Apostolos Ampountolas & Titus Nyarko Nde & Paresh Date & Corina Constantinescu, 2021. "A Machine Learning Approach for Micro-Credit Scoring," Risks, MDPI, vol. 9(3), pages 1-20, March.
    19. Juergen Deppner & Marcelo Cajias, 2024. "Accounting for Spatial Autocorrelation in Algorithm-Driven Hedonic Models: A Spatial Cross-Validation Approach," The Journal of Real Estate Finance and Economics, Springer, vol. 68(2), pages 235-273, February.
    20. Trojanek, Radoslaw & Gluszak, Michal, 2022. "Short-run impact of the Ukrainian refugee crisis on the housing market in Poland," Finance Research Letters, Elsevier, vol. 50(C).
    21. David Genesove & Christopher Mayer, 2001. "Loss Aversion and Seller Behavior: Evidence from the Housing Market," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 116(4), pages 1233-1260.

    More about this item

    Keywords

    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:compec:v:66:y:2025:i:3:d:10.1007_s10614-024-10798-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.