IDEAS home Printed from https://ideas.repec.org/a/spr/jglopt/v60y2014i1p79-102.html
   My bibliography  Save this article

Restructuring forward step of MARS algorithm using a new knot selection procedure based on a mapping approach

Author

Listed:
  • Elcin Koc
  • Cem Iyigun

Abstract

In high dimensional data modeling, Multivariate Adaptive Regression Splines (MARS) is a popular nonparametric regression technique used to define the nonlinear relationship between a response variable and the predictors with the help of splines. MARS uses piecewise linear functions for local fit and apply an adaptive procedure to select the number and location of breaking points (called knots). The function estimation is basically generated via a two-stepwise procedure: forward selection and backward elimination. In the first step, a large number of local fits is obtained by selecting large number of knots via a lack-of-fit criteria; and in the latter one, the least contributing local fits or knots are removed. In conventional adaptive spline procedure, knots are selected from a set of all distinct data points that makes the forward selection procedure computationally expensive and leads to high local variance. To avoid this drawback, it is possible to restrict the knot points to a subset of data points. In this context, a new method is proposed for knot selection which bases on a mapping approach like self organizing maps. By this method, less but more representative data points are become eligible to be used as knots for function estimation in forward step of MARS. The proposed method is applied to many simulated and real datasets, and the results show that it proposes a time efficient forward step for the knot selection and model estimation without degrading the model accuracy and prediction performance. Copyright Springer Science+Business Media New York 2014

Suggested Citation

  • Elcin Koc & Cem Iyigun, 2014. "Restructuring forward step of MARS algorithm using a new knot selection procedure based on a mapping approach," Journal of Global Optimization, Springer, vol. 60(1), pages 79-102, September.
  • Handle: RePEc:spr:jglopt:v:60:y:2014:i:1:p:79-102
    DOI: 10.1007/s10898-013-0107-5
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s10898-013-0107-5
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s10898-013-0107-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Julia Tsai & Victoria Chen & M. Beck & Jining Chen, 2004. "Stochastic Dynamic Programming Formulation for a Wastewater Treatment Decision-Making Framework," Annals of Operations Research, Springer, vol. 132(1), pages 207-221, November.
    2. Lee, Tian-Shyug & Chiu, Chih-Chou & Chou, Yu-Chao & Lu, Chi-Jie, 2006. "Mining the customer credit using classification and regression tree and multivariate adaptive regression splines," Computational Statistics & Data Analysis, Elsevier, vol. 50(4), pages 1113-1130, February.
    3. Victoria C. P. Chen & David Ruppert & Christine A. Shoemaker, 1999. "Applying Experimental Design and Regression Splines to High-Dimensional Continuous-State Stochastic Dynamic Programming," Operations Research, INFORMS, vol. 47(1), pages 38-53, February.
    4. D. G. T. Denison & B. K. Mallick & A. F. M. Smith, 1998. "Automatic Bayesian curve fitting," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(2), pages 333-350.
    5. Pilla, Venkata L. & Rosenberger, Jay M. & Chen, Victoria & Engsuwan, Narakorn & Siddappa, Sheela, 2012. "A multivariate adaptive regression splines cutting plane approach for solving a two-stage stochastic programming fleet assignment model," European Journal of Operational Research, Elsevier, vol. 216(1), pages 162-171.
    6. Victoria C. P. Chen & Dirk Günther & Ellis L. Johnson, 2003. "Solving for an optimal airline yield management policy via statistical learning," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 52(1), pages 19-30, January.
    7. Wataru Sakamoto, 2007. "MARS: selecting basis functions and knots with an empirical Bayes method," Computational Statistics, Springer, vol. 22(4), pages 583-597, December.
    8. Wong, Chi-ming & Kohn, Robert, 1996. "A Bayesian approach to additive semiparametric regression," Journal of Econometrics, Elsevier, vol. 74(2), pages 209-235, October.
    9. Aldrin, Magne, 2006. "Improved predictions penalizing both slope and curvature in additive models," Computational Statistics & Data Analysis, Elsevier, vol. 50(2), pages 267-284, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. España, Victor J. & Aparicio, Juan & Barber, Xavier & Esteve, Miriam, 2024. "Estimating production functions through additive models based on regression splines," European Journal of Operational Research, Elsevier, vol. 312(2), pages 684-699.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dachuan Shih & Seoung Kim & Victoria Chen & Jay Rosenberger & Venkata Pilla, 2014. "Efficient computer experiment-based optimization through variable selection," Annals of Operations Research, Springer, vol. 216(1), pages 287-305, May.
    2. Zehua Yang & Victoria C. P. Chen & Michael E. Chang & Melanie L. Sattler & Aihong Wen, 2009. "A Decision-Making Framework for Ozone Pollution Control," Operations Research, INFORMS, vol. 57(2), pages 484-498, April.
    3. Bozağaç, Doruk & Batmaz, İnci & Oğuztüzün, Halit, 2016. "Dynamic simulation metamodeling using MARS: A case of radar simulation," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 124(C), pages 69-86.
    4. Pilla, Venkata L. & Rosenberger, Jay M. & Chen, Victoria & Engsuwan, Narakorn & Siddappa, Sheela, 2012. "A multivariate adaptive regression splines cutting plane approach for solving a two-stage stochastic programming fleet assignment model," European Journal of Operational Research, Elsevier, vol. 216(1), pages 162-171.
    5. Elcin Koc & Cem Iyigun & İnci Batmaz & Gerhard-Wilhelm Weber, 2014. "Efficient adaptive regression spline algorithms based on mapping approach with a case study on finance," Journal of Global Optimization, Springer, vol. 60(1), pages 103-120, September.
    6. Ariyajunya, Bancha & Chen, Ying & Chen, Victoria C.P. & Kim, Seoung Bum & Rosenberger, Jay, 2021. "Addressing state space multicollinearity in solving an ozone pollution dynamic control problem," European Journal of Operational Research, Elsevier, vol. 289(2), pages 683-695.
    7. Ayşe Özmen, 2023. "Sparse regression modeling for short- and long‐term natural gas demand prediction," Annals of Operations Research, Springer, vol. 322(2), pages 921-946, March.
    8. Huiyuan Fan & Prashant K. Tarun & Victoria C. P. Chen & Dachuan T. Shih & Jay M. Rosenberger & Seoung Bum Kim & Robert A. Horton, 2018. "Data-driven optimization for Dallas Fort Worth International Airport deicing activities," Annals of Operations Research, Springer, vol. 263(1), pages 361-384, April.
    9. Panagiotelis, Anastasios & Smith, Michael, 2008. "Bayesian identification, selection and estimation of semiparametric functions in high-dimensional additive models," Journal of Econometrics, Elsevier, vol. 143(2), pages 291-316, April.
    10. Zéphyr, Luckny & Lang, Pascal & Lamond, Bernard F. & Côté, Pascal, 2017. "Approximate stochastic dynamic programming for hydroelectric production planning," European Journal of Operational Research, Elsevier, vol. 262(2), pages 586-601.
    11. Jayne Lois San Juan & Carlo James Caligan & Maria Mikayla Garcia & Jericho Mitra & Andres Philip Mayol & Charlle Sy & Aristotle Ubando & Alvin Culaba, 2020. "Multi-Objective Optimization of an Integrated Algal and Sludge-Based Bioenergy Park and Wastewater Treatment System," Sustainability, MDPI, vol. 12(18), pages 1-22, September.
    12. Mauro Gaggero & Giorgio Gnecco & Marcello Sanguineti, 2013. "Dynamic Programming and Value-Function Approximation in Sequential Decision Problems: Error Analysis and Numerical Results," Journal of Optimization Theory and Applications, Springer, vol. 156(2), pages 380-416, February.
    13. Somayeh Moazeni & Warren B. Powell & Boris Defourny & Belgacem Bouzaiene-Ayari, 2017. "Parallel Nonstationary Direct Policy Search for Risk-Averse Stochastic Optimization," INFORMS Journal on Computing, INFORMS, vol. 29(2), pages 332-349, May.
    14. Chen, Ruoran & Deng, Tianhu & Huang, Simin & Qin, Ruwen, 2015. "Optimal crude oil procurement under fluctuating price in an oil refinery," European Journal of Operational Research, Elsevier, vol. 245(2), pages 438-445.
    15. Kuhlenkasper, Torben & Kauermann, Göran, 2010. "Female wage profiles: An additive mixed model approach to employment breaks due to childcare," HWWI Research Papers 2-18, Hamburg Institute of International Economics (HWWI).
    16. Ying Chen & Krystel K. Castillo-Villar & Bing Dong, 2021. "Stochastic control of a micro-grid using battery energy storage in solar-powered buildings," Annals of Operations Research, Springer, vol. 303(1), pages 197-216, August.
    17. Pena, Daniel & Redondas, Dolores, 2006. "Bayesian curve estimation by model averaging," Computational Statistics & Data Analysis, Elsevier, vol. 50(3), pages 688-709, February.
    18. Villani, Mattias & Kohn, Robert & Giordani, Paolo, 2009. "Regression density estimation using smooth adaptive Gaussian mixtures," Journal of Econometrics, Elsevier, vol. 153(2), pages 155-173, December.
    19. Diego Klabjan & Daniel Adelman, 2007. "An Infinite-Dimensional Linear Programming Algorithm for Deterministic Semi-Markov Decision Processes on Borel Spaces," Mathematics of Operations Research, INFORMS, vol. 32(3), pages 528-550, August.
    20. Adnan Dželihodžić & Dženana Đonko & Jasmin Kevrić, 2018. "Improved Credit Scoring Model Based on Bagging Neural Network," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 17(06), pages 1725-1741, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jglopt:v:60:y:2014:i:1:p:79-102. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.