IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v52y2008i7p3459-3467.html
   My bibliography  Save this article

A geometric interpretation of Mallows' Cp statistic and an alternative plot in variable selection

Author

Listed:
  • Siniksaran, Enis

Abstract

Mallows' Cp plot is a useful tool for variable selection in linear regression. Though not as popular as the Cp plot, Spjotvoll's Fp and Pp plots are also used in the variable selection procedure. The Cp, Fp and Pp plots are useful in their own right. If the interest is the direct measure of the amount of bias of the submodels and a distributional assumption is not made about the error term, a Cp or Fp plot is used. If a formal testing procedure is to be performed, then a Pp plot is employed. A geometrical approach is used in order to propose an alternative plot that unifies all the information in these three plots, and that has some advantages over them. A Mathematica package has been written to implement the approach.

Suggested Citation

  • Siniksaran, Enis, 2008. "A geometric interpretation of Mallows' Cp statistic and an alternative plot in variable selection," Computational Statistics & Data Analysis, Elsevier, vol. 52(7), pages 3459-3467, March.
  • Handle: RePEc:eee:csdana:v:52:y:2008:i:7:p:3459-3467
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(07)00432-X
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Gatu, Cristian & Yanev, Petko I. & Kontoghiorghes, Erricos J., 2007. "A graph approach to generate all possible regression submodels," Computational Statistics & Data Analysis, Elsevier, vol. 52(2), pages 799-815, October.
    2. Hofmann, Marc & Gatu, Cristian & Kontoghiorghes, Erricos John, 2007. "Efficient algorithms for computing the best subset regression models for large-scale problems," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 16-29, September.
    3. Graham R. Wood & David J. Saville, 2005. "The ubiquitous angle," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 168(1), pages 95-107, January.
    4. Davidson, Russell & MacKinnon, James G., 1993. "Estimation and Inference in Econometrics," OUP Catalogue, Oxford University Press, number 9780195060119, Decembrie.
    5. Jovanovic, B. D. & Hosmer, D. W., 1997. "A simulation of the performance of Cp in model selection for logistic and Poisson regression," Computational Statistics & Data Analysis, Elsevier, vol. 23(3), pages 373-379, January.
    6. Suzanne Sommer & Richard M. Huggins, 1996. "Variables Selection Using the Wald Test and a Robust CP," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 45(1), pages 15-29, March.
    7. Stephen Walker & Christopher Page, 2001. "Generalized ridge regression and a generalization of the CP statistic," Journal of Applied Statistics, Taylor & Francis Journals, vol. 28(7), pages 911-922.
    8. Huang, Yufen & Draper, Norman R., 2003. "Transformations, regression geometry and R2," Computational Statistics & Data Analysis, Elsevier, vol. 42(4), pages 647-664, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Pacheco, Joaquín & Casado, Silvia & Porras, Santiago, 2013. "Exact methods for variable selection in principal component analysis: Guide functions and pre-selection," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 95-111.
    2. Postiglione, Paolo & Benedetti, Roberto & Lafratta, Giovanni, 2010. "A regression tree algorithm for the identification of convergence clubs," Computational Statistics & Data Analysis, Elsevier, vol. 54(11), pages 2776-2785, November.
    3. Yang, Guijun & Wang, Zhigang & Deng, Wei, 2010. "Unbiased generalized quasi-regression," Computational Statistics & Data Analysis, Elsevier, vol. 54(3), pages 779-789, March.
    4. Gatu, Cristian & Kontoghiorghes, Erricos J. & Gilli, Manfred & Winker, Peter, 2008. "An efficient branch-and-bound strategy for subset vector autoregressive model selection," Journal of Economic Dynamics and Control, Elsevier, vol. 32(6), pages 1949-1963, June.
    5. Brusco, Michael J., 2014. "A comparison of simulated annealing algorithms for variable selection in principal component analysis and discriminant analysis," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 38-53.
    6. Fossati, Sebastian, 2012. "Covariate unit root tests with good size and power," Computational Statistics & Data Analysis, Elsevier, vol. 56(11), pages 3070-3079.
    7. Paroli, Roberta & Spezia, Luigi, 2008. "Bayesian inference in non-homogeneous Markov mixtures of periodic autoregressions with state-dependent exogenous variables," Computational Statistics & Data Analysis, Elsevier, vol. 52(5), pages 2311-2330, January.
    8. Hillebrand, Eric & Schnabl, Gunther & Ulu, Yasemin, 2009. "Japanese foreign exchange intervention and the yen-to-dollar exchange rate: A simultaneous equations approach using realized volatility," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 19(3), pages 490-505, July.
    9. Anikó Bíró, 2013. "Subjective mortality hazard shocks and the adjustment of consumption expenditures," Journal of Population Economics, Springer;European Society for Population Economics, vol. 26(4), pages 1379-1408, October.
    10. Zanini, Fabio C. & Irwin, Scott H. & Schnitkey, Gary D. & Sherrick, Bruce J., 2000. "Estimating Farm-Level Yield Distributions For Corn And Soybeans In Illinois," 2000 Annual meeting, July 30-August 2, Tampa, FL 21720, American Agricultural Economics Association (New Name 2008: Agricultural and Applied Economics Association).
    11. Giuseppe Croce & Emanuela Ghignoni, 2011. "Overeducation and spatial flexibility in Italian local labour markets," Working Papers in Public Economics 145, University of Rome La Sapienza, Department of Economics and Law.
    12. Chasco, Coro & López, Ana María & Guillain, Rachel, 2008. "The non-stationary influence of geography on the spatial agglomeration of production in the EU," MPRA Paper 10737, University Library of Munich, Germany.
    13. Davidson, Russell & Flachaire, Emmanuel, 2007. "Asymptotic and bootstrap inference for inequality and poverty measures," Journal of Econometrics, Elsevier, vol. 141(1), pages 141-166, November.
    14. Darrian Collins & Clem Tisdell, 2004. "Outbound Business Travel Depends on Business Returns: Australian Evidence," Australian Economic Papers, Wiley Blackwell, vol. 43(2), pages 192-207, June.
    15. Caginalp, Gunduz & DeSantis, Mark, 2017. "Does price efficiency increase with trading volume? Evidence of nonlinearity and power laws in ETFs," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 467(C), pages 436-452.
    16. Jongeneel, Roelof A. & Ge, Lan, 2005. "Explaining Growth in Dutch Agriculture: Prices, Public R&D, and Technological Change," 2005 International Congress, August 23-27, 2005, Copenhagen, Denmark 24573, European Association of Agricultural Economists.
    17. Dong, Yingying, 2010. "Jumpy or Kinky? Regression Discontinuity without the Discontinuity," MPRA Paper 25461, University Library of Munich, Germany.
    18. PAUL CASHIN & C. JOHN McDERMOTT, 1998. "Are Australia's Current Account Deficits Excessive?," The Economic Record, The Economic Society of Australia, vol. 74(227), pages 346-361, December.
    19. Barnett, William A. & Serletis, Apostolos, 2008. "Consumer preferences and demand systems," Journal of Econometrics, Elsevier, vol. 147(2), pages 210-224, December.
    20. Hany Eldemerdash & Hugh Metcalf & Sara Maioli, 2014. "Twin deficits: new evidence from a developing (oil vs. non-oil) countries’ perspective," Empirical Economics, Springer, vol. 47(3), pages 825-851, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:52:y:2008:i:7:p:3459-3467. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.