IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v55y2011i1p366-374.html

Generalized additive models and inflated type I error rates of smoother significance tests

Author

Listed:
  • Young, Robin L.
  • Weinberg, Janice
  • Vieira, Verónica
  • Ozonoff, Al
  • Webster, Thomas F.

Abstract

Generalized additive models (GAMs) have distinct advantages over generalized linear models as they allow investigators to make inferences about associations between outcomes and predictors without placing parametric restrictions on the associations. The variable of interest is often smoothed using a locally weighted scatterplot smoothing (LOESS) and the optimal span (degree of smoothing) can be determined by minimizing the Akaike Information Criterion (AIC). A natural hypothesis when using GAMs is to test whether the smoothing term is necessary or if a simpler model would suffice. The statistic of interest is the difference in deviances between models including and excluding the smoothed term. As approximate chi-square tests of this hypothesis are known to be biased, permutation tests are a reasonable alternative. We compare the type I error rates of the chi-square test and of three permutation test methods using synthetic data generated under the null hypothesis. In each permutation method a distribution of differences in deviances is obtained from 999 permuted datasets and the null hypothesis is rejected if the observed statistic falls in the upper 5% of the distribution. One test is a conditional permutation test using the optimal span size for the observed data; this span size is held constant for all permutations. This test is shown to have an inflated type I error rate. Alternatively, the span size can be fixed a priori such that the span selection technique is not reliant on the observed data. This test is shown to be unbiased; however, the choice of span size is not clear. A third method is an unconditional permutation test where the optimal span size is selected for observed and permuted datasets. This test is unbiased though computationally intensive.

Suggested Citation

  • Young, Robin L. & Weinberg, Janice & Vieira, Verónica & Ozonoff, Al & Webster, Thomas F., 2011. "Generalized additive models and inflated type I error rates of smoother significance tests," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 366-374, January.
  • Handle: RePEc:eee:csdana:v:55:y:2011:i:1:p:366-374
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(10)00191-X
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Clifford M. Hurvich & Jeffrey S. Simonoff & Chih‐Ling Tsai, 1998. "Smoothing parameter selection in nonparametric regression using an improved Akaike information criterion," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(2), pages 271-293.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Pearl Anne Ante-Testard & Francois Rerolle & Anna T. Nguyen & Sania Ashraf & Sarker Masud Parvez & Abu Mohammed Naser & Tarik Benmarhnia & Mahbubur Rahman & Stephen P. Luby & Jade Benjamin-Chung & Ben, 2024. "WASH interventions and child diarrhea at the interface of climate and socioeconomic position in Bangladesh," Nature Communications, Nature, vol. 15(1), pages 1-13, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shuichi Kawano, 2014. "Selection of tuning parameters in bridge regression models via Bayesian information criterion," Statistical Papers, Springer, vol. 55(4), pages 1207-1223, November.
    2. Juan Manuel Julio & Norberto Rodr�guez & H�ctor Manuel Z�rate, 2005. "Estimating the COP Exchange Rate Volatility Smile and the Market Effect of Central Bank Interventions: A CHARN Approach," Borradores de Economia 2605, Banco de la Republica.
    3. Malloy, Elizabeth J. & Spiegelman, Donna & Eisen, Ellen A., 2009. "Comparing measures of model selection for penalized splines in Cox models," Computational Statistics & Data Analysis, Elsevier, vol. 53(7), pages 2605-2616, May.
    4. Karimu, Amin & Brännlund, Runar, 2013. "Functional form and aggregate energy demand elasticities: A nonparametric panel approach for 17 OECD countries," Energy Economics, Elsevier, vol. 36(C), pages 19-27.
    5. Liao, Jun & Zou, Guohua, 2020. "Corrected Mallows criterion for model averaging," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    6. Lu, Jun & Lin, Lu, 2018. "Feature screening for multi-response varying coefficient models with ultrahigh dimensional predictors," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 242-254.
    7. Chu, Chi-Yang & Henderson, Daniel J. & Parmeter, Christopher F., 2017. "On discrete Epanechnikov kernel functions," Computational Statistics & Data Analysis, Elsevier, vol. 116(C), pages 79-105.
    8. Salvatore Ingrassia & Simona Minotti & Giorgio Vittadini, 2012. "Local Statistical Modeling via a Cluster-Weighted Approach with Elliptical Distributions," Journal of Classification, Springer;The Classification Society, vol. 29(3), pages 363-401, October.
    9. Maria Sassi, 2010. "OLS and GWR Approaches to Agricultural Convergence in the EU-15," International Advances in Economic Research, Springer;International Atlantic Economic Society, vol. 16(1), pages 96-108, February.
    10. Nagler Thomas & Schellhase Christian & Czado Claudia, 2017. "Nonparametric estimation of simplified vine copula models: comparison of methods," Dependence Modeling, De Gruyter, vol. 5(1), pages 99-120, January.
    11. Arturo Bujanda & Thomas M. Fullerton, 2017. "Impacts of transportation infrastructure on single-family property values," Applied Economics, Taylor & Francis Journals, vol. 49(51), pages 5183-5199, November.
    12. Costas Milas & Ruthira Naraidoo, 2009. "Financial Market Conditions, Real Time, Nonlinearity and European Central Bank Monetary Policy: In-Sample and Out-of-Sample Assessment," Working Papers 200923, University of Pretoria, Department of Economics.
    13. Frölich, Markus & Huber, Martin & Wiesenfarth, Manuel, 2017. "The finite sample performance of semi- and non-parametric estimators for treatment effects and policy evaluation," Computational Statistics & Data Analysis, Elsevier, vol. 115(C), pages 91-102.
    14. Víctor M. Guerrero & Daniela Cortés Toto & Hortensia J. Reyes Cervantes, 2018. "Effect of autocorrelation when estimating the trend of a time series via penalized least squares with controlled smoothness," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(1), pages 109-130, March.
    15. Yanagihara, Hirokazu & Satoh, Kenichi, 2010. "An unbiased Cp criterion for multivariate ridge regression," Journal of Multivariate Analysis, Elsevier, vol. 101(5), pages 1226-1238, May.
    16. Simon Lineu Umbach, 2020. "Forecasting with supervised factor models," Empirical Economics, Springer, vol. 58(1), pages 169-190, January.
    17. Farahani Mohd Saimi & Firdaus Mohamad Hamzah & Mohd Ekhwan Toriman & Othman Jaafar & Hazrina Tajudin, 2020. "Trend and Linearity Analysis of Meteorological Parameters in Peninsular Malaysia," Sustainability, MDPI, vol. 12(22), pages 1-19, November.
    18. Khalid Al-Ahmadi & Ali Al-Zahrani, 2013. "NO 2 and Cancer Incidence in Saudi Arabia," IJERPH, MDPI, vol. 10(11), pages 1-19, November.
    19. Baglan, Deniz & Ege Yazgan, M. & Yilmazkuday, Hakan, 2016. "Relative price variability and inflation: New evidence," Journal of Macroeconomics, Elsevier, vol. 48(C), pages 263-282.
    20. Ma, Xinwei & Ji, Yanjie & Yuan, Yufei & Van Oort, Niels & Jin, Yuchuan & Hoogendoorn, Serge, 2020. "A comparison in travel patterns and determinants of user demand between docked and dockless bike-sharing systems using multi-sourced data," Transportation Research Part A: Policy and Practice, Elsevier, vol. 139(C), pages 148-173.

    More about this item

    Keywords

    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:55:y:2011:i:1:p:366-374. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.