IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v130y2014icp409-424.html
   My bibliography  Save this article

Variable selection and estimation for longitudinal survey data

Author

Listed:
  • Wang, Li
  • Wang, Suojin
  • Wang, Guannan

Abstract

There is wide interest in studying longitudinal surveys where sample subjects are observed successively over time. Longitudinal surveys have been used in many areas today, for example, in the health and social sciences, to explore relationships or to identify significant variables in regression settings. This paper develops a general strategy for the model selection problem in longitudinal sample surveys. A survey weighted penalized estimating equation approach is proposed to select significant variables and estimate the coefficients simultaneously. The proposed estimators are design consistent and perform as well as the oracle procedure when the correct submodel was known. The estimating function bootstrap is applied to obtain the standard errors of the estimated parameters with good accuracy. A fast and efficient variable selection algorithm is developed to identify significant variables for complex longitudinal survey data. Simulated examples are illustrated to show the usefulness of the proposed methodology under various model settings and sampling designs.

Suggested Citation

  • Wang, Li & Wang, Suojin & Wang, Guannan, 2014. "Variable selection and estimation for longitudinal survey data," Journal of Multivariate Analysis, Elsevier, vol. 130(C), pages 409-424.
  • Handle: RePEc:eee:jmvana:v:130:y:2014:i:c:p:409-424
    DOI: 10.1016/j.jmva.2014.05.006
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X14001158
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2014.05.006?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    2. Wang, Li & Wang, Suojin, 2011. "Nonparametric additive model-assisted estimation for survey data," Journal of Multivariate Analysis, Elsevier, vol. 102(7), pages 1126-1140, August.
    3. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    4. Johnson, Brent A. & Lin, D.Y. & Zeng, Donglin, 2008. "Penalized Estimating Functions and Variable Selection in Semiparametric Regression Models," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 672-680, June.
    5. Wenjiang J. Fu, 2003. "Penalized Estimating Equations," Biometrics, The International Biometric Society, vol. 59(1), pages 126-132, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Laura Dumitrescu & Wei Qian & J. N. K. Rao, 2021. "Inference for longitudinal data from complex sampling surveys: An approach based on quadratic inference functions," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(1), pages 246-274, March.
    2. Nathaniel W. Anderson & Anna J. Markowitz & Daniel Eisenberg & Neal Halfon & Kristin Anderson Moore & Frederick J. Zimmerman, 2022. "The Child and Adolescent Thriving Index 1.0: Developing a Measure of the Outcome Indicators of Well-Being for Population Health Assessment," Child Indicators Research, Springer;The International Society of Child Indicators (ISCI), vol. 15(6), pages 2015-2042, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xingwei Tong & Xin He & Liuquan Sun & Jianguo Sun, 2009. "Variable Selection for Panel Count Data via Non‐Concave Penalized Estimating Function," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 36(4), pages 620-635, December.
    2. Blommaert, A. & Hens, N. & Beutels, Ph., 2014. "Data mining for longitudinal data under multicollinearity and time dependence using penalized generalized estimating equations," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 667-680.
    3. Lu Tang & Peter X.‐K. Song, 2021. "Poststratification fusion learning in longitudinal data analysis," Biometrics, The International Biometric Society, vol. 77(3), pages 914-928, September.
    4. Fan, Yali & Qin, Guoyou & Zhu, Zhongyi, 2012. "Variable selection in robust regression models for longitudinal data," Journal of Multivariate Analysis, Elsevier, vol. 109(C), pages 156-167.
    5. Joseph G. Ibrahim & Hongtu Zhu & Ramon I. Garcia & Ruixin Guo, 2011. "Fixed and Random Effects Selection in Mixed Effects Models," Biometrics, The International Biometric Society, vol. 67(2), pages 495-503, June.
    6. Yongjin Li & Qingzhao Zhang & Qihua Wang, 2017. "Penalized estimation equation for an extended single-index model," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 69(1), pages 169-187, February.
    7. Li, Gaorong & Lian, Heng & Feng, Sanying & Zhu, Lixing, 2013. "Automatic variable selection for longitudinal generalized linear models," Computational Statistics & Data Analysis, Elsevier, vol. 61(C), pages 174-186.
    8. Zhangong Zhou & Rong Jiang & Weimin Qian, 2013. "LAD variable selection for linear models with randomly censored data," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 76(2), pages 287-300, February.
    9. Tamar Sofer & Elizabeth D. Schifano & David C. Christiani & Xihong Lin, 2017. "Weighted pseudolikelihood for SNP set analysis with multiple secondary outcomes in case‐control genetic association studies," Biometrics, The International Biometric Society, vol. 73(4), pages 1210-1220, December.
    10. Bingduo Yang & Christian M. Hafner & Guannan Liu & Wei Long, 2021. "Semiparametric estimation and variable selection for single‐index copula models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 36(7), pages 962-988, November.
    11. Zangdong He & Wanzhu Tu & Sijian Wang & Haoda Fu & Zhangsheng Yu, 2015. "Simultaneous variable selection for joint models of longitudinal and survival outcomes," Biometrics, The International Biometric Society, vol. 71(1), pages 178-187, March.
    12. Wenning Feng & Abdhi Sarkar & Chae Young Lim & Tapabrata Maiti, 2016. "Variable selection for binary spatial regression: Penalized quasi‐likelihood approach," Biometrics, The International Biometric Society, vol. 72(4), pages 1164-1172, December.
    13. Feng, Sanying & He, Wenqi & Li, Feng, 2020. "Model detection and estimation for varying coefficient panel data models with fixed effects," Computational Statistics & Data Analysis, Elsevier, vol. 152(C).
    14. Cheng, Chao & Feng, Xingdong & Huang, Jian & Jiao, Yuling & Zhang, Shuang, 2022. "ℓ0-Regularized high-dimensional accelerated failure time model," Computational Statistics & Data Analysis, Elsevier, vol. 170(C).
    15. Fengting Yi & Niansheng Tang & Jianguo Sun, 2022. "Simultaneous variable selection and estimation for joint models of longitudinal and failure time data with interval censoring," Biometrics, The International Biometric Society, vol. 78(1), pages 151-164, March.
    16. Zhang, Hao Helen & Lu, Wenbin & Wang, Hansheng, 2010. "On sparse estimation for semiparametric linear transformation models," Journal of Multivariate Analysis, Elsevier, vol. 101(7), pages 1594-1606, August.
    17. Zhihua Sun & Yi Liu & Kani Chen & Gang Li, 2022. "Broken adaptive ridge regression for right-censored survival data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(1), pages 69-91, February.
    18. Feng, Sanying & Xue, Liugen, 2015. "Model detection and estimation for single-index varying coefficient model," Journal of Multivariate Analysis, Elsevier, vol. 139(C), pages 227-244.
    19. Fang, Jianglin, 2023. "A split-and-conquer variable selection approach for high-dimensional general semiparametric models with massive data," Journal of Multivariate Analysis, Elsevier, vol. 194(C).
    20. Zhao, Weihua & Lian, Heng & Zhang, Riquan & Lai, Peng, 2016. "Estimation and variable selection for proportional response data with partially linear single-index models," Computational Statistics & Data Analysis, Elsevier, vol. 96(C), pages 40-56.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:130:y:2014:i:c:p:409-424. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.