IDEAS home Printed from https://ideas.repec.org/a/oup/biomet/v104y2017i1p83-96..html
   My bibliography  Save this article

Regression modelling on stratified data with the lasso

Author

Listed:
  • E. Ollier
  • V. Viallon

Abstract

SUMMARY We consider the estimation of regression models on strata defined using a categorical covariate, in order to identify interactions between this categorical covariate and the other predictors. A basic approach requires the choice of a reference stratum. We show that the performance of a penalized version of this approach depends on this arbitrary choice, and propose an approach that bypasses this at almost no additional computational cost. Regarding model selection consistency, our proposal mimics the strategy based on an optimal and covariate-specific choice for the reference stratum. An empirical study confirms that our proposal generally outperforms the basic approach in the identification and description of the interactions. An illustration on gene expression data is provided.

Suggested Citation

  • E. Ollier & V. Viallon, 2017. "Regression modelling on stratified data with the lasso," Biometrika, Biometrika Trust, vol. 104(1), pages 83-96.
  • Handle: RePEc:oup:biomet:v:104:y:2017:i:1:p:83-96.
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1093/biomet/asw065
    Download Restriction: Access to full text is restricted to subscribers.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Qian, Junyang & Jia, Jinzhu, 2016. "On stepwise pattern recovery of the fused Lasso," Computational Statistics & Data Analysis, Elsevier, vol. 94(C), pages 221-237.
    2. Ollier, Edouard & Samson, Adeline & Delavenne, Xavier & Viallon, Vivian, 2016. "A SAEM algorithm for fused lasso penalized NonLinear Mixed Effect Models: Application to group comparison in pharmacokinetics," Computational Statistics & Data Analysis, Elsevier, vol. 95(C), pages 207-221.
    3. Reid, Stephen & Tibshirani, Rob, 2014. "Regularization Paths for Conditional Logistic Regression: The clogitL1 Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 58(i12).
    4. Gross, Samuel M. & Tibshirani, Robert, 2016. "Data Shared Lasso: A novel tool to discover uplift," Computational Statistics & Data Analysis, Elsevier, vol. 101(C), pages 226-235.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zachary F. Fisher & Younghoon Kim & Barbara L. Fredrickson & Vladas Pipiras, 2022. "Penalized Estimation and Forecasting of Multiple Subject Intensive Longitudinal Data," Psychometrika, Springer;The Psychometric Society, vol. 87(2), pages 1-29, June.
    2. Rafael Blanquero & Emilio Carrizosa & Pepa Ramírez-Cobo & M. Remedios Sillero-Denamiel, 2021. "A cost-sensitive constrained Lasso," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 15(1), pages 121-158, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shi, Haolun & Yin, Guosheng, 2018. "Boosting conditional logit model," Journal of choice modelling, Elsevier, vol. 26(C), pages 48-63.
    2. Zachary F. Fisher & Younghoon Kim & Barbara L. Fredrickson & Vladas Pipiras, 2022. "Penalized Estimation and Forecasting of Multiple Subject Intensive Longitudinal Data," Psychometrika, Springer;The Psychometric Society, vol. 87(2), pages 1-29, June.
    3. Karsten Schweikert, 2022. "Oracle Efficient Estimation of Structural Breaks in Cointegrating Regressions," Journal of Time Series Analysis, Wiley Blackwell, vol. 43(1), pages 83-104, January.
    4. Lu Tang & Peter X.‐K. Song, 2021. "Poststratification fusion learning in longitudinal data analysis," Biometrics, The International Biometric Society, vol. 77(3), pages 914-928, September.
    5. Karsten Schweikert, 2020. "Oracle Efficient Estimation of Structural Breaks in Cointegrating Regressions," Papers 2001.07949, arXiv.org, revised Apr 2021.
    6. Ollier, Edouard, 2022. "Fast selection of nonlinear mixed effect models using penalized likelihood," Computational Statistics & Data Analysis, Elsevier, vol. 167(C).
    7. Won Son & Johan Lim & Donghyeon Yu, 2023. "Path algorithms for fused lasso signal approximator with application to COVID‐19 spread in Korea," International Statistical Review, International Statistical Institute, vol. 91(2), pages 218-242, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:oup:biomet:v:104:y:2017:i:1:p:83-96.. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Oxford University Press (email available below). General contact details of provider: https://academic.oup.com/biomet .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.