Author
Listed:
- Heewon Park
- Satoru Miyano
Abstract
Gene regulatory network inference is a key approach for elucidating molecular mechanisms underlying complex diseases, but accurately inferring them from high-dimensional data, especially when sample sizes are imbalanced, remains a significant challenge. Although the L1-type regularization methods have been used for gene network inference, the existing methods often fail under conditions involving high dimensionality, noise, and unequal sample sizes across phenotypes. To overcome these limitations, this study developed netRL, a novel computational framework that integrates the Random Lasso with prior network biological knowledge. The proposed method leveraged a bootstrap-based strategy to stabilize the selection of key regulatory genes and incorporates network-informed penalization using centrality measures (i.e., hubness and betweenness centrality). This study also introduced a statistical strategy using a hypergeometric test to assess the significance of the inferred edges, thereby enhancing the reliability of the network. Through extensive simulation studies, this study demonstrated that netRL outperforms conventional methods in both network estimation and gene selection. Applying netRL to whole-blood RNA-seq profiles from the Japan COVID-19 Task Force, this study successfully identified distinct phenotype-specific molecular interplays between asymptomatic and critical cases despite pronounced sample imbalance. The findings reveal that asymptomatic networks were dense and enriched for ribosomal proteins, whereas critical networks were sparse, centralized, and characterized by hub genes such as NFKBIA, B2M, CXCL8, and FOS. Pathway enrichment further revealed phenotype-specific biological processes, highlighting molecular signatures of disease progression. The results of this study suggest that enhancing the activity of asymptomatic condition-specific markers (e.g., ribosomal proteins) may provide important insights into the molecular mechanisms underlying COVID-19 severity. Collectively, these results demonstrate that netRL enables biologically interpretable and statistically robust network inference, offering new insights into the molecular basis of COVID-19 severity and broader applications in systems biology.
Suggested Citation
Heewon Park & Satoru Miyano, 2026.
"Network-constrained Random Lasso for biologically interpretable gene network inference across unequal sample sizes,"
PLOS ONE, Public Library of Science, vol. 21(3), pages 1-23, March.
Handle:
RePEc:plo:pone00:0344198
DOI: 10.1371/journal.pone.0344198
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0344198. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.