Author
Listed:
- Lönnstedt Ingrid M.
(Department of Immunology, Genetics and Pathology, Uppsala University, 75185 Uppsala, Sweden)
- Nelander Sven
(Department of Immunology, Genetics and Pathology, Uppsala University, 75185 Uppsala, Sweden)
Abstract
The systematic study of transcriptional responses to genetic and chemical perturbations in human cells is still in its early stages. The largest available dataset to date is the newly released L1000 compendium. With its 1.3 million gene expression profiles of treated human cells it offers many opportunities for biomedical data mining, but also data normalization challenges of new dimensions. We developed a novel and practical approach to obtain accurate estimates of fold change response profiles from L1000, based on the RUV (Remove Unwanted Variation) statistical framework. Extending RUV to a big data setting, we propose an estimation procedure, in which an underlying RUV model is tuned by feedback through dataset specific statistical measures, reflecting p-value distributions and internal gene knockdown controls. Applying these metrics – termed evaluation endpoints – to disjoint data splits and integrating the results to select an optimal normalization, the procedure reduces bias and noise in the L1000 data, which in turn broadens the potential of this resource for pharmacological and functional genomic analyses. Our pipeline and normalization results are distributed as an R package (nelanderlab.org/FC1000.html).
Suggested Citation
Lönnstedt Ingrid M. & Nelander Sven, 2017.
"FC1000: normalized gene expression changes of systematically perturbed human cells,"
Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 16(4), pages 217-242, September.
Handle:
RePEc:bpj:sagmbi:v:16:y:2017:i:4:p:217-242:n:2
DOI: 10.1515/sagmb-2016-0072
Download full text from publisher
As the access to this document is restricted, you may want to
for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:sagmbi:v:16:y:2017:i:4:p:217-242:n:2. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyterbrill.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.