IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v106y2017icp12-26.html

Parametric methods for confidence interval estimation of overlap coefficients

Author

Listed:
  • Wang, Dan
  • Tian, Lili

Abstract

Overlap coefficient (OVL), the proportion of overlap area between two probability distributions, is a direct measure of similarity between two distributions. It is useful in microarray analysis for the purpose of identifying differentially expressed biomarkers, especially when data follow multimodal distribution which cannot be transformed to normal. However, the inference methods about OVL are quite sparse. This article proposes two methods, a generalized inference (GI) approach and a parametric bootstrapping (PB) method, to construct confidence intervals of OVL under the assumption of normality. In conjunction with the EM algorithms, these methods are extended to mixture Gaussian (MG) distributions. The performances of these methods are evaluated empirically under a variety of distributions including normal, gamma and mixture Gaussian. At last, the proposed approaches are applied to a published microarray dataset from a gene expression study of three most prevalent adult lymphoid malignancies.

Suggested Citation

  • Wang, Dan & Tian, Lili, 2017. "Parametric methods for confidence interval estimation of overlap coefficients," Computational Statistics & Data Analysis, Elsevier, vol. 106(C), pages 12-26.
  • Handle: RePEc:eee:csdana:v:106:y:2017:i:c:p:12-26
    DOI: 10.1016/j.csda.2016.08.013
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S016794731630202X
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2016.08.013?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Anderson, Gordon & Linton, Oliver & Whang, Yoon-Jae, 2012. "Nonparametric estimation and inference about the overlap of two distributions," Journal of Econometrics, Elsevier, vol. 171(1), pages 1-23.
    2. K. Krishnamoorthy & Yong Lu, 2003. "Inferences on the Common Mean of Several Normal Populations Based on the Generalized Variable Method," Biometrics, The International Biometric Society, vol. 59(2), pages 237-247, June.
    3. Kelly Zou & W. J. Hall, 2000. "Two transformation models for estimating an ROC curve derived from continuous data," Journal of Applied Statistics, Taylor & Francis Journals, vol. 27(5), pages 621-631.
    4. Clemons, Traci E. & Jr., Edwin L. Bradley, 2000. "A nonparametric measure of the overlapping coefficient," Computational Statistics & Data Analysis, Elsevier, vol. 34(1), pages 51-61, July.
    5. Schmid, Friedrich & Schmidt, Axel, 2006. "Nonparametric estimation of the coefficient of overlapping--theory and empirical application," Computational Statistics & Data Analysis, Elsevier, vol. 50(6), pages 1583-1596, March.
    6. Mulekar, Madhuri S. & Mishra, Satya N., 2000. "Confidence interval estimation of overlap: equal means case," Computational Statistics & Data Analysis, Elsevier, vol. 34(2), pages 121-137, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Eidous, Omar M. & Ananbeh, Enas A., 2024. "Kernel method for estimating overlapping coefficient using numerical integration methods," Applied Mathematics and Computation, Elsevier, vol. 462(C).
    2. Li Yan, 2022. "Confidence interval estimation of the common mean of several gamma populations," PLOS ONE, Public Library of Science, vol. 17(6), pages 1-13, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Schmid, Friedrich & Schmidt, Axel, 2006. "Nonparametric estimation of the coefficient of overlapping--theory and empirical application," Computational Statistics & Data Analysis, Elsevier, vol. 50(6), pages 1583-1596, March.
    2. Anderson, Gordon & Linton, Oliver & Whang, Yoon-Jae, 2012. "Nonparametric estimation and inference about the overlap of two distributions," Journal of Econometrics, Elsevier, vol. 171(1), pages 1-23.
    3. Gordon Anderson & Oliver Linton & Yoon-Jae Wang, 2009. "Non Parametric Estimation of a Polarization Measure," Working Papers tecipa-363, University of Toronto, Department of Economics.
    4. Eidous, Omar M. & Ananbeh, Enas A., 2024. "Kernel method for estimating overlapping coefficient using numerical integration methods," Applied Mathematics and Computation, Elsevier, vol. 462(C).
    5. Yin, Jingjing & Tian, Lili, 2014. "Joint inference about sensitivity and specificity at the optimal cut-off point associated with Youden index," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 1-13.
    6. Lee, Sokbae & Song, Kyungchul & Whang, Yoon-Jae, 2018. "Testing For A General Class Of Functional Inequalities," Econometric Theory, Cambridge University Press, vol. 34(5), pages 1018-1064, October.
    7. Bonica, Adam & Sen, Maya, 2015. "The Politics of Selecting the Bench from the Bar: The Legal Profession and Partisan Incentives to Politicize the Judiciary," Working Paper Series rwp15-001, Harvard University, John F. Kennedy School of Government.
    8. Y. Huang & M. S. Pepe, 2009. "A Parametric ROC Model-Based Approach for Evaluating the Predictiveness of Continuous Markers in Case–Control Studies," Biometrics, The International Biometric Society, vol. 65(4), pages 1133-1144, December.
    9. Liu, Shen & Maharaj, Elizabeth Ann & Inder, Brett, 2014. "Polarization of forecast densities: A new approach to time series classification," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 345-361.
    10. Beare, Brendan K. & Shi, Xiaoxia, 2019. "An improved bootstrap test of density ratio ordering," Econometrics and Statistics, Elsevier, vol. 10(C), pages 9-26.
    11. Hamza Dhaker & Papa Ngom & Boubakari Ibrahimouf & Malick Mbodj, 2019. "Overlap Coefficients Based on Kullback-Leibler of Two Normal Densities: Equal Means Case," Journal of Mathematics Research, Canadian Center of Science and Education, vol. 11(2), pages 114-124, April.
    12. Gordon Anderson & Oliver Linton & Jasmin Thomas, 2017. "Similarity, dissimilarity and exceptionality: generalizing Gini’s transvariation to measure “differentness” in many distributions," METRON, Springer;Sapienza Università di Roma, vol. 75(2), pages 161-180, August.
    13. Rolando Gonzales & Gabriela Aguilera-Lizarazu & Andrea Rojas-Hosse & Patricia Aranda, 2016. "Preference for women but less preference for indigenous women: A lab-field experiment of loan discrimination in a developing economy," Working Papers PIERI 2016-24, PEP-PIERI.
    14. Li, Xinmin & Wang, Juan & Liang, Hua, 2011. "Comparison of several means: A fiducial based approach," Computational Statistics & Data Analysis, Elsevier, vol. 55(5), pages 1993-2002, May.
    15. Gordon Anderson & Teng Leo & Robert Muelhaupt, 2014. "Measuring Advances in Equality of Opportunity: The Changing Gender Gap in Educational Attainment in Canada in the Last Half Century," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 119(1), pages 73-99, October.
    16. Christian M Dahl & Martin Huber & Giovanni Mellace, 2023. "It is never too LATE: a new look at local average treatment effects with or without defiers," The Econometrics Journal, Royal Economic Society, vol. 26(3), pages 378-404.
    17. Seo, Juwon, 2018. "Tests of stochastic monotonicity with improved power," Journal of Econometrics, Elsevier, vol. 207(1), pages 53-70.
    18. Chang, Ching-Hui & Pal, Nabendu, 2008. "Testing on the common mean of several normal distributions," Computational Statistics & Data Analysis, Elsevier, vol. 53(2), pages 321-333, December.
    19. Anderson, Gordon & Leo, Teng Wah, 2013. "An empirical examination of matching theories: The one child policy, partner choice and matching intensity in urban China," Journal of Comparative Economics, Elsevier, vol. 41(2), pages 468-489.
    20. H. Zakerzadeh & A. Jafari, 2015. "Inference on the parameters of two Weibull distributions based on record values," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(1), pages 25-40, March.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:106:y:2017:i:c:p:12-26. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.