IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v129y2024i1d10.1007_s11192-023-04849-5.html
   My bibliography  Save this article

Do two wrongs make a right? Measuring the effect of publications on science careers

Author

Listed:
  • Donna K. Ginther

    (University of Kansas
    National Bureau of Economic Research)

  • Carlos Zambrana

    (University of Kansas)

  • Patricia Oslund

    (University of Kansas)

  • Wan-Ying Chang

    (National Science Foundation)

Abstract

This paper examines whether publication data matched to the Survey of Doctorate Recipients can be used for research purposes. We use Gold Standard data created to validate the publication match quality and compare these measures to publications assigned by a machine-learning algorithm developed by Thomson Reuters (now Clarivate). Our econometric model demonstrates that publications likely suffer from non-classical measurement error. Using horse race and instrumental variable models, we confirm that the Gold Standard data are relatively free from measurement error but show that the Clarivate data suffer from non-classical measurement error. We employ a variety of methods to adjust the Clarivate data for false negatives and false positives and demonstrate that with these adjustments the data produce estimates very similar to the Gold Standard. However, these adjustments are not as useful when publications are used as a dependent variable. We recommend using subsamples of the data that have better match quality when using the Clarivate data as a dependent variable.

Suggested Citation

  • Donna K. Ginther & Carlos Zambrana & Patricia Oslund & Wan-Ying Chang, 2024. "Do two wrongs make a right? Measuring the effect of publications on science careers," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(1), pages 289-320, January.
  • Handle: RePEc:spr:scient:v:129:y:2024:i:1:d:10.1007_s11192-023-04849-5
    DOI: 10.1007/s11192-023-04849-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-023-04849-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-023-04849-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Aigner, Dennis J., 1973. "Regression with a binary independent variable subject to errors of observation," Journal of Econometrics, Elsevier, vol. 1(1), pages 49-59, March.
    2. Donna K Ginther & Jodi Basner & Unni Jensen & Joshua Schnell & Raynard Kington & Walter T Schaffer, 2018. "Publications as predictors of racial and ethnic differences in NIH research awards," PLOS ONE, Public Library of Science, vol. 13(11), pages 1-24, November.
    3. Xuan Jiang & Wan-Ying Chang & Bruce A Weinberg, 2021. "Man versus machine? Self-reports versus algorithmic measurement of publications," PLOS ONE, Public Library of Science, vol. 16(9), pages 1-22, September.
    4. Bollinger, Christopher R., 1996. "Bounding mean regressions when a binary regressor is mismeasured," Journal of Econometrics, Elsevier, vol. 73(2), pages 387-399, August.
    5. AIGNER, Dennis J., 1973. "Regression with a binary independent variable subject to errors of observation," LIDAM Reprints CORE 130, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Adele Bergin, 2015. "Employer Changes and Wage Changes: Estimation with Measurement Error in a Binary Variable," LABOUR, CEIS, vol. 29(2), pages 194-223, June.
    2. Santiago Acerenza & Kyunghoon Ban & D'esir'e K'edagni, 2021. "Local Average and Marginal Treatment Effects with a Misclassified Treatment," Papers 2105.00358, arXiv.org, revised Sep 2024.
    3. Craig Gundersen & Brent Kreider, 2008. "Food Stamps and Food Insecurity: What Can Be Learned in the Presence of Nonclassical Measurement Error?," Journal of Human Resources, University of Wisconsin Press, vol. 43(2), pages 352-382.
    4. Kreider, Brent & Pepper, John V., 2007. "Disability and Employment: Reevaluating the Evidence in Light of Reporting Errors," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 432-441, June.
    5. Violeta Misheva & Dinand Webbink & Nicholas G. Martin, 2017. "The effect of child maltreatment on illegal and problematic behaviour: new evidence on the ‘cycle of violence’ using twins data," Journal of Population Economics, Springer;European Society for Population Economics, vol. 30(4), pages 1035-1067, October.
    6. Frazis, Harley & Loewenstein, Mark A., 2003. "Estimating linear regressions with mismeasured, possibly endogenous, binary explanatory variables," Journal of Econometrics, Elsevier, vol. 117(1), pages 151-178, November.
    7. Wossen, Tesfamicheal & Abay, Kibrom A. & Abdoulaye, Tahirou, 2022. "Misperceiving and misreporting input quality: Implications for input use and productivity," Journal of Development Economics, Elsevier, vol. 157(C).
    8. Erich Battistin & Barbara Sianesi, 2006. "Misreported schooling and returns to education: evidence from the UK," CeMMAP working papers CWP07/06, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    9. Hu, Yingyao, 2008. "Identification and estimation of nonlinear models with misclassification error using instrumental variables: A general solution," Journal of Econometrics, Elsevier, vol. 144(1), pages 27-61, May.
    10. Maury Gittleman & Morris M. Kleiner, 2016. "Wage Effects of Unionization and Occupational Licensing Coverage in the United States," ILR Review, Cornell University, ILR School, vol. 69(1), pages 142-172, January.
    11. Brent Kreider & Steven C. Hill, 2009. "Partially Identifying Treatment Effects with an Application to Covering the Uninsured," Journal of Human Resources, University of Wisconsin Press, vol. 44(2).
    12. Adele Bergin, 2013. "Job Changes and Wage Changes: Estimation with Measurement Error in a Binary Variable," Economics Department Working Paper Series n240-13.pdf, Department of Economics, National University of Ireland - Maynooth.
    13. Bellou, Andriana & Bhatt, Rachana, 2013. "Reducing underage alcohol and tobacco use: Evidence from the introduction of vertical identification cards," Journal of Health Economics, Elsevier, vol. 32(2), pages 353-366.
    14. Ganna Vakhitova & Christopher R. Bollinger, 2011. "Labor Market Return to Computer Skills: Using Microsoft Certification to Measure Computer Skills," Discussion Papers 46, Kyiv School of Economics.
    15. Arthur Lewbel, 2007. "Estimation of Average Treatment Effects with Misclassification," Econometrica, Econometric Society, vol. 75(2), pages 537-551, March.
    16. Molinari, Francesca, 2008. "Partial identification of probability distributions with misclassified data," Journal of Econometrics, Elsevier, vol. 144(1), pages 81-117, May.
    17. Francis DiTraglia & Camilo Garcia-Jimeno, 2015. "On Mis-measured Binary Regressors: New Results And Some Comments on the Literature, Third Version," PIER Working Paper Archive 15-040, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania, revised 24 Nov 2015.
    18. Brachet, Tanguy, 2008. "Maternal Smoking, Misclassification, and Infant Health," MPRA Paper 21466, University Library of Munich, Germany.
    19. Steven J. Haider & Melvin Stephens Jr., 2020. "Correcting for Misclassified Binary Regressors Using Instrumental Variables," NBER Working Papers 27797, National Bureau of Economic Research, Inc.
    20. Takahide Yanagi, 2019. "Inference on local average treatment effects for misclassified treatment," Econometric Reviews, Taylor & Francis Journals, vol. 38(8), pages 938-960, September.

    More about this item

    Keywords

    Measurement error; Publications; Science;
    All these keywords.

    JEL classification:

    • O3 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights
    • J4 - Labor and Demographic Economics - - Particular Labor Markets
    • C26 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Instrumental Variables (IV) Estimation

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:129:y:2024:i:1:d:10.1007_s11192-023-04849-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.