IDEAS home Printed from https://ideas.repec.org/p/col/000122/014437.html
   My bibliography  Save this paper

The productivity of top researchers: A semi-nonparametric approach

Author

Listed:
  • Lina M. Cortés
  • Javier Perote
  • Andrés Mora-Valencia

Abstract

Research productivity distributions exhibit heavy tails because it is common for a few researchers to accumulate the majority of the top publications and their corresponding citations. Measurements of this productivity are very sensitive to the field being analyzed and the distribution used. In particular, distributions such as the lognormal distribution seem to systematically underestimate the productivity of the top researchers. In this article, we propose the use of a (log)semi-nonparametric distribution (log-SNP) that nests the lognormal and captures the heavy tail of the productivity distribution through the introduction of new parameters linked to high-order moments. To compare the results, we use research performance data on 140,971 researchers who have produced 253,634 publications in 18 fields of knowledge (O’Boyle and Aguinis, 2012) and show how the log-SNP distribution provides more accurate measures of the performance of the top researchers in their respective fields of knowledge.

Suggested Citation

  • Lina M. Cortés & Javier Perote & Andrés Mora-Valencia, 2016. "The productivity of top researchers: A semi-nonparametric approach," Documentos de Trabajo de Valor Público 14437, Universidad EAFIT.
  • Handle: RePEc:col:000122:014437
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10784/8181
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Bertocchi, Graziella & Gambardella, Alfonso & Jappelli, Tullio & Nappi, Carmela A. & Peracchi, Franco, 2015. "Bibliometric evaluation vs. informed peer review: Evidence from Italy," Research Policy, Elsevier, vol. 44(2), pages 451-466.
    2. Chung, Kee H & Cox, Raymond A K, 1990. "Patterns of Productivity in the Finance Literature: A Study of the Bibliometric Distributions," Journal of Finance, American Finance Association, vol. 45(1), pages 301-309, March.
    3. S. Redner, 1998. "How popular is your paper? An empirical study of the citation distribution," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 4(2), pages 131-134, July.
    4. Glenn Ellison, 2013. "How Does the Market Use Citation Data? The Hirsch Index in Economics," American Economic Journal: Applied Economics, American Economic Association, vol. 5(3), pages 63-90, July.
    5. Finardi, Ugo, 2013. "Correlation between Journal Impact Factor and Citation Performance: An experimental study," Journal of Informetrics, Elsevier, vol. 7(2), pages 357-370.
    6. Kocher, Martin G. & Luptacik, Mikulas & Sutter, Matthias, 2006. "Measuring productivity of research in economics: A cross-country study using DEA," Socio-Economic Planning Sciences, Elsevier, vol. 40(4), pages 314-332, December.
    7. Anne-Wil Harzing & Satu Alakangas, 2016. "Google Scholar, Scopus and the Web of Science: a longitudinal and cross-disciplinary comparison," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(2), pages 787-804, February.
    8. Pedro Albarrán & Juan A. Crespo & Ignacio Ortuño & Javier Ruiz-Castillo, 2011. "The skewness of science in 219 sub-fields and a number of aggregates," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(2), pages 385-397, August.
    9. Birkmaier, Daniel & Wohlrabe, Klaus, 2014. "The Matthew effect in economics reconsidered," Journal of Informetrics, Elsevier, vol. 8(4), pages 880-889.
    10. Ruiz-Castillo, Javier & Costas, Rodrigo, 2014. "The skewness of scientific productivity," Journal of Informetrics, Elsevier, vol. 8(4), pages 917-934.
    11. Chen, Xiaohong, 2007. "Large Sample Sieve Estimation of Semi-Nonparametric Models," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 76, Elsevier.
    12. Hodgson, Geoffrey M & Rothman, Harry, 1999. "The Editors and Authors of Economics Journals: A Case of Institutional Oligopoly?," Economic Journal, Royal Economic Society, vol. 109(453), pages 165-186, February.
    13. Anne-Wil Harzing, 2014. "A longitudinal study of Google Scholar coverage between 2012 and 2013," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(1), pages 565-575, January.
    14. Kaur, Jasleen & Radicchi, Filippo & Menczer, Filippo, 2013. "Universality of scholarly impact metrics," Journal of Informetrics, Elsevier, vol. 7(4), pages 924-932.
    15. Young-Ho Eom & Santo Fortunato, 2011. "Characterizing and Modeling Citation Dynamics," PLOS ONE, Public Library of Science, vol. 6(9), pages 1-7, September.
    16. Juan A Crespo & Ignacio Ortuño-Ortín & Javier Ruiz-Castillo, 2012. "The Citation Merit of Scientific Publications," PLOS ONE, Public Library of Science, vol. 7(11), pages 1-9, November.
    17. Trino-Manuel Niguez & Ivan Paya & David Peel & Javier Perote, 2013. "Higher-order moments in the theory of diversification and portfolio composition," Working Papers 18297128, Lancaster University Management School, Economics Department.
    18. Abramo, Giovanni & D’Angelo, Ciriaco Andrea, 2014. "Assessing national strengths and weaknesses in research fields," Journal of Informetrics, Elsevier, vol. 8(3), pages 766-775.
    19. Campanario, Juan Miguel, 2015. "Providing impact: The distribution of JCR journals according to references they contribute to the 2-year and 5-year journal impact factors," Journal of Informetrics, Elsevier, vol. 9(2), pages 398-407.
    20. da Silva, Roberto & Kalil, Fahad & de Oliveira, José Palazzo Moreira & Martinez, Alexandre Souto, 2012. "Universality in bibliometrics," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(5), pages 2119-2128.
    21. Gallant, A Ronald & Nychka, Douglas W, 1987. "Semi-nonparametric Maximum Likelihood Estimation," Econometrica, Econometric Society, vol. 55(2), pages 363-390, March.
    22. Bárbara S. Lancho-Barrantes & Vicente P. Guerrero-Bote & Félix Moya-Anegón, 2010. "The iceberg hypothesis revisited," Scientometrics, Springer;Akadémiai Kiadó, vol. 85(2), pages 443-461, November.
    23. Perc, Matjaž, 2010. "Zipf’s law and log-normal distributions in measures of scientific output across fields and institutions: 40 years of Slovenia’s research as an example," Journal of Informetrics, Elsevier, vol. 4(3), pages 358-364.
    24. Del Brio, Esther B. & Perote, Javier, 2012. "Gram–Charlier densities: Maximum likelihood versus the method of moments," Insurance: Mathematics and Economics, Elsevier, vol. 51(3), pages 531-537.
    25. Mingers, John & Leydesdorff, Loet, 2015. "A review of theory and practice in scientometrics," European Journal of Operational Research, Elsevier, vol. 246(1), pages 1-19.
    26. Sargan, J D, 1975. "Gram-Charlier Approximations Applied to t Ratios of k-Class Estimators," Econometrica, Econometric Society, vol. 43(2), pages 327-346, March.
    27. Kretschmer, Hildrun & Kretschmer, Theo, 2007. "Lotka's distribution and distribution of co-author pairs’ frequencies," Journal of Informetrics, Elsevier, vol. 1(4), pages 308-337.
    28. Phillips, Peter C B, 1977. "A General Theorem in the Theory of Asymptotic Expansions as Approximations to the Finite Sample Distributions of Econometric Estimators," Econometrica, Econometric Society, vol. 45(6), pages 1517-1534, September.
    29. Ñíguez, Trino-Manuel & Paya, Ivan & Peel, David & Perote, Javier, 2012. "On the stability of the constant relative risk aversion (CRRA) utility under high degrees of uncertainty," Economics Letters, Elsevier, vol. 115(2), pages 244-248.
    30. Ignacio Mauleon & Javier Perote, 2000. "Testing densities with financial data: an empirical comparison of the Edgeworth-Sargan density to the Student's t," The European Journal of Finance, Taylor & Francis Journals, vol. 6(2), pages 225-239.
    31. Jordi Duch & Xiao Han T Zeng & Marta Sales-Pardo & Filippo Radicchi & Shayna Otis & Teresa K Woodruff & Luís A Nunes Amaral, 2012. "The Possible Role of Resource Requirements and Academic Career-Choice Risk on Gender Differences in Publication Rate and Impact," PLOS ONE, Public Library of Science, vol. 7(12), pages 1-11, December.
    32. Borokhovich, Kenneth A, et al, 1995. "Finance Research Productivity and Influence," Journal of Finance, American Finance Association, vol. 50(5), pages 1691-1717, December.
    33. Kaur, Jasleen & Ferrara, Emilio & Menczer, Filippo & Flammini, Alessandro & Radicchi, Filippo, 2015. "Quality versus quantity in scientific impact," Journal of Informetrics, Elsevier, vol. 9(4), pages 800-808.
    34. Day, Theodore Eugene, 2015. "The big consequences of small biases: A simulation of peer review," Research Policy, Elsevier, vol. 44(6), pages 1266-1270.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lina Cortés & Juan M. Lozada & Javier Perote, 2019. "Firm size and concentration inequality: A flexible extension of Gibrat’s law," Documentos de Trabajo de Valor Público 17205, Universidad EAFIT.
    2. Lina M Cortés & Juan M Lozada & Javier Perote, 2021. "Firm size and economic concentration: An analysis from a lognormal expansion," PLOS ONE, Public Library of Science, vol. 16(7), pages 1-21, July.
    3. Alfredo Trespalacios & Lina M. Cortés & Javier Perote, 2019. "Modeling the electricity spot price with switching regime semi-nonparametric distributions," Documentos de Trabajo de Valor Público 17618, Universidad EAFIT.
    4. Cortés, Lina M. & Mora-Valencia, Andrés & Perote, Javier, 2017. "Measuring firm size distribution with semi-nonparametric densities," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 485(C), pages 35-47.
    5. Alfredo Trespalacios & Lina M. Cortés & Javier Perote, 2021. "Modeling Electricity Price and Quantity Uncertainty: An Application for Hedging with Forward Contracts," Energies, MDPI, vol. 14(11), pages 1-26, June.
    6. Trespalacios, Alfredo & Cortés, Lina M. & Perote, Javier, 2020. "Uncertainty in electricity markets from a semi-nonparametric approach," Energy Policy, Elsevier, vol. 137(C).
    7. Lina M. Cortés & Javier Perote & Andrés Mora-Valencia, 2017. "Implicit probability distribution for WTI options: The Black Scholes vs. the semi-nonparametric approach," Documentos de Trabajo de Valor Público 15923, Universidad EAFIT.
    8. Robert A. Buckle & John Creedy, 2019. "An evaluation of metrics used by the Performance-based Research Fund process in New Zealand," New Zealand Economic Papers, Taylor & Francis Journals, vol. 53(3), pages 270-287, September.
    9. Cortés, Lina M. & Mora-Valencia, Andrés & Perote, Javier, 2020. "Retrieving the implicit risk neutral density of WTI options with a semi-nonparametric approach," The North American Journal of Economics and Finance, Elsevier, vol. 54(C).
    10. Marek Kwiek, 2018. "High research productivity in vertically undifferentiated higher education systems: Who are the top performers?," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(1), pages 415-462, April.
    11. Jiménez, Inés & Mora-Valencia, Andrés & Perote, Javier, 2023. "Multivariate dynamics between emerging markets and digital asset markets: An application of the SNP-DCC model," Emerging Markets Review, Elsevier, vol. 56(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kaur, Jasleen & Ferrara, Emilio & Menczer, Filippo & Flammini, Alessandro & Radicchi, Filippo, 2015. "Quality versus quantity in scientific impact," Journal of Informetrics, Elsevier, vol. 9(4), pages 800-808.
    2. Waltman, Ludo, 2016. "A review of the literature on citation impact indicators," Journal of Informetrics, Elsevier, vol. 10(2), pages 365-391.
    3. Del Brio, Esther B. & Perote, Javier, 2012. "Gram–Charlier densities: Maximum likelihood versus the method of moments," Insurance: Mathematics and Economics, Elsevier, vol. 51(3), pages 531-537.
    4. Trespalacios, Alfredo & Cortés, Lina M. & Perote, Javier, 2020. "Uncertainty in electricity markets from a semi-nonparametric approach," Energy Policy, Elsevier, vol. 137(C).
    5. Bouyssou, Denis & Marchant, Thierry, 2016. "Ranking authors using fractional counting of citations: An axiomatic approach," Journal of Informetrics, Elsevier, vol. 10(1), pages 183-199.
    6. Andrés Mora-Valencia & Trino-Manuel Ñíguez & Javier Perote, 2017. "Multivariate approximations to portfolio return distribution," Computational and Mathematical Organization Theory, Springer, vol. 23(3), pages 347-361, September.
    7. Bonaccorsi, Andrea & Haddawy, Peter & Cicero, Tindaro & Hassan, Saeed-Ul, 2017. "The solitude of stars. An analysis of the distributed excellence model of European universities," Journal of Informetrics, Elsevier, vol. 11(2), pages 435-454.
    8. Jiménez, Inés & Mora-Valencia, Andrés & Perote, Javier, 2022. "Semi-nonparametric risk assessment with cryptocurrencies," Research in International Business and Finance, Elsevier, vol. 59(C).
    9. Cortés, Lina M. & Mora-Valencia, Andrés & Perote, Javier, 2017. "Measuring firm size distribution with semi-nonparametric densities," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 485(C), pages 35-47.
    10. Ñíguez, Trino-Manuel & Perote, Javier, 2016. "Multivariate moments expansion density: Application of the dynamic equicorrelation model," Journal of Banking & Finance, Elsevier, vol. 72(S), pages 216-232.
    11. Sergio Copiello, 2019. "The open access citation premium may depend on the openness and inclusiveness of the indexing database, but the relationship is controversial because it is ambiguous where the open access boundary lie," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(2), pages 995-1018, November.
    12. Tol, Richard S.J., 2013. "The Matthew effect for cohorts of economists," Journal of Informetrics, Elsevier, vol. 7(2), pages 522-527.
    13. Trino-Manuel Ñíguez & Javier Perote, 2012. "Forecasting Heavy-Tailed Densities with Positive Edgeworth and Gram-Charlier Expansions," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 74(4), pages 600-627, August.
    14. John Mingers & Jesse R. O’Hanley & Musbaudeen Okunola, 2017. "Using Google Scholar institutional level data to evaluate the quality of university research," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(3), pages 1627-1643, December.
    15. Mike Thelwall, 2016. "Interpreting correlations between citation counts and other indicators," Scientometrics, Springer;Akadémiai Kiadó, vol. 108(1), pages 337-347, July.
    16. Zhihui Zhang & Ying Cheng & Nian Cai Liu, 2015. "Improving the normalization effect of mean-based method from the perspective of optimization: optimization-based linear methods and their performance," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(1), pages 587-607, January.
    17. Del Brio, Esther B. & Mora-Valencia, Andrés & Perote, Javier, 2014. "VaR performance during the subprime and sovereign debt crises: An application to emerging markets," Emerging Markets Review, Elsevier, vol. 20(C), pages 23-41.
    18. Vîiu, Gabriel-Alexandru, 2018. "The lognormal distribution explains the remarkable pattern documented by characteristic scores and scales in scientometrics," Journal of Informetrics, Elsevier, vol. 12(2), pages 401-415.
    19. Andrea Bonaccorsi & Tindaro Cicero & Peter Haddawy & Saeed-UL Hassan, 2017. "Explaining the transatlantic gap in research excellence," Scientometrics, Springer;Akadémiai Kiadó, vol. 110(1), pages 217-241, January.
    20. Jessica Petersen & Fabian Hattke & Rick Vogel, 2017. "Editorial governance and journal impact: a study of management and business journals," Scientometrics, Springer;Akadémiai Kiadó, vol. 112(3), pages 1593-1614, September.

    More about this item

    Keywords

    Research evaluation; Research productivity; Heavy tail distributions; Semi- nonparametric modeling.;
    All these keywords.

    JEL classification:

    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C44 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Operations Research; Statistical Decision Theory
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:col:000122:014437. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Valor Público EAFIT - Centro de estudios e incidencia (email available below). General contact details of provider: https://edirc.repec.org/data/cieafco.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.