IDEAS home Printed from
   My bibliography  Save this article

Incorporating citation impact into analysis of research trends


  • Minchul Lee

    () (Yonsei University)

  • Min Song

    () (Yonsei University)


In the past decades, there have been a number of proposals to apply topic modeling to research trend analysis. However, most of previous studies have relied primarily on document publication year and have not incorporated the impact of articles into trend analysis. Unlike previous trend analysis using topic modeling, we incorporate citation count, which can be viewed as the impact of articles, into trend analysis to shed a new light on the understanding of research trends. To this end, we propose the Generalized Dirichlet multinomial regression (g-DMR) topic model, which improves the DMR topic model by replacing a linear inner product in topic priors, $$\mathrm{exp}\left({{\varvec{x}}}_{d}\cdot {{\varvec{\lambda}}}_{t}\right),$$ exp x d · λ t , with a more general form based on topic distribution function (TDF), $$\mathrm{exp}\left(\mathrm{f}\left({{\varvec{x}}}_{d}\right)\right)+\upvarepsilon$$ exp f x d + ε . We use multidimensional Legendre Polynomial as TDF to capture publication year and the number of citations per publication simultaneously. In DMR model, since metadata could affect the document-topic distribution only monotonically and continuous values such as publication year and citation count need to be discretized, it is difficult to view the dynamic change of each topic. But the g-DMR model can handle various orthogonal continuous variables with arbitrary order of polynomial, so it can show more dynamic topic trends. Two major experiments show that the proposed model is better suited for topic generation with consideration of citation impact than DMR does for the trend analysis in the field of Library and Information Science in general and Text Mining in particular.

Suggested Citation

  • Minchul Lee & Min Song, 2020. "Incorporating citation impact into analysis of research trends," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(2), pages 1191-1224, August.
  • Handle: RePEc:spr:scient:v:124:y:2020:i:2:d:10.1007_s11192-020-03508-3
    DOI: 10.1007/s11192-020-03508-3

    Download full text from publisher

    File URL:
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    1. Maisonobe, Marion & Eckert, Denis & Grossetti, Michel & Jégou, Laurent & Milard, Béatrice, 2016. "The world network of scientific collaborations between cities: domestic or international dynamics?," Journal of Informetrics, Elsevier, vol. 10(4), pages 1025-1036.
    2. Ugo Finardi & Andrea Buratti, 2016. "Scientific collaboration framework of BRICS countries: an analysis of international coauthorship," Scientometrics, Springer;Akadémiai Kiadó, vol. 109(1), pages 433-446, October.
    3. Kang, Kiyeon & Sohn, So Young, 2016. "Evaluating the patenting activities of pharmaceutical research organizations based on new technology indices," Journal of Informetrics, Elsevier, vol. 10(1), pages 74-81.
    4. Xu, Shuo & Hao, Liyuan & An, Xin & Yang, Guancan & Wang, Feifei, 2019. "Emerging research topics detection with multiple machine learning models," Journal of Informetrics, Elsevier, vol. 13(4).
    5. Margaret E. Roberts & Brandon M. Stewart & Dustin Tingley & Christopher Lucas & Jetson Leder‐Luis & Shana Kushner Gadarian & Bethany Albertson & David G. Rand, 2014. "Structural Topic Models for Open‐Ended Survey Responses," American Journal of Political Science, John Wiley & Sons, vol. 58(4), pages 1064-1082, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Minchul Lee & Min Song, 0. "Incorporating citation impact into analysis of research trends," Scientometrics, Springer;Akadémiai Kiadó, vol. 0, pages 1-34.
    2. Loet Leydesdorff & Dieter Franz Kogler & Bowen Yan, 2017. "Mapping patent classifications: portfolio and statistical analysis, and the comparison of strengths and weaknesses," Scientometrics, Springer;Akadémiai Kiadó, vol. 112(3), pages 1573-1591, September.
    3. Parijat Chakrabarti & Margaret Frye, 2017. "A mixed-methods framework for analyzing text data: Integrating computational techniques with qualitative methods in demography," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 37(42), pages 1351-1382.
    4. Shuo Xu & Liyuan Hao & Xin An & Hongshen Pang & Ting Li, 2020. "Review on emerging research topics with key-route main path analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(1), pages 607-624, January.
    5. Simone Belli & Carlos Gonzalo-Penela, 2020. "Science, research, and innovation infospheres in Google results of the Ibero-American countries," Scientometrics, Springer;Akadémiai Kiadó, vol. 123(2), pages 635-653, May.
    6. Savin, Ivan & Drews, Stefan & van den Bergh, Jeroen, 2021. "Free associations of citizens and scientists with economic and green growth: A computational-linguistics analysis," Ecological Economics, Elsevier, vol. 180(C).
    7. Simone Belli & Joan Baltà, 2019. "Stocktaking scientific publication on bi-regional collaboration between Europe 28 and Latin America and the Caribbean," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(3), pages 1447-1480, December.
    8. Ebadi, Ashkan & Tremblay, Stéphane & Goutte, Cyril & Schiffauerova, Andrea, 2020. "Application of machine learning techniques to assess the trends and alignment of the funded research output," Journal of Informetrics, Elsevier, vol. 14(2).
    9. Mladen Djuric & Marina Dobrota & Jovan Filipovic, 2020. "Complexity-based quality indicators for human and social capital in science and research: the case of Serbian Homeland versus Diaspora," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(1), pages 303-328, July.
    10. Mónica D. Oliveira & Inês Mataloto & Panos Kanavos, 2019. "Multi-criteria decision analysis for health technology assessment: addressing methodological challenges to improve the state of the art," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 20(6), pages 891-918, August.
    11. Grace Skogstad & Matt Wilder, 2019. "Strangers at the gate: the role of multidimensional ideas, policy anomalies and institutional gatekeepers in biofuel policy developments in the USA and European Union," Policy Sciences, Springer;Society of Policy Sciences, vol. 52(3), pages 343-366, September.
    12. Stamolampros, Panagiotis & Korfiatis, Nikolaos & Chalvatzis, Konstantinos & Buhalis, Dimitrios, 2020. "Harnessing the “wisdom of employees” from online reviews," Annals of Tourism Research, Elsevier, vol. 80(C).
    13. Joanna Sterling & John T. Jost & Curtis D. Hardin, 2019. "Liberal and Conservative Representations of the Good Society: A (Social) Structural Topic Modeling Approach," SAGE Open, , vol. 9(2), pages 21582440198, May.
    14. Rui Gama & Cristina Barros & Ricardo Fernandes, 2018. "Science Policy, R&D and Knowledge in Portugal: an Application of Social Network Analysis," Journal of the Knowledge Economy, Springer;Portland International Center for Management of Engineering and Technology (PICMET), vol. 9(2), pages 329-358, June.
    15. Andreas Rehs, 2020. "A structural topic model approach to scientific reorientation of economics and chemistry after German reunification," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(2), pages 1229-1251, November.
    16. Federica Genovese & Endre Tvinnereim, 2019. "Who opposes climate regulation? Business preferences for the European emission trading scheme," The Review of International Organizations, Springer, vol. 14(3), pages 511-542, September.
    17. Valentina Anzoise & Debora Slanzi & Irene Poli, 2020. "Local stakeholders’ narratives about large-scale urban development: The Zhejiang Hangzhou Future Sci-Tech City," Urban Studies, Urban Studies Journal Limited, vol. 57(3), pages 655-671, February.
    18. Ulrich Fritsche & Johannes Puckelwald, 2018. "Deciphering Professional Forecasters’ Stories - Analyzing a Corpus of Textual Predictions for the German Economy," Macroeconomics and Finance Series 201804, University of Hamburg, Department of Socioeconomics.
    19. Mourtgos, Scott M. & Adams, Ian T., 2019. "The rhetoric of de-policing: Evaluating open-ended survey responses from police officers with machine learning-based structural topic modeling," Journal of Criminal Justice, Elsevier, vol. 64(C), pages 1-1.
    20. Petra Maresova & Ivan Soukal & Ruzena Stemberkova & Kamil Kuca, 2020. "Innovation in the public sector in a small open economy-initial investigation of patent activity at the Czech universities," Journal of Innovation and Entrepreneurship, Springer, vol. 9(1), pages 1-16, December.


    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:124:y:2020:i:2:d:10.1007_s11192-020-03508-3. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Sonal Shukla) or (Springer Nature Abstracting and Indexing). General contact details of provider: .

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service hosted by the Research Division of the Federal Reserve Bank of St. Louis . RePEc uses bibliographic data supplied by the respective publishers.