Author
Listed:
- Reyner Pérez-Campdesuñer
(Faculty of Law, Administrative and Social Sciences, Universidad UTE, Quito 170527, Ecuador)
- Alexander Sánchez-Rodríguez
(Faculty of Engineering Sciences and Industries, Universidad UTE, Quito 170527, Ecuador)
- Rodobaldo Martínez-Vivar
(Faculty of Law, Administrative and Social Sciences, Universidad UTE, Quito 170527, Ecuador)
- Margarita De Miguel-Guzmán
(Departament of Administration, Instituto Superior Tecnológico Atlantic, Santo Domingo 230201, Ecuador)
- Gelmar García-Vidal
(Faculty of Law, Administrative and Social Sciences, Universidad UTE, Quito 170527, Ecuador)
Abstract
The volume of scientific publications has increased exponentially over the past decades across virtually all academic disciplines. In this landscape of information overload, objective criteria are needed to identify high-impact research. Citation counts have traditionally served as a primary indicator of scientific relevance; however, questions remain as to whether they truly reflect the intrinsic quality of a publication. This study investigates the relationship between citation frequency and a wide range of editorial, authorship, and contextual variables. A dataset of 339,609 articles indexed in Scopus was analyzed, retrieved using the search query TITLE-ABS-KEY (management) AND LIMIT-TO (subarea, “Busi”). The research employed a descriptive analysis followed by two predictive modeling approaches: a Random Forest algorithm to assess variable importance, and a binary logistic regression to estimate the probability of a paper being cited. Results indicate that factors such as journal quartile, country of affiliation, number of authors, open access availability, and keyword usage significantly influence citation outcomes. The Random Forest model explained 94.9% of the variance, while the logistic model achieved an AUC of 0.669, allowing the formulation of a predictive citation equation. Findings suggest that multiple determinants beyond content quality drive citation behavior, and that citation probability can be predicted with reasonable accuracy, though inherent model limitations must be acknowledged.
Suggested Citation
Reyner Pérez-Campdesuñer & Alexander Sánchez-Rodríguez & Rodobaldo Martínez-Vivar & Margarita De Miguel-Guzmán & Gelmar García-Vidal, 2025.
"Beyond Quality: Predicting Citation Impact in Business Research Using Data Science,"
Publications, MDPI, vol. 13(3), pages 1-18, September.
Handle:
RePEc:gam:jpubli:v:13:y:2025:i:3:p:42-:d:1742694
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jpubli:v:13:y:2025:i:3:p:42-:d:1742694. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.