Predicting news deserts using supervised machine learning

Author

  • Arijit Paladhi (Indiana University)

Abstract

The decline of local newspapers has led to the emergence of news deserts—areas lacking access to critical local information—posing a threat to community engagement and democracy. This study aims to predict which U.S. counties are most at risk of becoming news deserts by developing machine learning models based on socioeconomic, geographic, and circulation data. Addressing class imbalance and data noise, we employed classifiers such as Logistic Regression, Random Forest, XGBoost, Support Vector Machines, K-Nearest Neighbors, and Naive Bayes, combined with resampling techniques like SMOTE, Tomek Links, SMOTETomek, SMOTEENN, and ADASYN. Our analysis found that XGBoost combined with ADASYN performed best, achieving an F2-Score of 0.486 and AUC-PR of 0.467 on test data. These results provide valuable insights for policymakers aiming to develop targeted interventions to preserve local media ecosystems and strengthen democratic processes.
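For readers unfamiliar with the two metrics the abstract reports, the following is a minimal pure-Python sketch of how an F2-Score and an AUC-PR style summary can be computed. The function names `fbeta_score` and `average_precision` and the toy labels are illustrative assumptions, not taken from the paper. Average precision is only one common estimator of AUC-PR and may differ from the computation the author used.

```python
def fbeta_score(y_true, y_pred, beta=2.0):
    """F-beta for binary labels (1 = positive class, e.g. an at-risk county).

    beta > 1 weights recall more heavily than precision; beta = 2 gives F2.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


def average_precision(y_true, scores):
    """Average precision: mean of precision@k over the ranks k of the positives.

    Assumes no tied scores; ties would need an explicit tie-breaking rule.
    """
    ranked = sorted(zip(scores, y_true), key=lambda pair: -pair[0])
    n_pos = sum(y_true)
    if n_pos == 0:
        return 0.0
    hits = 0
    total = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            total += hits / rank
    return total / n_pos


if __name__ == "__main__":
    # Toy data (hypothetical): 3 positives out of 8, classifier over-predicts.
    y_true = [1, 1, 1, 0, 0, 0, 0, 0]
    y_pred = [1, 1, 1, 1, 1, 0, 0, 0]
    print("F1:", fbeta_score(y_true, y_pred, beta=1.0))  # 0.75
    print("F2:", fbeta_score(y_true, y_pred, beta=2.0))  # about 0.882
    print("AP:", average_precision([1, 0, 1, 0], [0.9, 0.8, 0.7, 0.1]))  # about 0.833
```

In the toy example every positive is found at the cost of two false alarms, so F2 is higher than F1. A recall-weighted metric like this fits a setting where missing an at-risk county is presumably costlier than flagging a safe one.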

Suggested Citation

  • Arijit Paladhi, 2025. "Predicting news deserts using supervised machine learning," Journal of Computational Social Science, Springer, vol. 8(2), pages 1-29, May.
  • Handle: RePEc:spr:jcsosc:v:8:y:2025:i:2:d:10.1007_s42001-025-00379-7
    DOI: 10.1007/s42001-025-00379-7

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s42001-025-00379-7
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

