
Heterogeneity of diagnosis and documentation of post-COVID conditions in primary care: A machine learning analysis

Author

Listed:
  • Nathaniel Hendrix
  • Rishi V Parikh
  • Madeline Taskier
  • Grace Walter
  • Ilia Rochlin
  • Sharon Saydah
  • Emilia H Koumans
  • Oscar Rincón-Guevara
  • David H Rehkopf
  • Robert L Phillips

Abstract

Background: Post-COVID conditions (PCC) have proven difficult to diagnose. In this retrospective observational study, we aimed to characterize the variation in PCC diagnoses across clinicians from several methodological angles and to determine whether natural language classifiers trained on clinical notes can reconcile differences in diagnostic definitions.

Methods: We used data from 519 primary care clinics around the United States that were in the American Family Cohort registry between October 1, 2021 (when the ICD-10 code for PCC was activated) and November 1, 2023. There were 6,116 patients with a diagnostic code for PCC (U09.9), and 5,020 with diagnostic codes for both PCC and COVID-19. We explored these data using 4 different outcomes: 1) Time between COVID-19 and PCC diagnostic codes; 2) Count of patients with PCC diagnostic codes per clinician; 3) Patient-specific probability of PCC diagnostic code based on patient and clinician characteristics; and 4) Performance of a natural language classifier trained on notes from 5,000 patients annotated by two physicians to indicate probable PCC.

Results: Of patients with diagnostic codes for PCC and COVID-19, 61.3% were diagnosed with PCC less than 12 weeks after initial recorded COVID-19. Clinicians in the top 1% of diagnostic propensity accounted for more than a third of all PCC diagnoses (35.8%). Comparing LASSO logistic regressions predicting documentation of PCC diagnosis, a log-likelihood test showed significantly better fit when clinician and practice site indicators were included (p
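
The first two outcomes described in the abstract (time between diagnostic codes, and concentration of diagnoses among clinicians) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the toy dates, and the clinician identifiers are all hypothetical, and the sketch assumes that "top 1%" is ranked only among clinicians who coded at least one PCC diagnosis, which the abstract does not specify.

```python
import math
from collections import Counter
from datetime import date


def share_within_weeks(pairs, weeks=12):
    """Fraction of (covid_date, pcc_date) pairs in which PCC was coded
    strictly less than `weeks` weeks after the COVID-19 code."""
    limit_days = weeks * 7
    early = sum(1 for covid, pcc in pairs if (pcc - covid).days < limit_days)
    return early / len(pairs)


def top_share(clinician_ids, top_fraction=0.01):
    """Share of all diagnoses coded by the top `top_fraction` of clinicians,
    ranked by diagnosis count (assumption: only clinicians with at least
    one diagnosis are ranked)."""
    counts = sorted(Counter(clinician_ids).values(), reverse=True)
    n_top = max(1, math.ceil(len(counts) * top_fraction))
    return sum(counts[:n_top]) / sum(counts)


# Hypothetical toy data, one (COVID-19 date, PCC date) pair per patient.
toy_pairs = [
    (date(2022, 1, 1), date(2022, 2, 1)),   # 31 days apart
    (date(2022, 1, 1), date(2022, 6, 1)),   # 151 days apart
    (date(2022, 3, 1), date(2022, 3, 15)),  # 14 days apart
    (date(2022, 3, 1), date(2022, 5, 24)),  # exactly 84 days: not "less than"
]
# Hypothetical clinician ID attached to each PCC diagnosis.
toy_clinicians = ["a"] * 6 + ["b"] * 2 + ["c", "d"]

early_share = share_within_weeks(toy_pairs)
concentration = top_share(toy_clinicians, top_fraction=0.25)
```

With real registry data, `toy_pairs` would hold each patient's first recorded COVID-19 and PCC code dates, and `toy_clinicians` the coding clinician for each PCC diagnosis; how the study itself defined these is not stated in the abstract.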

Suggested Citation

  • Nathaniel Hendrix & Rishi V Parikh & Madeline Taskier & Grace Walter & Ilia Rochlin & Sharon Saydah & Emilia H Koumans & Oscar Rincón-Guevara & David H Rehkopf & Robert L Phillips, 2025. "Heterogeneity of diagnosis and documentation of post-COVID conditions in primary care: A machine learning analysis," PLOS ONE, Public Library of Science, vol. 20(5), pages 1-12, May.
  • Handle: RePEc:plo:pone00:0324017
    DOI: 10.1371/journal.pone.0324017

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0324017
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0324017&type=printable
    Download Restriction: no

