Authors
- Rustam A. Erizhokov
(Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies, Moscow Health Care Department, Petrovka Street, 24, Building 1, 127051 Moscow, Russia)
- Alexander E. Gordeev
(Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies, Moscow Health Care Department, Petrovka Street, 24, Building 1, 127051 Moscow, Russia;
Faculty of Computer Science, National Research University Higher School of Economics, 11 Pokrovsky Bulvar, 109028 Moscow, Russia)
- Polina A. Sakharova
(Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies, Moscow Health Care Department, Petrovka Street, 24, Building 1, 127051 Moscow, Russia;
MIREA—Russian Technological University, 78 Vernadsky Avenue, 119454 Moscow, Russia)
- Adel A. Yafarova
(Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies, Moscow Health Care Department, Petrovka Street, 24, Building 1, 127051 Moscow, Russia)
- Maria D. Varyukhina
(Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies, Moscow Health Care Department, Petrovka Street, 24, Building 1, 127051 Moscow, Russia)
- Ivan A. Blokhin
(Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies, Moscow Health Care Department, Petrovka Street, 24, Building 1, 127051 Moscow, Russia)
- Olga V. Omelyanskaya
(Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies, Moscow Health Care Department, Petrovka Street, 24, Building 1, 127051 Moscow, Russia;
MIREA—Russian Technological University, 78 Vernadsky Avenue, 119454 Moscow, Russia)
- Anton V. Vladzymyrskyy
(Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies, Moscow Health Care Department, Petrovka Street, 24, Building 1, 127051 Moscow, Russia;
Department of Information Technologies and Medical Data Processing, I.M. Sechenov First Moscow State Medical University (Sechenov University), Trubetskaya Street, 8, Building 2, 119991 Moscow, Russia)
- Yuriy A. Vasilev
(Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies, Moscow Health Care Department, Petrovka Street, 24, Building 1, 127051 Moscow, Russia)
Abstract
Large language models (LLMs) are increasingly being used in radiology-related workflows, but their application to reference, regulatory, and methodological queries remains limited by hallucinations and the static nature of model knowledge. This study aimed to develop and evaluate a retrieval-augmented generation (RAG) system for radiologists designed to provide grounded responses to such queries. A knowledge base was created through a survey of practicing radiologists and expert validation of sources, resulting in a corpus of 1049 documents. The system incorporated structured document parsing, a two-level parent–child vector database, hybrid dense–sparse retrieval, reranking, and a local large language model. Performance was assessed through functional testing, automated LLM-as-a-judge metrics, and multireader expert evaluation by 16 radiologists using 400 technical queries. No hallucinations were detected in the 77-query functional testing set during expert review. On the full technical dataset, automated Contextual Precision, Contextual Recall, and Answer Relevancy were 0.735, 0.881, and 0.890, respectively. Expert evaluation showed high response accuracy (mean, 4.53/5) and high expert-assessed Contextual Precision (0.886). Inter-expert agreement was substantial to excellent for most Likert-scale criteria. These findings suggest that a hierarchical RAG architecture can provide reliable access to radiology-specific reference information, although external validation and automated updating of the knowledge base remain necessary.
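The retrieval stage named in the abstract (hybrid dense–sparse retrieval over a two-level parent–child store) can be illustrated with a minimal sketch. The abstract does not say how the two rankings are combined, so reciprocal rank fusion is used here purely as an assumed stand-in. The chunk ids, the `child_to_parent` mapping, and the function names are hypothetical, and the reranking and generation steps are omitted.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of child-chunk ids into one list, best first.

    Assumption: RRF is a stand-in; the paper's actual fusion method is not
    stated in the abstract.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, chunk_id in enumerate(ranking, start=1):
            scores[chunk_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by id so the order is deterministic.
    return sorted(scores, key=lambda c: (-scores[c], c))


def parents_for(child_ids, child_to_parent, limit=3):
    """Map ranked child chunks to unique parent sections, preserving rank order."""
    seen, parents = set(), []
    for child in child_ids:
        parent = child_to_parent[child]
        if parent not in seen:
            seen.add(parent)
            parents.append(parent)
        if len(parents) == limit:
            break
    return parents


# Hypothetical toy data: ranked child-chunk ids from a dense and a sparse retriever.
dense_hits = ["c1", "c4", "c2"]
sparse_hits = ["c4", "c3", "c1"]
child_to_parent = {
    "c1": "doc_A/sec1",
    "c2": "doc_A/sec1",
    "c3": "doc_B/sec2",
    "c4": "doc_C/sec1",
}

fused = reciprocal_rank_fusion([dense_hits, sparse_hits])
context_parents = parents_for(fused, child_to_parent, limit=2)
print(fused)
print(context_parents)
```

Small child chunks are matched against the query, while the larger parent sections they belong to are what would be passed to the language model as context. Deduplicating parents keeps two hits from the same section from crowding out other sources.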
Suggested Citation
Rustam A. Erizhokov & Alexander E. Gordeev & Polina A. Sakharova & Adel A. Yafarova & Maria D. Varyukhina & Ivan A. Blokhin & Olga V. Omelyanskaya & Anton V. Vladzymyrskyy & Yuriy A. Vasilev, 2026.
"Development and Multireader Evaluation of Radiological RAG-System,"
Data, MDPI, vol. 11(6), pages 1-27, June.
Handle:
RePEc:gam:jdataj:v:11:y:2026:i:6:p:143-:d:1965847