IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v13y2025i9p1492-d1647007.html
   My bibliography  Save this article

RMPT: Reinforced Memory-Driven Pure Transformer for Automatic Chest X-Ray Report Generation

Author

Listed:
  • Caijie Qin

    (Institute of Information Engineering, Sanming University, Sanming 365004, China)

  • Yize Xiong

    (Institute of Information Engineering, Sanming University, Sanming 365004, China)

  • Weibin Chen

    (Qingdao Nuocheng Chemicals Safty Technology Co., Ltd., Qingdao 266071, China)

  • Yong Li

    (Institute of Information Engineering, Sanming University, Sanming 365004, China)

Abstract

Automatic generation of chest X-ray reports, designed to produce clinically precise descriptions from chest X-ray images, is gaining significant research attention because of its vast potential in clinical applications. Recently, despite considerable progress, current models typically adhere to a CNN–Transformer-based framework, which still fails to enhance the perceptual field during image feature extraction. To solve this problem, we propose the Reinforced Memory-driven Pure Transformer (RMPT), which is a novel Transformer–Transformer-based model. In implementation, our RMPT employs the Swin Transformer to extract visual features from given X-ray images, which has a larger perceptual field to better model the relationships between different regions. Furthermore, we adopt a memory-driven Transformer (MemTrans) to effectively model similar patterns in different reports, which is able to facilitate the model to generate long reports. Finally, we present an innovative training approach leveraging Reinforcement Learning (RL) that efficiently steers the model to focus on challenging samples, consequently improving its comprehensive performance across both straightforward and complex situations. Experimental results on the IU X-ray dataset show that our proposed RMPT achieves superior performance on various Natural Language Generation (NLG) evaluation metrics. Further ablation study results demonstrate that our RMPT model achieves 10.5% overall performance compared to the base mode.

Suggested Citation

  • Caijie Qin & Yize Xiong & Weibin Chen & Yong Li, 2025. "RMPT: Reinforced Memory-Driven Pure Transformer for Automatic Chest X-Ray Report Generation," Mathematics, MDPI, vol. 13(9), pages 1-14, April.
  • Handle: RePEc:gam:jmathe:v:13:y:2025:i:9:p:1492-:d:1647007
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/13/9/1492/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/13/9/1492/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:13:y:2025:i:9:p:1492-:d:1647007. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.