IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0340499.html

CORE-Net: A cross-modal orthogonal representation enhancement network for low-altitude multispectral object detection

Author

Listed:
  • Daoze Tang
  • Shuyun Tang
  • Dequan Zheng

Abstract

Object detection in visible light (RGB) images is frequently compromised by low-illumination conditions, whereas infrared (IR) imaging typically exhibits superior robustness in such environments. Multispectral fusion addresses this limitation by leveraging complementary information from both modalities; however, existing methods predominantly rely on intricate fusion modules to integrate cross-modal features, inevitably incurring significant computational overhead and architectural complexity. To mitigate this issue, we propose a novel Cross-modal Orthogonal Representation Enhancement Network (CORE-Net). Diverging from conventional heavy-fusion paradigms, our framework adopts a dual-branch architecture integrated with a streamlined Cross-modal Concatenation Network Framework (CCNF), which achieves efficient feature integration while substantially reducing model complexity. Furthermore, CORE-Net incorporates two distinct components—the Multiple Pooling Convolution Downsampling (MPCD) module and the Refined Integration Network (RINet)—specifically designed to optimize feature extraction capabilities. Extensive evaluations on the DroneVehicle and LLVIP datasets demonstrate that CORE-Net achieves state-of-the-art (SOTA) performance in terms of both detection accuracy and computational efficiency. Ablation studies substantiate the individual and synergistic contributions of each proposed component, while deployment on edge devices further corroborates the model’s practical efficiency. Additionally, qualitative visualizations confirm the model’s efficacy in suppressing background noise and enhancing discriminative fine-grained features. In summary, CORE-Net establishes a robust new paradigm for high-performance and efficient multispectral object detection.

Suggested Citation

  • Daoze Tang & Shuyun Tang & Dequan Zheng, 2026. "CORE-Net: A cross-modal orthogonal representation enhancement network for low-altitude multispectral object detection," PLOS ONE, Public Library of Science, vol. 21(4), pages 1-23, April.
  • Handle: RePEc:plo:pone00:0340499
    DOI: 10.1371/journal.pone.0340499
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0340499
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0340499&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0340499?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0340499. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.