IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v20y2023i4p3059-d1063704.html
   My bibliography  Save this article

HyFormer: Hybrid Transformer and CNN for Pixel-Level Multispectral Image Land Cover Classification

Author

Listed:
  • Chuan Yan

    (School of Automation, Guangxi University of Science and Technology, Liuzhou 545006, China)

  • Xiangsuo Fan

    (School of Automation, Guangxi University of Science and Technology, Liuzhou 545006, China
    Guangxi Collaborative Innovation Centre for Earthmoving Machinery, Guangxi University of Science and Technology, Liuzhou 545006, China)

  • Jinlong Fan

    (National Satellite Meteorological Center, China Meteorological Administration, Beijing 100081, China)

  • Ling Yu

    (School of Automation, Guangxi University of Science and Technology, Liuzhou 545006, China)

  • Nayi Wang

    (School of Automation, Guangxi University of Science and Technology, Liuzhou 545006, China)

  • Lin Chen

    (School of Automation, Guangxi University of Science and Technology, Liuzhou 545006, China)

  • Xuyang Li

    (School of Automation, Guangxi University of Science and Technology, Liuzhou 545006, China)

Abstract

To effectively solve the problems that most convolutional neural networks cannot be applied to the pixelwise input in remote sensing (RS) classification and cannot adequately represent the spectral sequence information, we propose a new multispectral RS image classification framework called HyFormer based on Transformer. First, a network framework combining a fully connected layer (FC) and convolutional neural network (CNN) is designed, and the 1D pixelwise spectral sequences obtained from the fully connected layers are reshaped into a 3D spectral feature matrix for the input of CNN, which enhances the dimensionality of the features through FC as well as increasing the feature expressiveness, and can solve the problem that 2D CNN cannot achieve pixel-level classification. Secondly, the features of the three levels of CNN are extracted and combined with the linearly transformed spectral information to enhance the information expression capability, and also used as the input of the transformer encoder to improve the features of CNN using the powerful global modelling capability of the Transformer, and finally the skip connection of the adjacent encoders to enhance the fusion between different levels of information. The pixel classification results are obtained by MLP Head. In this paper, we mainly focus on the feature distribution in the eastern part of Changxing County and the central part of Nanxun District, Zhejiang Province, and conduct experiments based on Sentinel-2 multispectral RS images. The experimental results show that the overall accuracy of HyFormer for the study area classification in Changxing County is 95.37% and that of Transformer (ViT) is 94.15%. The experimental results show that the overall accuracy of HyFormer for the study area classification in Nanxun District is 95.4% and that of Transformer (ViT) is 94.69%, and the performance of HyFormer on the Sentinel-2 dataset is better than that of the Transformer.

Suggested Citation

  • Chuan Yan & Xiangsuo Fan & Jinlong Fan & Ling Yu & Nayi Wang & Lin Chen & Xuyang Li, 2023. "HyFormer: Hybrid Transformer and CNN for Pixel-Level Multispectral Image Land Cover Classification," IJERPH, MDPI, vol. 20(4), pages 1-25, February.
  • Handle: RePEc:gam:jijerp:v:20:y:2023:i:4:p:3059-:d:1063704
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/20/4/3059/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/20/4/3059/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:20:y:2023:i:4:p:3059-:d:1063704. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.