• Acta Photonica Sinica
  • Vol. 52, Issue 4, 0428002 (2023)
Wensheng FAN, Fan LIU*, and Ming LI
Author Affiliations
  • College of Data Science, Taiyuan University of Technology, Jinzhong 030600, China
  • show less
    DOI: 10.3788/gzxb20235204.0428002 Cite this Article
    Wensheng FAN, Fan LIU, Ming LI. Remote Sensing Image Fusion Based on Two-branch U-shaped Transformer[J]. Acta Photonica Sinica, 2023, 52(4): 0428002 Copy Citation Text show less
    Flowchart of two successive STBs
    Fig. 1. Flowchart of two successive STBs
    The overall network architecture
    Fig. 2. The overall network architecture
    Schematic of different modules in the decoder
    Fig. 3. Schematic of different modules in the decoder
    Comparison on fusion performance and computational complexity of the model variants
    Fig. 4. Comparison on fusion performance and computational complexity of the model variants
    Reduced-resolution GF2 testing images and fusion results of different methods
    Fig. 5. Reduced-resolution GF2 testing images and fusion results of different methods
    Residual maps between the fusion results on the GF2 testing images and the reference image
    Fig. 6. Residual maps between the fusion results on the GF2 testing images and the reference image
    Reduced-resolution QB testing images and fusion results of different methods
    Fig. 7. Reduced-resolution QB testing images and fusion results of different methods
    Residual maps between the fusion results on the QB testing images and the reference image
    Fig. 8. Residual maps between the fusion results on the QB testing images and the reference image
    Reduced-resolution WV3 testing images and fusion results of different methods
    Fig. 9. Reduced-resolution WV3 testing images and fusion results of different methods
    Residual maps between the fusion results on the WV3 testing images and the reference image
    Fig. 10. Residual maps between the fusion results on the WV3 testing images and the reference image
    Full-resolution GF2 testing images and fusion results of different methods
    Fig. 11. Full-resolution GF2 testing images and fusion results of different methods
    The visualization of multi-level feature maps output by the encoder in each stage
    Fig. 12. The visualization of multi-level feature maps output by the encoder in each stage
    VariantCPGF2QBWV3
    SAMERGASsCCQ4SAMERGASsCCQ4SAMERGASsCCQ8
    p812880.725 70.597 20.991 80.985 10.979 80.797 10.990 40.977 72.565 61.577 80.990 60.975 2
    c969640.736 60.574 90.991 60.986 20.943 00.767 00.990 30.981 52.355 71.419 40.992 60.979 5
    Proposed12840.713 20.564 30.992 10.986 70.865 40.691 30.992 80.984 12.276 91.369 50.993 10.980 6
    c19219240.692 40.558 00.992 50.987 70.770 40.618 90.994 90.986 42.185 11.312 50.993 50.981 5
    Ideal001100110011
    Table 1. Comparison on fused results of different model variants
    MethodGF2QBWV3
    SAMERGASsCCQ4SAMERGASsCCQ4SAMERGASsCCQ8
    BDSD2.120 32.596 40.886 40.800 91.480 01.076 10.980 10.967 64.137 12.820 50.964 40.941 3
    GSA2.090 32.671 80.876 10.781 61.470 81.062 30.977 20.968 23.792 92.586 30.963 30.946 7
    MTF-GLP-HPM2.041 53.022 30.850 80.743 71.435 81.072 60.979 40.966 73.922 13.493 80.928 70.943 4
    SR-D1.774 51.843 30.919 90.875 81.519 51.232 80.970 70.959 73.681 92.774 50.956 50.931 6
    PNN1.191 00.954 60.974 20.963 31.142 50.907 60.985 20.973 92.878 91.749 60.986 20.970 7
    RSIFNN1.160 51.003 70.970 90.959 91.206 21.016 40.980 60.968 12.902 11.743 80.985 60.970 8
    PSGAN0.777 20.582 90.990 50.985 40.945 10.725 30.991 50.982 62.555 71.579 20.990 30.974 8
    NLRNet0.834 90.625 90.988 80.983 10.977 90.753 80.990 50.981 32.513 91.511 50.990 90.976 5
    PanColorGAN0.961 30.686 30.990 10.984 51.096 90.863 80.991 10.983 32.469 61.462 10.992 00.977 6
    Proposed0.713 20.564 30.992 10.986 70.865 40.691 30.992 80.984 12.276 91.369 50.993 10.980 6
    Ideal001100110011
    Table 2. Quantitative comparison on fused results of different methods at reduced resolution
    MethodGF2QBWV3
    DλDSHQNRDλDSHQNRDλDSHQNR
    BDSD0.088 40.224 60.707 20.058 40.023 50.919 50.081 10.041 20.881 0
    GSA0.203 70.324 30.544 80.057 50.039 40.905 50.061 60.051 70.890 0
    MTF-GLP-HPM0.048 00.237 10.726 70.021 20.037 50.942 10.029 50.044 20.927 5
    SR-D0.014 80.035 60.950 00.022 40.026 30.951 90.026 90.030 40.943 6
    PNN0.012 80.046 80.941 00.028 50.013 90.958 00.036 60.024 30.940 1
    RSIFNN0.014 60.090 50.896 30.018 70.014 10.967 50.029 60.022 80.948 3
    PSGAN0.010 90.018 00.971 30.017 70.028 50.954 30.038 70.025 00.937 4
    NLRNet0.012 40.016 80.971 10.032 00.013 80.954 60.041 60.023 40.935 9
    PanColorGAN0.021 80.015 60.963 00.022 40.015 60.962 40.032 30.026 60.941 9
    Proposed0.009 60.014 70.975 90.021 70.010 40.968 10.036 20.012 80.951 5
    Ideal001001001
    Table 3. Quantitative comparison on fused results of different methods at full resolution
    MethodRuntime/sParameters
    BDSD0.012 1-
    GSA0.006 7-
    MTF-GLP-HPM0.027 9-
    SR-D0.188 4-
    PNN0.001 30.080×106
    RSIFNN0.002 50.338×106
    PSGAN0.011 11.888×106
    NLRNet0.014 84.850×106
    PanColorGAN0.053 732.617×106
    Proposed0.023 458.571×106
    Table 4. Comparison on runtimes and parameters of different methods
    Wensheng FAN, Fan LIU, Ming LI. Remote Sensing Image Fusion Based on Two-branch U-shaped Transformer[J]. Acta Photonica Sinica, 2023, 52(4): 0428002
    Download Citation