• Laser & Optoelectronics Progress
  • Vol. 59, Issue 8, 0810015 (2022)
Xin Wang and Kaijun Wu*
Author Affiliations
  • College of Information Technology, Shanghai Ocean University, Shanghai 201306, China
  • show less
    DOI: 10.3788/LOP202259.0810015 Cite this Article Set citation alerts
    Xin Wang, Kaijun Wu. Real-Time Semantic Segmentation Network Based on Octave Convolution[J]. Laser & Optoelectronics Progress, 2022, 59(8): 0810015 Copy Citation Text show less
    High frequency and low frequency of feature map
    Fig. 1. High frequency and low frequency of feature map
    Schematic of OctConv operation
    Fig. 2. Schematic of OctConv operation
    Comparison between swish and ReLU
    Fig. 3. Comparison between swish and ReLU
    Comparison between swish function and hard-swish function
    Fig. 4. Comparison between swish function and hard-swish function
    OTCH-L module
    Fig. 5. OTCH-L module
    Proposed model
    Fig. 6. Proposed model
    Schematic of MIoU
    Fig. 7. Schematic of MIoU
    Influence of α on network performance
    Fig. 8. Influence of α on network performance
    Schematic of residual modules with different dimensionality reduction channels
    Fig. 9. Schematic of residual modules with different dimensionality reduction channels
    Comparison of experimental results
    Fig. 10. Comparison of experimental results
    CategoryContent
    Personperson
    Animalbird, cat, cow, dog, horse, sheep
    Vehicleplane, bicycle, boat, bus, car, motorbike, train
    Indoorbottle, chair, dining table, potted plant, sofa, tv /monitor
    Table 1. Categories contained in the dataset
    ParameterValue
    Batch size8
    Input image size473×473
    Epoch100
    Learning rate10-6
    Table 2. Model super parameters
    NetworkαAccuracy /%Speed /(frame·s-1
    ResNet-5074.58.1
    R+OctConv074.78.9
    R+OctConv0.376.211.3
    R+OctConv0.675.514.1
    R+OctConv0.973.117.5
    MobileNet v366.115.3
    M+OctConv066.815.6
    M+OctConv0.368.319.2
    M+OctConv0.667.122.3
    M+OctConv0.965.725.4
    Table 3. Influence of different α values on model performance
    ModuleMIoU /%Parameter /103Speed /(frame·s-1
    OTCH-L(C71.5475622.86
    OTCH-L(C/2)70.3457125.94
    OTCH-L(C/4)67.0147726.42
    Table 4. Performance comparison between neck blocks with different dimensionality reduction channels
    NetworkMIoU /%MPA /%Speed /(frame·s-1
    SegNet59.8672.4510.57
    PSPNet65.2881.5116.68
    DeepLab v3 plus66.3382.0419.56
    This Paper70.3483.6525.94
    Table 5. Performance comparison between different networks
    NetworkMIoU /%Speed /(frame·s-1
    Other paper68.4314.14
    This Paper70.3425.94
    Table 6. Performance comparison between the proposed network and a recent algorithm
    ModuleMIoU /%MPA /%Speed /(frame·s-1
    ResNet5072.0785.0611.68
    Xception69.3383.7417.56
    MobileNet v367.9281.3926.60
    Improved MobileNet v370.3483.6525.94
    Table 7. ‍Performance comparison of different feature extraction networks
    Xin Wang, Kaijun Wu. Real-Time Semantic Segmentation Network Based on Octave Convolution[J]. Laser & Optoelectronics Progress, 2022, 59(8): 0810015
    Download Citation