• Laser & Optoelectronics Progress
  • Vol. 56, Issue 9, 091003 (2019)
Guanghong Tan, Jin Hou*, Yanpeng Han, and Shuo Luo
Author Affiliations
  • School of Information Science and Technology, Southwest Jiaotong University, Chengdu, Sichuan 611756, China
  • show less
    DOI: 10.3788/LOP56.091003 Cite this Article Set citation alerts
    Guanghong Tan, Jin Hou, Yanpeng Han, Shuo Luo. Low-Parameter Real-Time Image Segmentation Algorithm Based on Convolutional Neural Network[J]. Laser & Optoelectronics Progress, 2019, 56(9): 091003 Copy Citation Text show less
    Convolution kernel. (a) Classical convolution kernel; (b) dilated convolution kernel Rrate=2; (c) dilated convolution kernel Rrate=3
    Fig. 1. Convolution kernel. (a) Classical convolution kernel; (b) dilated convolution kernel Rrate=2; (c) dilated convolution kernel Rrate=3
    Atrous-Fire modular structure
    Fig. 2. Atrous-Fire modular structure
    Dilated convolution kernel and initial characteristic graphs. (a) Sawtooth structure convolution kernel; (b) no grid feature graph; (c) grid feature graph
    Fig. 3. Dilated convolution kernel and initial characteristic graphs. (a) Sawtooth structure convolution kernel; (b) no grid feature graph; (c) grid feature graph
    Network structure of Atrous-squeezeseg
    Fig. 4. Network structure of Atrous-squeezeseg
    Training loss value curves
    Fig. 5. Training loss value curves
    Validation loss value curves
    Fig. 6. Validation loss value curves
    Effect comparison of ADE20K. (a) Original images; (b) ground truth; (c) proposed algorithm; (d) Squeezeseg+FCN; (e) VGG16+FCN; (f) SqueezeNet+FCN; (g) without dilated; (h) without BN
    Fig. 7. Effect comparison of ADE20K. (a) Original images; (b) ground truth; (c) proposed algorithm; (d) Squeezeseg+FCN; (e) VGG16+FCN; (f) SqueezeNet+FCN; (g) without dilated; (h) without BN
    Effect comparison of PASCAL VOC. (a) Original images; (b) ground truth; (c) proposed algorithm; (d) Squeezeseg+FCN; (e) VGG16+FCN; (f) SqueezeNet+FCN; (g) without dilated; (h) without BN
    Fig. 8. Effect comparison of PASCAL VOC. (a) Original images; (b) ground truth; (c) proposed algorithm; (d) Squeezeseg+FCN; (e) VGG16+FCN; (f) SqueezeNet+FCN; (g) without dilated; (h) without BN
    Layer nameOutput sizeSqueeze(S1)Expand(E1/E3)Rrate
    Input image224×224×3
    Conv1112×112×64
    Maxpool156×56×64
    Atrous-Fire1 (3×)56×56×25616322/5/7
    Maxpool228×28×256
    Atrous-Fire2 (3×)28×28×25632642/3/5
    Maxpool314×14×256
    Atrous-Fire3 (3×)14×14×256641282/3/5
    Atrous-Fire4 (2×)14×14×5121282561/2
    Atrous-Fire4 (2×)14×14×5121282561/1
    Table 1. Encoder parameters
    MethodNumber ofparametersMIUBuildingSkyCarTreeRoadPersonFloorWall
    Atrous-squeezeseg21.0962.967.584.061.458.164.749.160.458.5
    Squeezeseg+FCN54.6555.961.885.848.451.861.532.353.852.2
    VGG16+FCN66.2163.268.386.861.158.266.048.558.357.4
    SqueezeNet+FCN54.6550.546.783.844.751.855.528.047.346.7
    Atrous-squeezeseg(without dilated)21.0950.651.183.541.243.853.829.751.450.1
    Atrous-squeezeseg(without BN)21.0951.651.383.543.645.858.029.850.151.3
    Table 2. Number of parameters of different semantic segmentation models and MIU
    MethodFPS /(frame·s-1)PA /%
    GTX 1080TiNVIDIA TX2
    Atrous-squeezeseg45.38.359.5
    Squeezeseg+FCN39.54.259.3
    VGG16+FCN29.61.959.8
    SqueezeNet+FCN46.64.555.6
    Atrous-squeezeseg(without dilated)45.68.456.1
    Atrous-squeezeseg(without BN)56.29.257.3
    Table 3. PA and FPS of model in different devices
    Guanghong Tan, Jin Hou, Yanpeng Han, Shuo Luo. Low-Parameter Real-Time Image Segmentation Algorithm Based on Convolutional Neural Network[J]. Laser & Optoelectronics Progress, 2019, 56(9): 091003
    Download Citation