• Acta Optica Sinica
  • Vol. 37, Issue 12, 1215003 (2017)
Shouchuan Wu1, Haitao Zhao1、*, and Shaoyuan Sun2
Author Affiliations
  • 1 School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China
  • 2 School of Information Science and Technology, Donghua University, Shanghai 201620, China
  • show less
    DOI: 10.3788/AOS201737.1215003 Cite this Article Set citation alerts
    Shouchuan Wu, Haitao Zhao, Shaoyuan Sun. Depth Estimation from Monocular Infrared Video Based on Bi-Recursive Convolutional Neural Network[J]. Acta Optica Sinica, 2017, 37(12): 1215003 Copy Citation Text show less
    Mechanism of bi-recursive convolution
    Fig. 1. Mechanism of bi-recursive convolution
    Diagram for BrCNN
    Fig. 2. Diagram for BrCNN
    Effect of dividing ground truth depth of infrared video into different depth levels. (a) Infrared video; (b) ground truth; (c) 10 layers; (d) 20 layers; (e) 30 layers
    Fig. 3. Effect of dividing ground truth depth of infrared video into different depth levels. (a) Infrared video; (b) ground truth; (c) 10 layers; (d) 20 layers; (e) 30 layers
    Output of different bi-recursive convolutional layers. (a) Infrared video; (b) output of the first bi-recursive convolutional layer; (c) output of the second bi-recursive convolutional layer; (d) output of the third bi-recursive convolutional layer
    Fig. 4. Output of different bi-recursive convolutional layers. (a) Infrared video; (b) output of the first bi-recursive convolutional layer; (c) output of the second bi-recursive convolutional layer; (d) output of the third bi-recursive convolutional layer
    Depth estimation results of different models. (a) Infrared video; (b) ground truth depth; (c) depth estimated by BrCNN; (d) depth estimated by AlexNet; (e) depth estimated by VGG16; (f) depth estimated by Res34
    Fig. 5. Depth estimation results of different models. (a) Infrared video; (b) ground truth depth; (c) depth estimated by BrCNN; (d) depth estimated by AlexNet; (e) depth estimated by VGG16; (f) depth estimated by Res34
    (a) Mean relative error of BrCNN and VGG16 model; (b) 2-norm of difference between adjacent frames
    Fig. 6. (a) Mean relative error of BrCNN and VGG16 model; (b) 2-norm of difference between adjacent frames
    Depth estimation results of larger 2-norm between frame differences. (a) Continuous three frames of infrared video; (b) estimation results of BrCNN
    Fig. 7. Depth estimation results of larger 2-norm between frame differences. (a) Continuous three frames of infrared video; (b) estimation results of BrCNN
    MethodAccuracyError
    ζ<1.25ζ<1.252ζ<1.253MRERMSELE
    BrCNN0.7620.9010.9560.21410.2010.083
    AlexNet[12]0.7370.8730.9080.25810.7970.090
    VGG16[13]0.7190.8680.9120.27410.8420.093
    Res34[14]0.7210.8700.9210.27510.7820.089
    Table 1. Comparison of depth estimation results between BrCNN model and other classical network models
    Shouchuan Wu, Haitao Zhao, Shaoyuan Sun. Depth Estimation from Monocular Infrared Video Based on Bi-Recursive Convolutional Neural Network[J]. Acta Optica Sinica, 2017, 37(12): 1215003
    Download Citation