• Acta Optica Sinica
  • Vol. 37, Issue 12, 1215003 (2017)
Shouchuan Wu1, Haitao Zhao1、*, and Shaoyuan Sun2
Author Affiliations
  • 1 School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China
  • 2 School of Information Science and Technology, Donghua University, Shanghai 201620, China
  • show less
    DOI: 10.3788/AOS201737.1215003 Cite this Article Set citation alerts
    Shouchuan Wu, Haitao Zhao, Shaoyuan Sun. Depth Estimation from Monocular Infrared Video Based on Bi-Recursive Convolutional Neural Network[J]. Acta Optica Sinica, 2017, 37(12): 1215003 Copy Citation Text show less
    References

    [1] Karsch K, Liu C, Kang S B. Depth transfer: Depth extraction from video using non-parametric sampling[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36, 2144-2158(2014). http://europepmc.org/abstract/MED/26353057

    [2] Konrad J, Wang M, Ishwar P et al. Learning-based, automatic 2D-to-3D image and video conversion[J]. IEEE Transactions on Image Processing, 22, 3485-3496(2013). http://ieeexplore.ieee.org/document/6544689/

    [3] Kong N, Black M J. Intrinsic depth: Improving depth transfer with intrinsic images[C]. IEEE International Conference on Computer Vision, 3514-3522(2015).

    [4] Saxena A, Chung S H, Ng A Y. 3D depth reconstruction from a single still image[J]. International Journal of Computer Vision, 76, 53-69(2008). http://link.springer.com/article/10.1007/s11263-007-0071-y

    [5] Xi Lin, Sun Shaoyuan, Li Linna et al. Depth estimation from monocular infrared images based on SVM model[J]. Laser & Infrared, 42, 1311-1315(2012).

    [6] Xu Lu, Zhao Haitao, Sun Shaoyuan. Monocular infrared image depth estimation based on deep convolutional neural networks[J]. Acta Optica Sinica, 36, 0715002(2016).

    [7] Eigen D, Puhrsch C, Fergus R. Depth map prediction from a single image using a multi-scale deep network[C]. Advances in Neural Information Processing Systems, 2366-2374(2014).

    [8] Liu F Y, Shen C H, Lin G S. Deep convolutional neural fields for depth estimation from a single image[C]. IEEE Conference on Computer Vision and Pattern Recognition, 5162-5170(2015).

    [9] Zhang G F, Jia J Y, Hua W et al. Robust bilayer segmentation and motion/depth estimation with a handheld camera[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33, 603-617(2011). http://ieeexplore.ieee.org/document/5482585/

    [10] Akhter I, Sheikh Y, Khan S et al. Trajectory space: A dual representation for nonrigid structure from motion[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33, 1442-1456(2011). http://doi.ieeecomputersociety.org/10.1109/TPAMI.2010.201

    [11] Ha H, Im S, Park J et al. High-quality depth from uncalibrated small motion clip[C]. IEEE Conference on Computer Vision and Pattern Recognition, 5413-5421(2016).

    [12] Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks[C]. Advances in Neural Information Processing Systems, 1097-1105(2012).

    [13] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[C]. International Conference on Learning Representations, 1-14(2015).

    [14] He K M, Zhang X Y, Ren S Q et al. Deep residual learning for image recognition[C]. IEEE Conference on Computer Vision and Pattern Recognition, 770-778(2016).

    [15] Girshick R, Donahue J, Darrell T et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]. IEEE Conference on Computer Vision and Pattern Recognition, 580-587(2014).

    [16] Ren S Q, He K M, Girshick R et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149(2017). http://dl.acm.org/citation.cfm?id=3101780

    [17] Redmon J, Divvala S, Girshick R et al. You only look once: Unified, real-time object detection[C]. IEEE Conference on Computer Vision and Pattern Recognition, 779-788(2016).

    [18] Hochreiter S, Schmidhuber J. Long short-term memory[J]. Neural Computation, 9, 1735-1780(1997).

    [19] Cho K, Van Merriënboer B, Gulcehre C et al. Learning phrase representations using RNN encoder-decoder for statistical machine translation[J]. Computer Science(2014). http://arxiv.org/abs/1406.1078

    [20] Chung J, Gülçehre C, Cho K et al. Gated feedback recurrent neural networks[J]. Computer Science, 2067-2075(2015). http://www.oalib.com/paper/4071471

    [21] Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation[C]. IEEE Conference on Computer Vision and Pattern Recognition, 3431-3440(2015).

    [22] Kingma D, Ba J. Adam: A method for stochastic optimization[C]. 3rd International Conference for Learning Representations(2015).

    CLP Journals

    [1] Zhe An, Xiping Xu, Jinhua Yang, Yang Qiao, Yang Liu. Design of Augmented Reality Head-up Display System Based on Image Semantic Segmentation[J]. Acta Optica Sinica, 2018, 38(7): 0710004

    [2] Li Jiahao, Sun Shaoyuan, Wu Xueping, Li Dawei. Unmanned Vehicle Night Infrared Video Colorization Based on Dual-Channel Cycle-Consistent Adversarial Networks[J]. Laser & Optoelectronics Progress, 2018, 55(9): 91505

    [3] Zhenqiang Bao, Aihua Li, Zhigao Cui, Meng Yuan. Research Progress of Deep Learning in Visual Localization and Three-Dimensional Structure Recovery[J]. Laser & Optoelectronics Progress, 2018, 55(5): 050007

    [4] Tingting Gu, Haitao Zhao, Shaoyuan Sun. Depth Estimation of Single Infrared Image Based on Interframe Information Extraction[J]. Laser & Optoelectronics Progress, 2018, 55(6): 061010

    Shouchuan Wu, Haitao Zhao, Shaoyuan Sun. Depth Estimation from Monocular Infrared Video Based on Bi-Recursive Convolutional Neural Network[J]. Acta Optica Sinica, 2017, 37(12): 1215003
    Download Citation