Depth Estimation from Monocular Infrared Video Based on Bi-Recursive Convolutional Neural Network

Shouchuan Wu; Haitao Zhao; Shaoyuan Sun

doi:10.3788/AOS201737.1215003

[1] Karsch K, Liu C, Kang S B. Depth transfer: Depth extraction from video using non-parametric sampling[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36, 2144-2158(2014). http://europepmc.org/abstract/MED/26353057

[2] Konrad J, Wang M, Ishwar P et al. Learning-based, automatic 2D-to-3D image and video conversion[J]. IEEE Transactions on Image Processing, 22, 3485-3496(2013). http://ieeexplore.ieee.org/document/6544689/

[3] Kong N, Black M J. Intrinsic depth: Improving depth transfer with intrinsic images[C]. IEEE International Conference on Computer Vision, 3514-3522(2015).

[4] Saxena A, Chung S H, Ng A Y. 3D depth reconstruction from a single still image[J]. International Journal of Computer Vision, 76, 53-69(2008). http://link.springer.com/article/10.1007/s11263-007-0071-y

[5] Xi Lin, Sun Shaoyuan, Li Linna et al. Depth estimation from monocular infrared images based on SVM model[J]. Laser & Infrared, 42, 1311-1315(2012).

[6] Xu Lu, Zhao Haitao, Sun Shaoyuan. Monocular infrared image depth estimation based on deep convolutional neural networks[J]. Acta Optica Sinica, 36, 0715002(2016).

[7] Eigen D, Puhrsch C, Fergus R. Depth map prediction from a single image using a multi-scale deep network[C]. Advances in Neural Information Processing Systems, 2366-2374(2014).

[8] Liu F Y, Shen C H, Lin G S. Deep convolutional neural fields for depth estimation from a single image[C]. IEEE Conference on Computer Vision and Pattern Recognition, 5162-5170(2015).

[9] Zhang G F, Jia J Y, Hua W et al. Robust bilayer segmentation and motion/depth estimation with a handheld camera[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33, 603-617(2011). http://ieeexplore.ieee.org/document/5482585/

[10] Akhter I, Sheikh Y, Khan S et al. Trajectory space: A dual representation for nonrigid structure from motion[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33, 1442-1456(2011). http://doi.ieeecomputersociety.org/10.1109/TPAMI.2010.201

[11] Ha H, Im S, Park J et al. High-quality depth from uncalibrated small motion clip[C]. IEEE Conference on Computer Vision and Pattern Recognition, 5413-5421(2016).

[12] Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks[C]. Advances in Neural Information Processing Systems, 1097-1105(2012).

[13] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[C]. International Conference on Learning Representations, 1-14(2015).

[14] He K M, Zhang X Y, Ren S Q et al. Deep residual learning for image recognition[C]. IEEE Conference on Computer Vision and Pattern Recognition, 770-778(2016).

[15] Girshick R, Donahue J, Darrell T et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]. IEEE Conference on Computer Vision and Pattern Recognition, 580-587(2014).

[16] Ren S Q, He K M, Girshick R et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149(2017). http://dl.acm.org/citation.cfm?id=3101780

[17] Redmon J, Divvala S, Girshick R et al. You only look once: Unified, real-time object detection[C]. IEEE Conference on Computer Vision and Pattern Recognition, 779-788(2016).

[18] Hochreiter S, Schmidhuber J. Long short-term memory[J]. Neural Computation, 9, 1735-1780(1997).

[19] Cho K, Van Merriënboer B, Gulcehre C et al. Learning phrase representations using RNN encoder-decoder for statistical machine translation[J]. arXiv preprint, arXiv:1406.1078(2014). http://arxiv.org/abs/1406.1078

[20] Chung J, Gülçehre C, Cho K et al. Gated feedback recurrent neural networks[C]. International Conference on Machine Learning, 2067-2075(2015). http://www.oalib.com/paper/4071471

[21] Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation[C]. IEEE Conference on Computer Vision and Pattern Recognition, 3431-3440(2015).

[22] Kingma D, Ba J. Adam: A method for stochastic optimization[C]. 3rd International Conference on Learning Representations(2015).
