• Infrared and Laser Engineering
  • Vol. 47, Issue 2, 203002 (2018)
Yang Nan1,2, Nan Lin1,2, Zhang Dingyi1,2, and Ku Tao1,2
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
    DOI: 10.3788/irla201847.0203002
    Yang Nan, Nan Lin, Zhang Dingyi, Ku Tao. Research on image interpretation based on deep learning[J]. Infrared and Laser Engineering, 2018, 47(2): 203002
    References

    [1] Xu Feng, Lu Jiangang, Sun Youxian. Application of neural network in image processing[J]. Chinese Journal of Information and Control, 2003, 4(1): 344-351. (in Chinese)

    [2] Farhadi A, Hejrati M, Sadeghi M A, et al. Every picture tells a story: Generating sentences from images[J]. ECCV, 2010, 21(10): 15-29.

    [3] Kulkarni G, Premraj V, Dhar S, et al. Baby talk: Understanding and generating simple image descriptions[J]. CVPR, 2014, 35(12): 1601-1608.

    [4] Cho K, van Merrienboer B, Gulcehre C, et al. Learning phrase representations using RNN encoder-decoder for statistical machine translation[J]. EMNLP, 2014, 14(6): 1078-1093.

    [5] Vinyals O, Toshev A, Bengio S, et al. Show and tell: A neural image caption generator[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015: 3156-3164.

    [6] Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks[C]//Proceedings of Advances in Neural Information Processing Systems (NIPS), 2012: 1097-1105.

    [7] Sermanet P, Eigen D, Zhang X, et al. OverFeat: Integrated recognition, localization and detection using convolutional networks[J]. arXiv preprint arXiv:1312.6229, 2013.

    [8] Gerber R, Nagel H H. Knowledge representation for the generation of quantified natural language descriptions of vehicle traffic in image sequences[C]//Proceedings of the IEEE International Conference on Image Processing, 1996: 805-808.

    [9] Yao B Z, Yang X, Lin L, et al. I2T: Image parsing to text description[J]. Proceedings of the IEEE, 2010, 98(8): 1485-1508.

    [10] Li S, Kulkarni G, Berg T L, et al. Composing simple image descriptions using web-scale n-grams[C]//Proceedings of the Conference on Computational Natural Language Learning, 2011.

    [11] Aker A, Gaizauskas R. Generating image descriptions using dependency relational patterns[C]//Proceedings of the Meeting of the Association for Computational Linguistics (ACL), 2010, 49(9): 1250-1258.

    [12] Hodosh M, Young P, Hockenmaier J. Framing image description as a ranking task: Data, models and evaluation metrics[J]. Journal of Artificial Intelligence Research, 2013, 47: 853-899.

    [13] Wen Ya, Nan Lin. Research on semantic analysis method of image based on natural language understanding[D]. Shenyang: Shenyang Institute of Automation, Chinese Academy of Sciences, 2017. (in Chinese)

    CLP Journals

    [1] Yang Zhang, Yulong He, Yu Ning, Quan Sun, Jun Li, Xiaojun Xu. Method of inverting wavefront phase from far-field spot based on deep learning[J]. Infrared and Laser Engineering, 2021, 50(8): 20200363