Predictive pixel-wise optical encoding: towards single-shot high dynamic range moving object recognition

Yutong He; Yu Liang; Honghao Huang; Chengyang Hu; Sigang Yang; Hongwei Chen

doi:10.1364/PRJ.533288

[1] E. Reinhard, G. Ward, S. Pattanaik. High Dynamic Range Imaging: Acquisition, Display, and Image-Based Lighting(2010).

[2] S. K. Nayar, T. Mitsunaga. High dynamic range imaging: spatially varying pixel exposures. IEEE Conference on Computer Vision and Pattern Recognition, 472-479(2000).

[3] F. Dufaux, P. Le Callet, R. Mantiuk. High Dynamic Range Video: From Acquisition, to Display and Applications(2016).

[4] Y.-L. Liu, W.-S. Lai, Y.-S. Chen. Single-image HDR reconstruction by learning to reverse the camera pipeline. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1651-1660(2020).

[5] P. E. Debevec, J. Malik. Recovering high dynamic range radiance maps from photographs. 24th Annual Conference on Computer Graphics and Interactive Techniques, 369-378(1997).

[6] F. Banterle, A. Artusi, K. Debattista. Advanced High Dynamic Range Imaging(2017).

[7] S. W. Hasinoff, F. Durand, W. T. Freeman. Noise-optimal capture for high dynamic range photography. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 553-560(2010).

[8] T. Mertens, J. Kautz, F. Van Reeth. Exposure fusion: a simple and practical alternative to high dynamic range photography. Comput. Graph. Forum, 28, 161-171(2009).

[9] E. Onzon, F. Mannan, F. Heide. Neural auto-exposure for high-dynamic range object detection. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 7710-7720(2021).

[10] Ce. Liu. Beyond Pixels: Exploring New Representations and Applications for Motion Analysis(2009).

[11] Y. Niu, J. Wu, W. Liu. HDR-GAN: HDR image reconstruction from multi-exposed LDR images with large motions. IEEE Trans. Image Process., 30, 3885-3896(2021).

[12] Q. Yan, D. Gong, Q. Shi. Attention-guided network for ghost-free high dynamic range imaging. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1751-1760(2019).

[13] T. Asatsuma, Y. Sakano, S. Iida. Sub-pixel architecture of CMOS image sensor achieving over 120 dB dynamic range with less motion artifact characteristics. International Image Sensor Workshop, R31(2019).

[14] S. Iida, Y. Sakano, T. Asatsuma. A 0.68 e-rms random-noise 121 dB dynamic-range sub-pixel architecture CMOS image sensor with LED flicker mitigation. IEEE International Electron Devices Meeting (IEDM), 10.2.1-10.2.4(2018).

[15] M. D. Tocci, C. Kiser, N. Tocci. A versatile HDR video production system. ACM Trans. Graph., 30, 41(2011).

[16] J. Han, C. Zhou, P. Duan. Neuromorphic camera guided high dynamic range imaging. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1730-1739(2020).

[17] C. A. Metzler, H. Ikoma, Y. Peng. Deep optics for single-shot high-dynamic-range imaging. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1375-1385(2020).

[18] Q. Sun, E. Tseng, Q. Fu. Learning rank-1 diffractive optics for single-shot high dynamic range imaging. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1386-1396(2020).

[19] S. Hajisharif, J. Kronander, J. Unger. Adaptive dualiSO HDR reconstruction. EURASIP Journal on Image and Video Processing, 1-13(2015).

[20] Z. Yang, P. Wang, X. Li. 3D laser scanner system using high dynamic range imaging. Opt. Laser Eng., 54, 31-41(2014).

[21] X. Li, C. Sun, P. Wang. The image adaptive method for solder paste 3D measurement system. Opt. Laser Eng., 66, 41-51(2015).

[22] W. Feng, F. Zhang, W. Wang. Digital micromirror device camera with per-pixel coded exposure for high dynamic range imaging. Appl. Opt., 56, 3831-3840(2017).

[23] B. Niu, X. Qu, X. Guan. Fast HDR image generation method from a single snapshot image based on frequency division multiplexing technology. Opt. Express, 29, 27562-27572(2021).

[24] S. Oprea, P. Martinez-Gonzalez, A. Garcia-Garcia. A review on deep learning techniques for video prediction. IEEE Trans. Pattern Anal. Mach. Intell., 44, 2806-2826(2020).

[25] X. Hu, Z. Huang, A. Huang. A dynamic multi-scale voxel flow network for video prediction. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 6121-6131(2023).

[26] L. Wang, K.-J. Yoon. Deep learning for HDR imaging: state-of-the-art and future trends. IEEE Trans. Pattern Anal. Mach. Intell., 44, 8874-8895(2021).

[27] Q. Wang, X. Lu, C. Zhang. LSV-LP: large-scale video-based license plate detection and recognition. IEEE Trans. Pattern Anal. Mach. Intell., 45, 752-767(2022).

[28] T. Pun. A new method for gray-level picture thresholding using the entropy of the histogram. Signal Process., 2, 223-237(1980).

[29] https://github.com/we0091234/Chinese_license_plate_detection_recognition. https://github.com/we0091234/Chinese_license_plate_detection_recognition

[30] https://doi.org/10.5281/zenodo.3908559. https://doi.org/10.5281/zenodo.3908559

[31] B. Shi, X. Bai, C. Yao. An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Trans. Pattern Anal. Mach. Intell., 39, 2298-2304(2016).

[32] E. Reinhard, M. Stark, P. Shirley. Photographic tone reproduction for digital images. Seminal Graphics Papers: Pushing the Boundaries, 2, 661-670(2023).

[33] https://github.com/dario-loi/exposure-fusion. https://github.com/dario-loi/exposure-fusion

[34] H. Huang, J. Teng, Y. Liang. Key frames assisted hybrid encoding for high-quality compressive video sensing. Opt. Express, 30, 39111-39128(2022).

[35] S. Ri, M. Fujigaki, T. Matui. Accurate pixel-to-pixel correspondence adjustment in a digital micromirror device camera by using the phase-shifting moiré method. Appl. Opt., 45, 6940-6946(2006).

[36] D. Doherty, G. Hewlett. 10.4: Phased reset timing for improved digital micromirror device (DMD) brightness. SID Symposium Digest of Technical Papers 29, 125-128(1998).

[37] W. Sun, C. Tang, Z. Yuan. A 112-765 GOPS/W FPGA-based CNN accelerator using importance map guided adaptive activation sparsification for pix2pix applications. IEEE Asian Solid-State Circuits Conference (A-SSCC), 1-4(2020).

[38] J. Liu, C. Zaouter, X. Liu. Coded-aperture broadband light field imaging using digital micromirror devices. Optica, 8, 139-142(2021).

[39] C. Hu, H. Huang, M. Chen. Video object detection from one single image through opto-electronic neural network. APL Photon., 6, 046104(2021).

[40] M. Jaderberg, K. Simonyan, A. Zisserman. Spatial transformer networks. Advances in Neural Information Processing Systems, 1-9(2015).

[41] Z. Wang, A. C. Bovik, H. R. Sheikh. Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process., 13, 600-612(2004).

[42] Z. Wang, E. Simoncelli, A. Bovik. Multiscale structural similarity for image quality assessment. 37th Asilomar Conference on Signals, Systems & Computers, 2, 1398-1402(2003).

[43] R. Zhang, P. Isola, A. A. Efros. The unreasonable effectiveness of deep features as a perceptual metric. IEEE Conference on Computer Vision and Pattern Recognition, 586-595(2018).

微信扫一扫：分享

微信扫一扫：分享