• Acta Photonica Sinica
  • Vol. 51, Issue 11, 1101001 (2022)
Jun XIE, Jianglei DI*, and Yuwen QIN**
Author Affiliations
  • Institute of Advanced Photonics Technology,School of Information Engineering,Guangdong Provincial Key Laboratory of Information Photonics Technology,Guangdong University of Technology,Guangzhou 510006,China
  • show less
    DOI: 10.3788/gzxb20225111.1101001 Cite this Article
    Jun XIE, Jianglei DI, Yuwen QIN. Application of Deep Learning in Underwater Imaging(Invited)[J]. Acta Photonica Sinica, 2022, 51(11): 1101001 Copy Citation Text show less
    Principle of underwater image degradation
    Fig. 1. Principle of underwater image degradation
    Classification of underwater imaging
    Fig. 2. Classification of underwater imaging
    CNN structure
    Fig. 3. CNN structure
    Effects of different algorithms before and after processing[23]
    Fig. 4. Effects of different algorithms before and after processing23
    The applications of deep learning in image enhancement
    Fig. 5. The applications of deep learning in image enhancement
    Image enhancement effect by neural networks
    Fig. 6. Image enhancement effect by neural networks
    WaterGAN model structure[32]
    Fig. 7. WaterGAN model structure32
    Image restoration results[41]
    Fig. 8. Image restoration results41
    Neural network for parameter estimation[56]
    Fig. 9. Neural network for parameter estimation56
    Neural network for image restoration
    Fig. 10. Neural network for image restoration
    Diagram of our two-stage learning[59]
    Fig. 11. Diagram of our two-stage learning59
    Computational polarization difference imaging systems based on Stokes vector[63]
    Fig. 12. Computational polarization difference imaging systems based on Stokes vector63
    The relationship between K(x,y)and ∆D(x,y)[66]
    Fig. 13. The relationship between Kxy)and ∆Dxy66
    Passive under water polarization imaging detection method in neritic area[4]
    Fig. 14. Passive under water polarization imaging detection method in neritic area4
    Recovery results of different underwater objects[61]
    Fig. 15. Recovery results of different underwater objects61
    Recovery results of different underwater objects[70]
    Fig. 16. Recovery results of different underwater objects70
    Neural network for polarimetric underwater image recovery
    Fig. 17. Neural network for polarimetric underwater image recovery
    Four kinds of polarization-intensity information confluence models and its comparative versions[73]
    Fig. 18. Four kinds of polarization-intensity information confluence models and its comparative versions73
    Comparison between raw images and restoration results of eight models[73]
    Fig. 19. Comparison between raw images and restoration results of eight models73
    Schematic diagram of ghost imaging
    Fig. 20. Schematic diagram of ghost imaging
    Structure of CGI[77]
    Fig. 21. Structure of CGI77
    Reconstruction results of CSGI and GIDL at different sampling rates[88]
    Fig. 22. Reconstruction results of CSGI and GIDL at different sampling rates88
    Reconstruction results based on DL and CS methods at different concentrations[87]
    Fig. 23. Reconstruction results based on DL and CS methods at different concentrations87
    Comparison of simulation results of UGI-GAN,UDLGI,and PDLGI at different sampling rates[84]
    Fig. 24. Comparison of simulation results of UGI-GAN,UDLGI,and PDLGI at different sampling rates84
    Hyperspectral image data cube
    Fig. 25. Hyperspectral image data cube
    HyperDiver UHI system and its components[95]
    Fig. 26. HyperDiver UHI system and its components95
    A multi-faced dataset from HyperDiver[95]
    Fig. 27. A multi-faced dataset from HyperDiver95
    Color image of the seabed from UHI and SAM classification[97]
    Fig. 28. Color image of the seabed from UHI and SAM classification97
    Underwater spectral imaging with filterwheel[89]
    Fig. 29. Underwater spectral imaging with filterwheel89
    A tunable LED-based underwater multispectral imaging system[98]
    Fig. 30. A tunable LED-based underwater multispectral imaging system98
    Staring underwater spectral imaging system with optimal waveband subset[100]
    Fig. 31. Staring underwater spectral imaging system with optimal waveband subset100
    Self-supervised hyperspectral and multispectral image fusion network[110]
    Fig. 32. Self-supervised hyperspectral and multispectral image fusion network110
    The structure of single pixel camera[115]
    Fig. 33. The structure of single pixel camera115
    Single-pixel imaging system[116]
    Fig. 34. Single-pixel imaging system116
    Reconstruction results by traditional FSI and FSPI[129]
    Fig. 35. Reconstruction results by traditional FSI and FSPI129
    Reconstruction results of GAN-FSI and FSI at different sampling rates[130]
    Fig. 36. Reconstruction results of GAN-FSI and FSI at different sampling rates130
    CS-SRCNN network structure[133]
    Fig. 37. CS-SRCNN network structure133
    LLS structure
    Fig. 38. LLS structure
    Principle of streak tubeimaging[147]
    Fig. 39. Principle of streak tubeimaging147
    Results of streak tube 3D imaging[151-153]
    Fig. 40. Results of streak tube 3D imaging151-153
    The target imaging with the distance of 20 m in clear water was recorded by the lidar-radar[158]
    Fig. 41. The target imaging with the distance of 20 m in clear water was recorded by the lidar-radar158
    The principle of underwater range-gated imaging system
    Fig. 42. The principle of underwater range-gated imaging system
    Images of underwater target[169]
    Fig. 43. Images of underwater target169
    Holographic imaging structure diagram
    Fig. 44. Holographic imaging structure diagram
    Robot-driven DIHM[198]
    Fig. 45. Robot-driven DIHM198
    Rapidly extract focused targets from underwater digital holograms[212]
    Fig. 46. Rapidly extract focused targets from underwater digital holograms212
    MethodsPrincipleAdvantagesDisadvantagesApplication
    Spatial domainmethodAdjust the gray scale and RGB channels of spatial pixelsEasy to implementandobvious effectsEasy to cause oversaturation and loss of details;Has a certain blindnessAdjust the overall or local over bright(dark)problem;Increase image contrast
    Frequency domainmethodTransform images to the corresponding domain for filteringSeparate high and low frequency information;Enhance edge information;suppress interference noise;High processing efficient in the frequency domainLimited effect on processing color distortion and low contrastDenoising;Deblurring
    Color constancy methodAccording to the relationship between the environment and the target pixel,the environment information is estimated and the raw image is restored according to the hypothesisGreat color restoration effectRely on the accuracy of assumptions;Limited effect on image denoisingColor correction
    Method based on deep learningThe degraded image is restored by using the mapping between degraded image and restored image learned by neural networkNoise removal,color correction and contrast increase can be performed at the same time;No prior information is requiredNetwork training takes time;Heavily dependence on datasets;Poor generalization ability

    Denoising;

    Color correction;Improving contrast

    Table 1. Summary of traditional underwater image enhancement methods and deep learning methods
    MethodsPrincipleAdvantagesDisadvantagesApplication
    Restoration methods based on prioriThe water features and related parameters are estimated by a priori hypothesis,and the images before degradation are restored by physical modelIt is targeted and directional,and avoids blind recovery;Results recovered by physical model are naturalThe choice of a priori hypothesis is subject to subjective influence;The model deviation and other restrictive factors make it difficult to apply in complex water environment

    Deblurring;

    Color correction;Contrast enhancement

    Restoration methods based on deep learningNeural network is used to learn the mapping between degraded image and related parameters to estimate model parameters,and restore the degraded imageIt avoids subjective error caused by artificial selection of prior conditions and has certain generalizationIt heavily relies on datasets;Artificial datasets differ from the real environment;It takes longer time compared with prior method

    Deblurring;

    Color correction;Contrast enhancement

    Table 2. Summary of image restoration methods based on priori and deep learning
    MethodsPrincipleAdvantagesDisadvantagesApplication
    Polarization difference imagingIt uses the difference of the light vibration between the target and the background to remove the background scattering noiseSimple and effectiveThe restoration results of objects with various polarization and details are poorDeblurring;Imaging in scattering media
    Passive polarization imagingAccording to the difference of polarization characteristics between background scattered light and target light under natural light,the clear scene image is reconstructed by using underwater light transmission modelDistance information is added to the physical model,which has a significant restoration effect on complex scenesThe background area needs to be selected manually;The model is only applicable to objects with low degree of polarization;The recovery effect is poor under high scattering concentration;Uniform light field conditions are requiredDeblurring;Imaging in scattering media
    Active polarization imagingThe active complete polarized light source is introduced,and the background scattering noise is removed by using the polarization characteristics difference between the background and the target reflected lightIt is suitable for low illumination environment;Imaging quality is better than underwater passive polarization imagingThe restoration effect is limited when the difference between the target and the background polarization degree is small or the target contains multiple polarization degrees;The assumption that the polarization direction of the target light and the background scattered light in the model is the same is different from the realityDeblurring;Imaging in scattering media
    Polarization imaging based on deep learningIt uses the additional information of polarization on light intensity to improve the effect of traditional intensity image restoration,recognition,fusion and reconstructionIt has better imaging quality and complete details than conventional imagingIt is heavily dependent on datasets and still in preliminary explorationDeblurring;Imaging in scattering media
    Table 3. Summary of underwater polarization imaging methods and deep learning-based methods
    MethodsPrincipleAdvantagesDisadvantagesApplication
    TGIIt calculates the correlation of light field intensity fluctuation to reconstruct the targetStrong anti-interference ability;Lensless imaging;Wide scope of actionIt needs two optical paths,which is complicated in experiment;A large amount of data needs to be collected,and the relevant calculation takes a long time;Low signal-to-noise ratioDenoising;Imaging in scattering media
    CGIThe target image is obtained by calculating the intensity distribution and the second-order correlation of the intensity collected by the detectorThe controllable light field is obtained by SLM or DMD,and the experiment is simplified to a single light path;Greater imaging perspectiveIt still needs to collect a large amount of data,and the relevant calculation takes a long timeDenoising;Imaging in scattering media
    CSGICompressed sensing is used for sparse sampling reconstruction of ghost imageIt can reconstruct high-quality images at low sampling rate and shorten the sampling time;It hashigh signal to noise ratioIt needs mass computing,and signal processing takes long timeImaging at a low sampling rate;Super resolution imaging
    DIGLThe neural network is used to learn the mapping between blurred image and clear image,or signal collected by bucket detector and reconstructed imaging to reconstruct the imageIt avoids using illumination mode and acquires high quality images at a low sampling rate;The reconstruction from barrel detector avoids the complex calculation of CS reconstruction and has better resultsItstill needs mass computing,and heavily relies on datasetsImaging at a low sampling rate
    Table 4. Summary of different ghost imaging methods and methods based on deep learning
    MethodsPrincipleAdvantagesDisadvantagesApplication
    Matrix factorizationBased on the linear spectral hybrid model,the end element spectral matrix with high spectral resolution and the abundance matrix with high spatial resolution are obtained by alternating non negative matrix decomposition of HS and MS data,and then the fused image with high spatial resolution and high spectral resolution are obtained by multiplicationThe model theory is simple,easy to implement and close to the actual situationIt requires iterative solution and mass computing;Model parameters are sensitive and difficult to set;It relies on observation modelHS and MS fusion
    Tensor decompositionHS is regarded as a three-dimensional tensor,which is decomposed into a three-mode factor matrix and a three-dimensional core tensor by Tucker decomposition. The core tensor is extracted from the high-resolution MS block set by tensor sparse coding,and is multiplied with the factor matrix to obtain images with high spatial resolution and high spectral resolutionThe reconstruction quality is better than that based on matrix factorizationModel parameters are sensitive and difficult to set;It requires mass computingHS and MS fusion
    Deep learning basedThe mapping between HS and MS and hyperspectral images is established by using neural network for fusionIt has high reconstruction accuracy,high efficiency and good robustness without iterationIt relies heavily on datasets and has poor generalizationHS and MS fusion
    Table 5. Summary of traditional MS and HS fusion fusion method and deep learning-based method
    MethodsPrincipleAdvantagesDisadvantagesApplication
    Conventional SPIThe object image is reconstructed by cross-correlation between the illumination field modulated by random pattern and the value obtained by single pixel cameraIt has great interference immunity,high single pixel detection frequency and great weak light detection capabilityBetter image quality requires far more sampling times than the number of reconstructed image pixelsImaging in scattering media
    FSI/HSIThe Hadamard/ Fourier basis spectrum of the target image is obtained by modulating the light field with the Hadamard/ Fourier basis mask,and then the target image is reconstructed by applying the inverse Hadamard/ Fourier transformIt has great interference immunity,and reconstruct the object image without distortionHigh frequency details are easy to be lost;Image artifacts exist;High quality reconstruction requires more sampling timesImaging in scattering media
    Deeplearning basedNeural networks are used to learning the mapping ofimage or one-dimensional signalto reconstructed imagefor image reconstructionIt has high reconstruction efficiency,good reconstruction quality and certain de-noising abilityThe network is prone to over fitting and takes time to train;It requires high adaptability and robustness of neural networkImaging in scattering media
    Table 6. Summary of different SPI reconstruction methods and methods based on deep learning
    MethodsPrincipleAdvantagesDisadvantagesApplication
    LLSAccording to the characteristic that the backscattered light of waterdecreases rapidly relative to the central axis of illumination,the target light and scattered light are separated in spaceIt reduces the influence of scattered light on imagingImaging equipment has large volume;It is impossible to avoid the influence of scattering medium on the transmission optical path;Lengthy imaging time leads to accuracy degradationImaging in scattering media
    STILThe deflection module in the streak tube is used to convert the time information into the distance information to obtain the three-dimensional imageIt has high imaging accuracy,fast imaging speed and large field of viewIt is not suitable for moving target imaging;The system has a short imaging time,which cannot meet the needs of long-time photography3D imaging
    Range-gated imagingThe backscattered light in the process of light transmission is reduced by adjust the open time of laser and cameraIt reduces the influence of scattered light on imaging,and has fast imaging speedLaser energy is scattered,and only small field of view imaging can be performed;The system is costly with limited resolution,and the operation is complexImaging in scattering media;3D imaging
    Table 7. Summary of different underwater laser imaging methods
    MethodsPrincipleAdvantagesDisadvantagesApplication
    Fourier transform reconstructionAfter the hologram is transformed into frequency domain by Fourier transform,the angle difference between the target light wave and other holographic components is used for separation,and then the spatial carrier is removed by inverse Fourier transform. The reconstructed image is obtained by calculating the diffraction integralIt can obtain the amplitude and phase information of objects in real time and quantitativelyIt needs mass computing and prior knowledge;Only a single hologram can be processed each time,so the efficiency is low3D microscopic imaging
    Holographic reconstruction based on deep learningNeural network is used to establish the mapping between hologram and reconstructed image for holographic reconstructionIt has high imaging efficiency and higher imaging quality;No prior knowledge is requiredIt relies heavily on data sets,requires a large number of different sample data and a wide range of reconstructed distance quantization modelsMicrobial 3D image reconstruction;3D particle field reconstruction;Microbial identification classification
    Table 8. Summary of Fourier transform reconstruction and reconstruction based on deep learning
    Application fieldNetwork structureInput-outputdetailsLoss functionApplication problems
    Underwater Image EnhancementCNN,GANImage-imageResidual connection,Dense connection,Inception,Fusion,L1,LSE,MSE,SSIM,GAN LossDeblurring[24,26-27],Color Correction[25,29],Dehazing[28-30],Image Generation[32-39]
    Underwater Image RestorationCNN,GANImage-image,Image-parametersDense connection,Residual connection,Skip connection,Fusion,InceptionL1,Perpetual loss,MSE,GANColor Correction[54-56,57,58],Deblurring[59],Dehazing[57],Image Generation[60]
    Underwater Polarization ImagingCNNImage-imageResidual connection,Dense connection,Skip connection,Fusion,MSE,Perpetual lossDeblurring[71,73]
    Underwater Ghost ImagingMLP,CNN,GAN1D signal-image,Image-imageResidual Connection,Dense Connection,Fusion,InceptionMSE,Perpetual loss,self-designedLowSampling Rate Imaging[84-88],Deblurring[86,88]
    Underwater Spectral ImagingCNNMS image-ImageSkip ConnectionL1Spectral Fusion[108-110]
    Underwater Compressed Sensing ImagingCNN,GANImage-Image,1D signal-imageSkip ConnectionMSE,GANLow Sampling Rate Reconstruction[85,130,132-133],Deblurring[130]
    Underwater Laser Imaging
    Underwater Holographic ImagingCNNImage-3D particle field,Image-classification resultSkip connection,Residual connection,Fusion,Cross Entropy,MSE,L1,Huber loss[213]Improve Efficiency[204,207,212],3D Particle Field Reconstruction[207],Classification[210-212]
    Table 9. Application of deep learning in underwater imaging
    Jun XIE, Jianglei DI, Yuwen QIN. Application of Deep Learning in Underwater Imaging(Invited)[J]. Acta Photonica Sinica, 2022, 51(11): 1101001
    Download Citation