
- Chinese Optics Letters
- Vol. 22, Issue 12, 121103 (2024)
Abstract
1. Introduction
Scattering media, such as dense fog and biological tissues, are a major hindrance to detecting information, particularly in optical imaging[1–4]. Light propagation is strongly perturbed by the inhomogeneity of the scattering medium. In most cases, the scattering medium is either unknown or incompletely characterized[5–8]. Therefore, an effective strategy for maximizing information recovery must be two-pronged: minimizing interference from the scattering medium and leveraging prior knowledge about the imaging targets[9–11]. For target extraction, characteristic recognition is a common approach, and deep learning can make good use of prior knowledge about the imaging objects. However, target extraction is still affected by environmental disturbances, which limits the extraction precision.
As is well known, ghost imaging (GI) is an anti-interference imaging technology that obtains target information from the correlation between two correlated beams[12–21]. Recently, research on GI in scattering media has greatly intensified[22–28]. Moreover, manipulating the light source has also proven useful for obtaining object characteristics, because GI is a correlation-based imaging technique[29]. In addition, it is feasible to use kernels in convolutional neural networks to adjust the second-order correlation of the speckle patterns so as to enhance imaging quality[30]. Based on these insights, if the light source in GI contains the characteristics of the object of interest, it helps to determine whether the target is present in the scattering medium and thus to extract it.
In this paper, we propose a characteristic GI scheme for target extraction based on enhancing the characteristic response of the light source with deep learning. In our scheme, the light source is trained by a U-Net neural network to contain object characteristics and is then used for target extraction in GI. The numerical and experimental results validate our approach, demonstrating successful recognition of the target of interest with a small number of measurements in complex imaging environments. The results suggest potential applications of target extraction through strong scattering interference.
2. Principle and Methods
The network structure is U-Net, which is a characteristic fusion network, as shown in Fig. 1(b). The network consists of two parts: the left side can be considered as an encoder for the characteristic extraction of training images, and the right side can be seen as a decoder for the characteristic matching of the output label. The encoder has five sub-modules, each of which contains two
Figure 1. (a) Schematic of the experimental setup. L1–L3 are the optical lenses, and GG is the rotary ground glass. (b) Network architecture: U-Net. The U-Net consists of a characteristic extraction path (left side) and a characteristic matching path (right side) to introduce target characteristics into the light patterns.
Figure 1(a) shows the experimental setup of the GI system. Traditionally, the speckle pattern
For traditional GI, the image of the object can be reconstructed by an intensity fluctuation correlation between
To enhance the characteristic response of the illuminating pseudo-thermal light, the training model is generated in the network using the training object and transmitted light that passes through the object; then, the patterns are input into the training model to generate a new light source
In our proposed GI model with deep learning, the patterns after network training are used as the illumination source, and the reconstructed image can be expressed as
From Eq. (6), the ghost image
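As a minimal sketch of the intensity fluctuation correlation used in traditional GI, the following NumPy code assumes the standard textbook form G(x, y) = ⟨I(x, y)B⟩ − ⟨I(x, y)⟩⟨B⟩, where I is the speckle pattern and B is the bucket signal. This form, the function names, the simulated exponential speckle, and the ideal bucket model are illustrative assumptions and are not taken from the equations of this paper.

```python
import numpy as np


def ghost_image(patterns, bucket):
    """Correlate speckle patterns with bucket signals (assumed standard GI form).

    patterns: array of shape (M, H, W), one illumination pattern per measurement.
    bucket:   array of shape (M,), total intensity collected for each pattern.
    Returns G = <I B> - <I><B>, an (H, W) image.
    """
    patterns = np.asarray(patterns, dtype=float)
    bucket = np.asarray(bucket, dtype=float)
    delta_b = bucket - bucket.mean()
    # The mean over measurements of (B - <B>) * I equals <I B> - <I><B>.
    return np.tensordot(delta_b, patterns, axes=(0, 0)) / len(bucket)


def simulate_bucket(patterns, obj):
    """Hypothetical ideal bucket detector: sum of light transmitted by the object."""
    return (patterns * obj).sum(axis=(1, 2))


# Toy example: a binary rectangular object under simulated pseudo-thermal speckle.
rng = np.random.default_rng(0)
obj = np.zeros((16, 16))
obj[4:12, 6:10] = 1.0
patterns = rng.exponential(1.0, size=(5000, 16, 16))  # assumed speckle statistics
bucket = simulate_bucket(patterns, obj)
image = ghost_image(patterns, bucket)
```

In this toy setting the reconstructed `image` is brighter where `obj` is transmissive; the number of patterns (5000) is chosen only to make the sketch stable and is unrelated to the measurement numbers reported in the paper.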
Here, it is natural to ask whether the trained pseudo-thermal light is still suitable for GI. Figure 2 presents the intensity fluctuation correlation distributions (self-correlations) of the speckle under different training objects [see Figs. 2(a1)–2(d1)] and training epochs. The number of measurements is 500. Figure 2(e) illustrates the second-order coherence (
Figure 2. Intensity correlation distributions of the pseudo-thermal light under different training objects and training epochs. (a1)–(d1) Different training objects. (a2)–(d2) Intensity correlation distributions of the light field under the corresponding training objects. g(2) of the trained speckle patterns with different (e) training epochs and (f) training objects, respectively.
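The two quantities shown in Fig. 2 can be estimated from a stack of speckle patterns roughly as follows. This is a hedged sketch, assuming the common definitions g(2) = ⟨I²⟩/⟨I⟩² at zero shift and an intensity fluctuation correlation taken with respect to a reference pixel; the function names, the choice of the central reference pixel, and the simulated speckle are assumptions for illustration, not the procedure used in the paper.

```python
import numpy as np


def g2_zero_shift(patterns):
    """Estimate g2 = <I^2> / <I>^2 per pixel, then average over all pixels."""
    patterns = np.asarray(patterns, dtype=float)
    mean_i = patterns.mean(axis=0)
    mean_i2 = (patterns ** 2).mean(axis=0)
    return float((mean_i2 / mean_i ** 2).mean())


def intensity_correlation_map(patterns, ref=None):
    """Intensity fluctuation correlation of every pixel with a reference pixel."""
    patterns = np.asarray(patterns, dtype=float)
    m, h, w = patterns.shape
    if ref is None:
        ref = (h // 2, w // 2)  # assumed reference point: center of the pattern
    fluct = patterns - patterns.mean(axis=0)
    ref_fluct = fluct[:, ref[0], ref[1]]
    # <dI(ref) dI(x, y)> averaged over the m measurements.
    return np.tensordot(ref_fluct, fluct, axes=(0, 0)) / m


# 500 simulated patterns with negative-exponential intensity statistics.
rng = np.random.default_rng(1)
patterns = rng.exponential(1.0, size=(500, 32, 32))
g2 = g2_zero_shift(patterns)
corr_map = intensity_correlation_map(patterns)
```

For fully developed speckle with negative-exponential statistics, `g2` is expected to be close to 2, and `corr_map` peaks at the reference pixel; a trained light field could be passed through the same functions to compare its statistics with those of the untrained source.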
It is shown in Figs. 2(e) and 2(f) that
To quantitatively measure the imaging quality, the signal-to-noise ratio (SNR) is used in the following discussion[37]:
3. Results and Discussion
3.1. Target extraction with the characteristic-enhanced light field
We first demonstrate the ability of target extraction from complex scenarios, thus verifying the above theoretical model and analysis. The simulation and experiment are shown in Fig. 3. Here, we consider separating the overlapping targets using the trained light field. In the experiment, the gain of the bucket detector is 70 dB, and the time interval of the projector is 0.1 s. Three different objects (cat, dog, and pig) etched on the
Figure 3. GI results under pseudo-thermal light (first column) and light sources with different training characteristics (second to fourth columns). (a1) The imaging object and (b1)–(d1) different training objects. (a2)–(d2) The simulation results. (a3)–(d3) The corresponding experimental results.
Since our imaging target is a superposition of multiple distinct training objects, each individual object [Figs. 3(b1)–3(d1)] is fed into the deep learning network to obtain its corresponding characteristic-enhanced light field; these trained light fields are then used for GI. The results show that the proposed method can effectively separate different objects, and the outlines of the real objects are clearly visible in both simulation [Figs. 3(b2)–3(d2)] and experiment [Figs. 3(b3)–3(d3)], leading to a remarkable improvement in target recognition ability. Comparing the imaging quality of the different reconstructed images, the ghost images of the dog have a significantly larger SNR than those of the cat and pig. This indicates that the imaging quality is closely related to the characteristic response in the light source, which can be explained from
In addition, the bow on the cat's head is reconstructed better than the rest of the cat, which is related to how strongly the target characteristic is enhanced in the light field. As can be seen from the light field self-correlation in Fig. 2(c2), the bow on the cat's head produces many shadows in the self-correlation, which indicates not only that the light field and the bow characteristic are well integrated but also that the bow characteristic in the light field is more strongly enhanced. Therefore, the bow in the ghost image is clearer than the other parts of the cat.
In practical applications, there may be multiple imaging objects with the same characteristics that need to be extracted from complex scenarios. Therefore, it is very important to recognize these targets simultaneously. Figure 4 shows multi-target scenarios (superpositions of several animals) in which there is more than one animal of each type. Here, we want to know whether there are any chickens in these scenarios and how many there are. The goal cannot be achieved by traditional GI [see Figs. 4(a1) and 4(a3)], whereas the target can be extracted using the characteristic-enhanced light field [see Figs. 4(a2) and 4(a4)] when the training object is a chicken [the upper left parts of Figs. 4(a) and 4(b)]. In more complex scenarios with multiple targets of the same species that need to be identified, traditional GI also fails [see Figs. 4(b1) and 4(b3)]. Interestingly, the targets sharing an identical shape with the training object can still be recognized using the trained light field, and their number is also clearly discernible, as depicted in Figs. 4(b2) and 4(b4). The results demonstrate that our scheme exhibits a remarkable ability for multiple-target extraction when the targets share similar characteristics. In addition, although the chicken has the best imaging quality under single-target extraction, some characteristics of the chicken, such as the comb, are not obvious. The reason is that the chicken and the rabbit have some similar characteristics, such as the chicken comb and the rabbit ears, which interfere with the target extraction. Therefore, characteristic extraction is less complete when similar targets are present, but the targets can still be recognized.
Figure 4. Single- and multiple-target extraction results (500 measurements) in a complex scenario under a specific training target [the upper left parts of (a) and (b)]. (a1), (a3) and (b1), (b3) The results under pseudo-thermal light. (a2), (a4) and (b2), (b4) The results under the trained light. (a1), (a2) and (b1), (b2) Simulation results. (a3), (a4) and (b3), (b4) Experimental results.
3.2. Target extraction through a strong scattering environment and its application in biomedical imaging
From the above analysis, our method can be used for single- and multiple-target extraction in complex scenarios. In practical imaging scenarios, the extraction ability may be subject to scattering interference, so it is important to investigate the target extraction ability in scattering environments. Ground glass is generally considered a surface scattering medium, so we place rotating ground glasses in front of and behind the object to simulate a strong scattering environment in our experiment; the results are shown in Fig. 5. Due to scattering interference, traditional GI fails to recognize targets in single-target extraction [Figs. 5(a2) and 5(b2)], and distinguishing multiple targets is even more challenging [Fig. 5(c2)]. However, our method presents better target extraction ability, and the SNR is improved [Figs. 5(d2)–5(h2)] compared with traditional GI. For example, Figs. 5(c2) and 5(h2) show the multiple-target extraction results under the pseudo-thermal light source and the characteristic-enhanced light source, respectively. It is difficult to distinguish the target using traditional GI, while the trained light field can efficiently obtain the target of interest and reduce the interference of the scattering environment caused by the rotating ground glasses. Furthermore, compared with the results obtained without the scattering environment, the presence of scattering only slightly degrades the SNR and does not impede target extraction [Figs. 4(b4) and 5(h2)]. In other words, enhancing the characteristic response of the light source effectively mitigates the impact of scattering at a low sampling rate.
Figure 5. The experimental results under different imaging scenarios in a strong scattering environment (500 measurements). (a1)–(h1) The corresponding imaging objects and different training objects (the upper left part of each picture). The imaging results under pseudo-thermal light [(a2)–(c2)] and enhanced characteristic lights [(d2)–(h2)], respectively.
Next, we consider a practical scattering environment, namely biological tissue. In biomedical imaging, scattering from tissues such as muscles and blood vessels is unavoidable. Taking bone imaging as an example, X-rays are traditionally used because they can obtain bone information directly through soft tissue. Here, we attempt to extract bone information using our scheme with the trained pseudo-thermal light. The imaging objects in the experiments are a real zebrafish and a real crucian carp, as shown in Figs. 6(b1) and 6(b2), and the training objects are the X-ray photographs of fish bones shown in Figs. 6(a1) and 6(a2). The area in the dotted box is the imaging area, and the corresponding results are presented in Figs. 6(c1) and 6(c2). It is shown that GI with the trained light fields is sufficient for recognizing the fish bones with only 500 measurements. For the small zebrafish, the spine of almost the entire body is obtained in Fig. 6(c1), and the details of the spine can also be identified [see the upper right corner of Fig. 6(c1)]. For the larger crucian carp, small bones (such as ribs) are extracted less well than for the small fish because the fish body is thicker. However, the spine can still be effectively identified, which verifies the target extraction ability of our method in a practical scattering scenario.
Figure 6. Experimental results of fish bone extraction (500 measurements). (a1), (a2) X-ray photos of zebrafish and crucian carp bones. (b1), (b2) Real photos of zebrafish and crucian carp. (c1), (c2) Results using our imaging method.
We also notice the degradation of image quality with the increase of the fish thickness, the reason being that an increase of the fish thickness reduces the signal intensity received by the bucket detector. Therefore, it is necessary to investigate the influence of the detected signal intensity and fish thickness on characteristic extraction, and the results are plotted in Fig. 7. Here, the detected SNR (
Figure 7. Experimental results of fish bone extraction (500 measurements). (a1)–(a3) Real photos of zebrafish and crucian carp with different thicknesses. (b1)–(f1), (b2)–(f2), and (b3)–(f3) The results from our method under different S/R values.
To better present the experimental results, we conduct multiple experiments with the light power in the object plane set to 50, 40, 30, 20, and 10 µW. The effect of different fish thicknesses on bone extraction is also discussed, and the thicknesses of fish are selected as 0.8, 1.4, and 2.1 mm, as shown in Figs. 7(a1)–7(a3). A larger thickness corresponds to a smaller
Here, it should be emphasized that our proposed method pre-processes the light source rather than post-processing the imaging result[38,39] or the object light[40]. Existing deep learning GI schemes can indeed eliminate the interference and achieve target extraction, but they have a drawback: the training process needs a large number of data sets and training epochs to ensure the stability of target recognition. Because they process the target information, existing deep learning GI schemes generally need larger training sets (thousands of images)[38], longer training (more than 50 epochs)[40], and higher time consumption[38,40]. In contrast, our method does not focus on the data processing of the detected information; it only enhances the target characteristic response in the light field to realize target extraction during the imaging process. The training process requires fewer resources: smaller training sets (1000 images), shorter training (10–20 epochs), less time consumption (only a few minutes), and fewer measurements (500 samples), which is beneficial for real-time imaging.
According to the above analysis, our scheme can be used to recognize targets that share identical characteristics with the training object in a complex scenario. Moreover, the model can suppress the effect of strong scattering by enhancing the target characteristic response of the light source. Note that the reported studies[27,28] on object identification based on GI mainly depend on the post-processing of detected signals, which increases the time consumption of image reconstruction. Our method is beneficial for real-time imaging because it does not require post-processing of target signals. Moreover, previous speckle processing[30] can only improve the imaging quality of the object at a low sampling rate and cannot separate the target from multiple interference targets, whereas our scheme demonstrates the potential for target extraction under complex scenarios.
4. Conclusion
A characteristic imaging scheme that uses deep learning to enhance the characteristic response of pseudo-thermal light in GI has been proposed for target extraction. Since the characteristics of the training target are contained in the new light source, the target can be recognized with few measurements even when the imaging object is covered by occlusions. The simulation and experimental results verify the single- and multiple-target extraction ability of our scheme in the presence of interference targets and strong scattering. Our proposed scheme may be a promising target extraction method with potential applications in complex scenarios.
References
[19] W. L. Gong. High-resolution pseudo-inverse ghost imaging. Photonics Res., 3, 234(2015).
[30] X. Nie, H. Song, W. Ren et al. Deep-learned speckle pattern and its application to ghost imaging(2021).
[32] J. Long, E. Shelhamer, T. Darrell. Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 3431(2015).
[33] O. Ronneberger, P. Fischer, T. Brox. U-Net: convolutional networks for biomedical image segmentation. International Conference on Medical image computing and computer-assisted intervention, 234(2015).
[34] Y. LeCun, Y. Bengio, G. Hinton. Deep learning. Nature, 521, 436(2015).
