
- Chinese Optics Letters
- Vol. 22, Issue 7, 071101 (2024)
Abstract
Keywords
1. Introduction
Optical coherence tomography (OCT) is a near-infrared imaging technology that was proposed in the 1990s[1] and is now widely used in ophthalmology[2–5], dentistry[6–8], gastroenterology[9,10], and so on. One of its advantages is the ability to non-invasively achieve high-resolution imaging of biological tissues. At present, the typical lateral scanning range of OCT is 2–6 mm[11], which is too small for most clinical scenarios, even for ophthalmology and endoscopy, not to mention intraoperative brain or kidney imaging[12]. There are two ways to improve the limited imaging range: one is to increase the optical scanning range of the sample arm[13,14], which presents challenges for optical design or manufacturing, and the other is to perform registration algorithms to merge multiple small-range images into a single large-scale image, which is more feasible compared to the previous one.
With the rapid development of OCT, there is also a soaring demand for advanced image processing and analysis techniques. The application of image registration technology in OCT serves three main purposes: noise suppression, multimodal image registration, and obtaining larger field images. For noise suppression, Liu et al.[15] proposed an image registration algorithm using a regularized dynamic programming method, which corrected both axial and transverse motion offsets, effectively suppressing speckle noise and improving image quality. Wei et al.[16] introduced a 3D registration method for retinal OCT volume, which applied a non-rigid registration method that combined normalized mutual information-based registration with landmark-based coherent point drift (CPD) registration. For multimodal image registration, Zang et al.[17] proposed a feature-based registration method, which identified a set of control points (CP) and computed the corresponding feature vectors to identify the best CP match to register fundus photographs to spectral domian optical coherence tomography (SD-OCT) projection images.
In light of advancements in computer technology and image processing techniques, grayscale features of images have remained a prominent research topic for registration algorithms[18], in which normalized cross-correlation (NCC) is the most common and widely used[19]. However, the NCC-based image registration method is not capable of handling rotation, and one of the image registration algorithms for addressing both translation and rotation is the Fourier–Mellin transform (FMT)[20]. However, the FMT algorithm needs a high overlap rate between images to collect enough features for proper image alignment, which degrades patient scanning efficiency in clinical applications. In this work, we performed cross-correlation (instead of phase-correlation) in the workflow of the FMT method (called dual-cross-correlation-based translation and rotation registration method, DCCTRR) to address both translation and rotation mismatches between two optical coherence tomography angiography (OCTA) images. Both phantom and in vivo experiments were implemented for the performance comparisons. The results quantitatively demonstrate that the DCCTRR can align OCTA images with a lower overlap rate compared to the FMT method, which could improve the scanning efficiency of large-scale imaging in clinical applications.
Sign up for Chinese Optics Letters TOC. Get the latest issue of Chinese Optics Letters delivered right to you!Sign up now
2. Method
The workflow of the DCCTRR method is demonstrated by the flowchart in Fig. 1, where both rotation and translation offsets could be obtained by the following steps:
Figure 1.Workflow of the dual-cross-correlation-based translation and rotation registration (DCCTRR) algorithm.
First, perform Fourier transform on both images, which transforms the images from the spatial domain to the frequency domain and allows image processing operations (such as filtering, noise reduction, and contrast enhancement) to be performed prior to image registration, so as to improve the accuracy of registration.
Then, transform the images to log-polar coordinates. Taking the center of the frequency domain image as the origin (
The next step is to align the two log-polar images based on the normalized cross-correlation coefficient map of the two images. The NCC between the two images is expressed by[22,23]
The last step is to rotate the image at the calculated angle
In this pross, one image is rotated, causing a difference in the size of the two images. For translation alignment, we first match the size of the two images by zero-padding the reference image. Then, we calculate the NCC coefficient map and find out the coordinate of the maximum value (
The last step of the DCCTRR is to apply the obtained translation compensation
The basic principle of the FMT method can be found in Guo’s work[20]. In brief, the workflow of the FMT method is similar to Fig. 1, and the difference is that the maximum value of the phase-correlation map is used for rotation and translation alignment. The phase-correlation parameter is obtained by
3. Results
In this work, the signal-to-noise ratio (SNR) and registration accuracy are used to evaluate the performance of the DCCTRR and FMT methods, and the SNR is calculated by dividing the maximum correlation coefficient value by the standard deviation of the whole correlation map,
The registration accuracy of the overlapped region after alignment is defined as the proportion of the pixel number of the matched region (with signals on both images) over the average of the pixel numbers of the signaled regions in both images, which can be expressed by
Image stich plays a key role in medical imaging for disease diagnosis, because not well-stitched large-region images may lead to an incorrect diagnosis and cause harmful consequence to patients. With an improved correlation map, SNR, and registration accuracy, the image registration process is more reliable, and the output is more accurate, which could greatly secure a doctor’s judgments for proper treatment for the patient.
3.1. System set-up of our home-built SDOCT system
All OCT/OCTA images shown in this work were obtained by our home-built SDOCT system, as illustrated in Fig. 2. The light source for this system is an SLD (cBLMD-T-850-HP-I, SUPERLUM, Ireland) with a central wavelength of 850 nm and a 3 dB bandwidth of 165 nm. The system comprises a Michelson interferometer structure, in which the optical beam is divided into a reference beam and a sample beam by a 50:50 fiber coupler. The incident beam in the reference arm is reflected by a mirror, while the backscattered beam in the sample arm interferes with it at the fiber coupler. The interference signal is then directed to the high-resolution spectrometer for detection. In the system, L1, L2, and L4 are collimating lenses with a focal length of 19 mm in the reference arm, sample arm, and spectrometer, respectively. L3 is an objective lens with a focal length of 30 mm, providing a lateral resolution of
Figure 2.Schematic of our home-built SDOCT system. L1-L5, achromatic lenses; PC1-PC2, polarization controllers; DP, dispersion compensation pair.
3.2. Phantom experiments
For the phantom experiments, we first tested the performance of a single cross-correlation based method on images with only translation and both translation and rotation, and the results are shown in Figs. 3 and 4, where silk fabric with the fiber diameter of
Figure 3.Results of the phantom experiment (silk fabric with a silk fiber diameter of ∼ 32 µm is used to mimic biological microvasculature) with the normalized cross-correlation method. (a) The normalized cross-correlation coefficient map between an image pair. The horizontal coordinate represents the position of the registered image relative to the reference image, and the vertical coordinate represents the cross-correlation coefficient value at that corresponding position. (b) The plot of the image overlap rates versus the normalized SNR of the correlation coefficient map. (c) The plot of different image overlap rates versus normalized registration accuracy. (d)–(h) Merged images of five pairs of representative images with different overlap rates. (i)–(m) Merged RGB images. The red and green channels are the two original binary images, and the matched signaled regions are yellow.
Figure 4.Results of image registration of cross-correlation method for the images with rotations. (a)–(e) The images with different rotation angles of 0°–4° respectively relative to the same referent image, and the overlap rates are all 96.6%. (f)–(j) The merged RGB images of (a)–(e), respectively. The red and green channels refer to the two original binary images, and the yellow indicates the matched signaled regions. (k) The plot of different rotation angles versus the obtained normalized SNR of the correlation coefficient. (l) The plot of different overlap rates versus the normalized registration accuracy.
Image registration based on normalized cross-correlation cannot cope with rotation, while registration based on the DCCTRR and FMT can. Therefore, we use the registration algorithm based on the FMT and dual cross-correlation to carry out phantom experiments. These two methods register the same pair of images at the same rotation angle (9.3769°), and we put the registration results of these two methods together to make a comparison, as shown in Fig. 5. The reason why 9.3769° is chosen as the rotation angle to explore the limit of the overlap rate is that in the practical OCT application in ophthalmology, patients remain stationary while being scanned, so the rotation mismatch angle can be assumed to be smaller than 10°.
Figure 5.Results of the comparison between the DCCTRR and FMT methods. In the coefficient map, the horizontal coordinate represents the position of the registered image relative to the reference image, and the vertical coordinate represents the correlation coefficient value at that corresponding position. (a) The normalized cross-correlation coefficient map. (b) The phase-correlation coefficient map. (c) The plot of the normalized SNRs of the correlation coefficient map versus different overlap rates. (d) The plot of the normalized registration accuracies versus different overlap rates. (e)–(i) Merged images of five representative overlap rates with the DCCTRR method (the first and the last are the images with the minimum and the maximum overlap rates, respectively). (j)–(n) The merged RGB images of (e)–(i), respectively. (o)–(s) Merged images of the same five pairs of images as (e)–(i) with the FMT method. (t)–(x) The merged RGB images of (o)–(s), respectively.
The correlation coefficient graphs of the registration algorithm based on the DCCTRR and FMT are cross-correlation and phase-correlation, respectively, as shown in Figs. 5(a) and 5(b). The difference between the two methods of cross-correlation and phase-correlation can be seen clearly from Figs. 5(a) and 5(b). The cross-correlation map is a whole continuous surface, and the phase-correlation map is discontinuous and composed of countless vertical lines. Therefore, the FMT method has high accuracy and high SNR. However, for the FMT method, having a higher SNR does not mean that it can have better registration performance. In Fig. 5(c), it can be seen that the FMT method failed early when the overlap rate was less than 61.86% while the DCCTRR could successfully register images until the overlap rate was lower than 12.58%, and the normalized SNR of these two methods changed quite similarly with the overlap rate. Then, Figs. 5(e)–5(i) show registration result images of five representative overlap rates with the image registration algorithm based on dual cross-correlation. The first image represents the registration result with the maximum overlap rate, the fifth image represents the result with the minimum overlap rate, and the intermediate images have equal intervals. The RGB images shown in Figs. 5(j)–5(n) corresponding to the registration result of Figs. 5(e)–5(i). The red and green channels refer to two original binary images. The yellow region indicates the matched signaled regions between the two images. Figures 5(o)–5(s) show the registration result images of five representative overlap rates with the FMT method. The RGB images shown in Figs. 5(t)–5(x) correspond to the registration result of Figs. 5(o)–5(s). The relationship between different overlap rates and registration SNR of these two methods is shown in Fig. 5(c). The relationship between different overlap rates and registration accuracies of these two methods is shown in Fig. 5(d). Not only just the SNR but also the registration accuracy increases along with the overlap rate. In Figs. 5(c) and 5(d), the blue lines represent the results of the DCCTRR, while the red lines represent the results of the FMT method, and points on the curves indicate actual sampling points. As shown in Fig. 5(d), the DCCTRR can still successfully register images when the overlap area ratio is 12.58% with a registration accuracy of 88.51%, while the FMT method will fail when the overlap ratio is less than or equal to 61.86%. Compared to the FMT method, the DCCTRR exhibits a slower decrease in normalized accuracy with a decreasing overlap ratio, indicating superior performance.
To assess the DCCTRR’s ability of robustness to background noise, Gaussian noise was added to two en face OCT images for registration, and the noise floor was gradually increased until the registration failed. The used OCTA images are shown in Fig. 6(a) with an overlap rate of 90.708% and a relative rotation angle of 9.3769°. The maximum STDs of the added Gaussian noise for the DCCTRR and FMT methods are 2.52 and 0.43, respectively. The aligned results are shown in Figs. 6(b) and 6(c). The merged RGB images of (a), (b), and (c) are shown in (d), (e), and (f), respectively. In Figs. 6(g) and 6(h), it can be observed that the normalized SNRs and the normalized accuracies of both the DCCTRR and FMT methods decreased as the STD of the Gaussian noise increases, which matches with the theory. However, the DCCTRR exhibits a smaller slope, meaning higher robustness to background noise.
Figure 6.DCCTRR and FMT methods for registering images with added Gaussian noise. (a) Registered images without added Gaussian noise. Merged images using the (b) DCCTRR and (c) FMT methods with the maximum STD of the added Gaussian noise. The merged RGB images of (a), (b), and (c) are shown in (d), (e), and (f), respectively. (g) The plot of the normalized SNRs of the coefficient map versus the STDs of the added Gaussian noise. (h) The plot of the normalized registration accuracies versus the STDs of the added Gaussian noise.
3.3. In vivo experiments
In vivo experiments were also conducted to investigate the performance of the two methods, and the results are shown in Fig. 7, where a human finger was scanned, and the obtained en face images were used for testing the performance of the two methods. Figures 7(a)–7(f) show images of six representative overlap rates. The first image represents the registration result with the maximum overlap rate, the sixth image represents the result with the minimum overlap rate, and the intermediate images have equal intervals. The RGB images shown in Figs. 7(g)–7(l) correspond to the registration results of Figs. 7(a)–7(f). The red and green channels refer to two original binarized images. If there are signals in the same position of both channels, then they will present as yellow. Figures 7(m) and 7(n), respectively, illustrate the relationship between the overlap rate and the registration SNR/accuracy. On the curves, the points indicate actual sampling points. Both increase as the overlap rate increases. The DCCTRR can successfully register two OCTA en face images of human finger capillaries with an overlap rate of 12.09%.
Figure 7.Registration results of human finger OCTA images with the DCCTRR method. (a)–(f) Merged images of six representative overlap rates (The first one and the last one, respectively, have the maximum and the minimum overlap rates). (g)–(l) The corresponding RGB images of (a)–(f), respectively. (m) The plot of the normalized SNRs of the correlation coefficient map versus the different overlap rates. (n) The plot of the normalized registration accuracies versus the different overlap rates.
In the aforementioned in vivo experiment, the rotational deviation between the two images was measured to be 0.1839°. It is important to note that practical applications may involve larger rotational deviations, which can result in a reduced overlap rate compared to the parameters here. Hence, it is recommended to employ an overlap rate of 20% for practical applications, which is slightly higher than the threshold value.
It deserves to be mentioned that the FMT method fails to register the images of Fig. 7(a) with an overlap rate of 30.238%, which matches with the results of the phantom experiments, where the overlap rate for the FMT method needs to be
Furthermore, we finally performed multiple scans on the volunteer’s finger with an overlap rate of 20% (
Figure 8.Merged large-scale OCTA images with 3 × 3 OCTA en face images. (a) The left hand of a healthy volunteer (the marked region by a solid line was scanned). (b) The merged large-scale image obtained by the DCCTRR method.
4. Discussion
The phase-correlation-based registration method is widely used in satellite-map stitching and geographic-map stitching and performs better over the cross-correlation-based method because it is robust to image intensity changes, illumination, and noise. In comparison to the cross-correlation map, which is smooth and continuous, and the peek indicates the matched location, the phase-correlation map is rough, and sharp spikes indicate the possible matching locations. This feature provides the phase-correlation with good registration accuracy and SNR. However, the phase noise in the phase-correlation method may also be spiked and may degrade the accuracy of the registration. For example, in Eq. (6), the SNR is calculated by dividing the maximum peak by the STD of the entire image. In the phase-correlation map, what affects the registration accuracy is random phase noise that appears also as spikes. The image alignment fails if any phase noise is bigger than the corresponding phase correlation signal.
The DCCTRR and the FMT methods use cross-correlation and phase-correlation, respectively, for image registration, and the computing complexity and processing time are also different. For cross-correlation operation, shown by Eq. (1), the method entails more multiplication and addition operations, along with convolving the entire image, leading to higher computational complexity. On the other hand, phase-correlation is typically computed in the frequency domain, leveraging the properties of the Fourier transform for efficient processing. It primarily involves Fourier transform, complex multiplication, inverse Fourier transform, and other operations, as depicted in Eq. (5), resulting in a lower computational complexity. We performed both methods on the same laboratory computer (Windows 10, 3.70 GHz CPU, 32 GB memory), and the time required to align a pair of OCTA images (1000 pixel × 1000 pixel) for the FMT method was 0.8664 s, in which two rounds of phase-correlation operations cost 0.2642 s (about 30.49% of the total time). In contrast, the DCCTRR method took 1.2522 s, in which the two rounds of cross-correlation operations cost 0.9145 s (about 73.03% of the total time). The results indicate that cross-correlation calculation mainly contributes to the total computing time of the DCCTRR method. Therefore, for the applications where (near) real-time processing is needed, a parallel computing structure (such as GPU and FPGA) may be able to be applied to improve cross-correlation calculation, which may also be under our future research scope.
Image stich plays a key role in medical imaging for disease diagnosis because not well-stitched large-region images may lead to an incorrect diagnosis and cause harmful consequences to patients. With improved correlation map SNR and registration accuracy, the image registration process is more reliable, and the output is more accurate, which could greatly secure doctor’s judgments for a proper treatment for the patient.
5. Conclusion
In summary, this work investigated the performance cross-correlation (instead of phase correlation) in the workflow of the FMT method for calculating translation and orientation offsets (called cross-correlation-based method, DCCTRR) and compared its performance to the FMT method on OCTA images alignment. Both phantom and in vivo experiments were implemented to demonstrate the comparisons, and the results show that the DCCTRR method requires a smaller overlap rate between the images to achieve a comparable registration accuracy to the FMT method. With a reduced overlap rate for image stitching, patient scanning efficiency for large-scale imaging in clinical applications could be improved, and furthermore, the DCCTRR method may also be helpful with the application of combining OCT with a robotic arm for automatic patient scanning.
References
[12] H. Zhang, X. Ma, M. Moradi et al. Large area kidney imaging for pre-transplant evaluation using real-time robotic optical coherence tomography. Res. Sq.(2023).
[19] A. Roche, G. Malandain, X. Pennec et al. The correlation ratio as a new similarity measure for multimodal image registration. The Conference on Medical Image Computing and Computer-Assisted Intervention, 1115(1998).
[20] X. Guo, Z. Xu, Y. Lu et al. An application of Fourier-Mellin transform in image registration. The Fifth International Conference on Computer and Information Technology, 619(2005).

Set citation alerts for the article
Please enter your email address