
- Infrared and Laser Engineering
- Vol. 50, Issue 8, 20200418 (2021)
Abstract
Keywords
0 Introduction
Infrared(IR) small dim target detection plays an important role in precise guidance[
A number of single frame based algorithms for detecting IR small dim targets have been proposed, including spatial domain algorithms[
In recent years, robust Human Visual System (HVS) properties have been introduced to the IR small dim target detection field. According to the contrast mechanism of HVS, it is the contrast but not the brightness that occupies the most important part in the streams of our visual system[
By now, researchers have not reached a consensus on how to define the local contrast, and many local contrast definitions have been proposed. Generally, existing algorithms can be divided into two categories: the difference form local contrast and the ratio form local contrast[
The difference form local contrast algorithms, such as the Laplacian of Gaussian (LoG) filter[
The ratio form local contrast algorithms, such as the Local Contrast Measure (LCM)[
In theory, the difference form local contrast and the ratio form local contrast have their respective advantages and disadvantages: the difference form local contrast can effectively eliminate high brightness backgrounds, but cannot effectively enhance real target; the ratio form local contrast can effectively enhance real target, but cannot effectively eliminate high brightness background. It will be expected to combine the advantages of the two if they can be used together. The simplest way is to use them as two separate stages. For example, the ILCM and the NLCM are both ratio form local contrast algorithms, but they utilized the DoG filter as a preprocessing algorithm to eliminate high brightness backgrounds first. However, dividing an algorithm into two stages may damage the robustness of the algorithm, because any error in any stage will affect the final detection performance. Other two-stage algorithms, such as LCM + LoG[
In this paper, a Multi-scale Ratio-Difference joint Local Contrast Measure (MRDLCM) is proposed. It integrates the ratio form local contrast and the difference form local contrast as a whole, and can combine the advantages of the ratio form methods and the difference form methods so no preprocessing algorithm is needed. After the MRDLCM calculation, different sizes of small dim targets can be enhanced by utilizing the ratio form local contrast; high brightness backgrounds can be eliminated by utilizing the difference form local contrast; complex background edges can be suppressed by introducing the directional information; and PNHB can be suppressed by the reasonable design of the formula. Besides, a weighted function utilizing the Reversed Local Diversity (RLD) of the local surrounding areas is proposed to suppress the complex backgrounds further.
The contributions of this paper can be summarized as follows: (1) A new local contrast measure named MRDLCM is proposed, it can effectively enhance IR small dim targets while suppressing high brightness backgrounds, complex background edges and PNHB simultaneously. (2) A weighted function named RLD is proposed, it utilizes the local diversity of the local surrounding areas to suppress the complex backgrounds further. (3) An IR small dim target detection algorithm based on MRDLCM_RLD is proposed, it doesn’t need any preprocessing algorithm (such as DoG or LoG, etc.) and has a simple structure, so its robustness can be guaranteed.
Real IR sequences are used to test the performance of the proposed algorithm versus some existing state-of-the-art algorithms. Experimental results demonstrate that the proposed algorithm can successfully detect the small dim targets under complex backgrounds and can achieve the best detection performance against existing algorithms in detection rate and false alarm rate. Besides, the proposed algorithm has the potential of parallel processing, which is very useful for improving the detection speed.
This paper is organized as follows. In Section I, the calculations of MRDLCM and RLD are described in details, and the detection ability is analyzed; in Section II, the detection of the small dim target based on MRDLCM_RLD is described; experimental results are presented in Section III, and this paper concludes in Section IV.
1 The calculations of MRDLCM and RLD
In this section, firstly the different features between real IR small dim target and other interferences are analyzed, including high brightness background, complex background edges, and PNHB. Based on the analysis, a new local contrast method named MRDLCM to enhance real target while suppressing all the interferences is proposed, and a weighted function named RLD to suppress the complex backgrounds further. The detection ability of the proposed algorithm is analyzed in this section too.
1.1 Features of different types of components in IR image
A sample of real IR image which contains a real small dim target[
Figure 1.(a) A sample of real IR image; (b) 3D distributions of different types of components. Here, TT represents true small target, NB represents normal background, HB represents high brightness background, EB represents complex background edge, and PNHB represents Pixel-sized Noises with High Brightness
From Fig.1 we can get the conclusions as follows:
(1) True small dim target usually concentrates in a small, compact area that attenuates from the center, it usually emerges in flat, homogenous background zones, and is usually brighter than its immediate neighboring background in the spatial domain since the target is usually hotter. In other words, it has a small local contrast. Besides, true small dim target is usually assumed as rough circles without anisotropy and prevailing directions[
(2) Normal background is usually dark and flat, it has correlation in the spatial domain and is not salient both in gray value and in local contrast.
(3) High brightness background may have a gray value far larger than real small dim target, but it usually has a large area and has correlation in the spatial domain too, thus, its local contrast is not obvious.
(4) Complex background edge also has local contrast information between the two sides, but background edge usually distributes along a particular direction in a local small area, which is different from true small dim target.
(5) PNHB has the most similar pattern to true small dim target, however, a PNHB which is usually caused by random electrical noises only emerges as a single pixel, but a true small dim target usually has a small area due to the optics Point Spread Function (PSF) of the thermal imaging system at a long distance[
1.2 MRDLCM
MRDLCM describes a pixel position by generating a signal value. As mentioned before, a small dim target is salient in a local area but not in the whole image, so we will focus on a local small image patch (as shown in Fig.2), and define the MRDLCM of the central pixel in the central cell cell(0) utilizing the ratio and the difference between cell(0) and cell(i) (i=1,2,···,8 represents different directions). See below for details. Here the central cell is used to catch real target and the surrounding cells are used to capture the surrounding backgrounds, so the cell size N should be close to or slightly larger than a real target. If the target size is unknown, to ensure that a cell can contain the total target while introducing as few interferences as possible, N should be approximated to the general maximal size of small targets[
Figure 2.A local small image patch used for MRDLCM calculation. It is divided into 9 cells, and the cell size
(1) The Ratio Form Local Contrast Measure (RFLCM)
First, considering real small dim target is usually the brightest in a local area, and normal background is usually dark and flat, to enhance the target, a new ratio form local contrast measure is proposed, and the RFLCM of the central pixel in cell(0) for the ith direction is defined as
where Imean0 denotes the average gray of the K max pixels in cell(0), Imeani denotes the average gray of the K max pixels in cell(i), shown in Eqs.(2) and (3):
where K is the number of maximal gray values considered, Gj0 and Gji are the jth maximal gray value of cell(0) and cell(i). The average operation here is used to reduce the interference caused by PNHB, so K is suggested to be set to a value larger than 1. Besides, considering true target usually attenuates from its center, to get a larger enhancement on real target, K is suggested to be set to a value smaller than the target size.
Figure 3 illustrates the different cases when the central pixel of cell(0) is different. Similar to Fig.1, TT means the central pixel of cell(0) is a true target pixel, NB means the central pixel of cell(0) is a normal background pixel, HB means the central pixel of cell(0) is a high brightness background pixel, EB means the central pixel of cell(0) is a complex background edge pixel, and PNHB means the central pixel of cell(0) is a PNHB.
Figure 3.Different cases when the central pixel of
From Fig.3, it can be easily deduced that for TT and NB there will be
Since normal background usually occupies a large area and real target is usually salient in local. Therefore, true small dim target can be enhanced using the proposed RFLCM.
For PNHB, even if its gray value is close to or slightly larger than a real target, the average operation in (2) will suppress PNHB effectively if K is larger than 1, so it can be easily deduced that
At last, to suppress complex background edges, the direction information is utilized and the RFLCM is finally defined as
It can be easily deduced that
Therefore, both PNHB and complex background edges can be suppressed using the proposedRFLCM.
(2) The Difference Form Local Contrast Measure (DFLCM)
Then, considering real small dim target is usually local salient, and high brightness background usually has a large area and has correlation in the spatial domain, a new difference form local contrast measure is proposed to eliminate the high brightness background, and the DFLCM of the central pixel in cell(0) for the ith direction is defined as
where Imean0 and Imeani are same as Eqs.(2) and (3).
From Fig.3, it can be easily deduced that for TT and HB there will be
Since high brightness background usually has a large area and has correlation in the spatial domain, while real target is usually local salient. Therefore, high brightness background can be eliminated using the proposed DFLCM.
Similar to RFLCM, the final DFLCM is defined as
and the conclusions in (6) and (8) are easily to be proved as true for DFLCM, too, see Eqs.(13) and (14), i.e., both PNHB and complex background edges can be suppressed using the proposedDFLCM.
(3) The Ratio-Difference joint Local Contrast Measure (RDLCM)
To combine the advantages of the ratio form local contrast and the difference form local contrast, the Ratio-Difference joint Local Contrast Measure (RDLCM) is proposed here, and the method to calculate the RDLCM for a raw IR image is shown in the Algorithm 1.
1: Form 9 cells as a patch, as shown in Fig.2.
2: Slide the patch on the raw IR image from left to right and top to down.
3: At each pixel, calculate its RFLCM and DFLCM according to Eqs.(1), (7) and Eqs.(9), (12).
4: After the calculation is end for the total image, form the results as two new matrixes RFLCM and DFLCM respectively.
5: Normalize the elements in RFLCM to the range (0, 1).
6: Normalize the elements in DFLCM to the range (0, 1).
7: Calculate the RDLCM of the raw IR image using the Hadamard product of RFLCM and DFLCM:
(4) The MRDLCM
The K in Eqs.(2) and (3) is a key parameter in the proposed algorithm. To get a better detection performance, K should be adjusted adaptively with the size of the target. However, in real applications, target size is usually unknown. Thus, the multi-scale detection is utilized in the proposed algorithm, and the calculation of MRDLCM for a raw IR image is shown the Algorithm 2 when L is the number of scales:
1:
Calculate the RDLCMs according to the Algorithm 1 using Ks.
2:
3: At each pixel, output the maximum RDLCM value of the L scales as the final MRDLCM value, i.e.,
where (i, j) is the coordinate of each pixel.
1.3 RLD
Existing local contrast algorithms (including ratio form and difference form) just simply taken the local surrounding areas (cell(1)-cell(8) in Fig.2) as background when they calculate the local contrast information, but did not taken into account the diversity of the local surrounding background itself. In fact, complex backgrounds are the most common interferences in IR small dim target detection, and it’s obviously that the more complex the background, the larger the local diversity, while a real target which emerges in flat, homogenous background zones will has a smaller local diversity. Thus, the local diversity information can be used to suppress complex backgrounds further.
In this paper, two aspects are considered in the Local Diversity (LD): the diversity within each cell, and the diversity between cell(1)-cell(8). First, the diversity within each cell is defined as
where Imaxi and Imini are maximum and minimum gray value in cell(i) respectively. It can be easily deduced that if a cell is made up of a flat, homogenous region, the diversity within the cell will be close to 0; on the other hand, if a cell is made up of complex background, the diversity within the cell will be far larger than 0.
Then, the LD of the central pixel of cell(0) will be defined as
In fact, Eq.(18) is the variance of Dini, i.e., it takes into account the diversity between cell(1)-cell(8). It can be easily deduced that if a pixel is located at a flat, homogenous region, its LD will be close to 0; on the other hand, if a pixel is located at complex background, its LD will be far larger than 0.
Based on the analysis above, it can be seen that for a raw IR image, a target pixel will have a smaller LD, and a complex background pixel will have a larger LD. Then, we can use the LD as a weight function to further suppress complex background. However, to ensure that a target pixel will have a larger weight value and a complex background pixel will have a smaller weight value, the normalize and reverse operation will be needed and the RLD is proposed. The calculation of RLD is shown in the Algorithm 3.
1: Form 9 cells as a patch, as shown in Fig.2.
2: Slide the patch on the raw IR image from left to right and top to down.
3: At each pixel, calculate its LD according to Eqs.(17) and (18).
4: After the calculation is end for the total image, form the results as a new matrixes LD.
5: Normalize the elements in LD to the range (0, 1).
6: Reverse:
1.4 Detection ability analysis for MRDLCM_RLD
For a raw IR image, after the MRDLCM and the RLD are calculated, the Multi-scale Ratio-Difference Local Contrast Measure weighted by Reversed Local Diversity (MRDLCM_RLD) will be calculated according to the Algorithm 4.
1: Calculate the MRDLCM_RLD of the raw IR image using the Hadamard product of MRDLCM and RLD:
Then, the MRDLCM_RLD results for different types of pixels (see Fig.3) will be discussed.
(1) For a TT, since a true target usually concentrates in a small, compact area and is hotter than its neighboring background, its RFLCM (before normalization, the same below) will be larger than 1, DFLCM (before normalization, the same below) will be larger than 0, then its MRDLCM will be large. Besides, since a true target usually emerges in flat, homogenous background zones, its RLD will be large, too. Thus, there will be
(2) For a NB, since normal background is usually dark and flat, its RFLCM will be close to 1, and its DFLCM will be close to 0, then its MRDLCM will be smaller than a TT’s. Thus, there will be
although the RLD of a NB may be close to the RLD of a TT.
(3) For an HB, similar to NB, it can be easily deduced that
(4) For an EB, since background edge usually distributes along a particular direction in a local small area, the minimum operation in Eqs.(7) and (12) ensures that the EB’s RFLCM will be close to or smaller than 1, DFLCM will be close to or smaller than 0, then its MRDLCM will smaller than a TT’s. Besides, the RLD of an EB will be smaller than a TT’s, too. Thus, there will be
(5) For a PNHB, since PNHB usually emerges as a single pixel and the average operation is utilized in Eqs.(2) and (3), the PNHB’s RFLCM and DFLCM will be smaller than a TT’s if K is set to a value larger than 1, even the PNHB has a same gray value with TT. Thus, there will be
although the RLD of a PNHB may be close to the RLD of a TT.
From the discussions above it can be seen that after the MRDLCM_RLD calculation, true small dim target will be the most salient, while all the other interferences are suppressed, which means the proposed MRDLCM_RLD has a good detection ability for IR small dim target, and no preprocessing is needed.
2 The detection of the small dim target based on MRDLCM_RLD
The flow chart of the whole detection algorithm is shown in Fig.4. The MRDLCM and RLD will be first calculated for a raw IR image respectively, then the RLD result will be used as a weight function for MRDLCM, and the MRDLCM_RLD is calculated. From Fig.4 it can be seen that after the MRDLCM_RLD calculation, true small dim target will be the most salient in the saliency map MRDLCM_RLD, so a threshold operation can be used to extract target. Besides, Fig.4 also hints that the proposed algorithm has the potential of parallel processing.
Figure 4.Flow chart of the proposed algorithm
2.1 The threshold operation
There are many types of threshold definitions. In this paper, we adopt the idea of the widely used Gaussian threshold because it is an adaptive threshold definition and can be effectively used to capture the abnormal salient information in a large number of data, and the threshold Th is adaptively defined as
where μ and σ are the mean and standard deviation of MRDLCM_RLD, kth is a given parameter. Our experiments show that the optimal range of kth is from 2 to 7.
In MRDLCM_RLD, the pixels which have larger value than Th will be output as target pixels, while other parts are discarded. In the final detection result, each connected area will be regarded as a detected target (In order to eliminate clutters, a dilation operation may be needed).
2.2 The potential of parallel processing analysis
Detection speed is an important index in the field of IR small dim target detection since real time detection is usually needed in many practical applications. Parallel processing is a common method to improve the detection speed as parallel processing devices such as GPU have been widely used. In fact, in most cases the lack of parallel processing capability is the key problem that affects the detection speed, so it is necessary to analyze the parallel processing capability of the proposed algorithm.
It can be easily deduced that the proposed algorithm has the potential of parallel processing. First, the calculations of MRDLCM and RLD can be carried out in parallel; then, the calculations of different scales can be carried out in parallel. See Fig.4. Besides, for each scale, the calculations of each pixel can be carried out in parallel; for each pixel, the calculations of each direction can be carried out in parallel.
3 Experimental results
In this section, four real IR sequences are used to verify the performance of the proposed algorithm. First, the features of different sequences are introduced. Then, the key parameter K in the proposed algorithm is optimized using different sizes of targets. Then, the detection results for different sequences of the proposed algorithm are given. To further illustrate the effectiveness of the proposed algorithm, the comparisons of the detection performances between the proposed algorithm and other state-of-the-art algorithms are given, too. All the experiments are conducted on a computer with 4 GB random access memory and 3.4 GHz Intel i3 processor, and the code was implemented in MATLAB R2016b.
3.1 Features of different sequences
In this paper, four real IR sequences which contains different sizes of small dim targets and different types of backgrounds are used to verify the performance of the proposed algorithm. The samples for each sequence are shown in Fig.5.
Figure 5.Samples for the four IR sequences. (a) A sample for Seq. 1; (b) A sample for Seq. 2; (c) A sample for Seq. 3; (d) A sample for Seq. 4
From Fig.5 it can be seen that the targets are very small and dim, and the backgrounds are complex. The details of the features of different sequences are given in Table 1.
Frames | Image resolution | Target ID | Target size | Target details | Background details | |
Seq. 1 | 300 | 320×240 | Only 1 | 7×5 | • Plane target.• A long imaging distance.
| • Sky-Cloud background.• Heavy clutter.
|
Seq. 2 | 100 | 256×256 | Only 1 | 5×5 | • Truck target.• A long imaging distance.
| • Ground-Tree background.
|
Seq. 3 | 100 | 320×256 | Only 1 | 3×3 | • Plane target.• A long imaging distance.
| • Ground-Sky background.
|
Seq. 4 | 100 | 256×256 | Target 1 | 7×5 | • Boat target.• A long imaging distance.
| • Sea-Sky background.
|
Target 2 | 5×5 | |||||
Target 3 | 5×5 | |||||
Target 4 | 6×6 |
Table 1. Features of different sequences
Besides, Table 2 gives some characteristics of the first frame of the four sequences, here Cwh, Cnb, and SCR are defined as follows, and It is the maximum gray value of the target, Iwh is the maximum gray value of the whole image, Inb is the mean gray value in the neighborhood area of the target, σwh is the stand deviation of the whole image:
Target ID | Cwh | Cnb | SCR | |
Seq. 1 | Only 1 | 6.3875 | 0.8896 | |
Seq. 2 | Only 1 | 0.9043 | 2.8461 | |
Seq. 3 | Only 1 | |||
Seq. 4 | Target 1 | 0.8325 | 1.6194 | 1.5707 |
Target 2 | 0.8030 | 1.4725 | 1.2457 | |
Target 3 | 0.7340 | 1.3254 | 0.8545 | |
Target 4 | 0.7734 | 1.2787 | 0.7848 |
Table 2. Characteristics of the first frame of the four sequences
From Fig.5 and Table 2 we can see that for the four sequences been used in this paper, the targets may be not the brightest part in the whole image, i.e., the Cwh of the four sequences are all smaller than 1. For example, the Cwh of Seq. 1 or Seq. 3 is just about 0.3. However, in a local region, the targets are all brighter than their neighborhood areas, i.e., the Cnb of the four sequences are all larger than 1. Even in the worst case in Seq. 2 or Seq. 3, the Cnb is still larger than 1.16. Besides, complex backgrounds and heavy clutters bring a low SCR, especially for Seq. 3, which has a SCR lower than 0.2.
3.2 The optimization of K
In the proposed algorithm, the K in Eqs.(2) and (3) for MRDLCM calculation is a key parameter. It is obviously that to reduce the interference caused by PNHB, K should be larger than 1; but, to get a better enhancement on real target, K should be adjusted adaptively with the size of the target. Considering target size is usually unknown in practice, multi-scale detection is adopted in this paper (see the Algorithm 2), then the optimization of the parameters K1, K2, ···, KL (L is the scale number totally used) for each scale will be an essential matter.
In this section, numerous simulations have been done to choose the optimal K for different target sizes. The 2D Gaussian function is used to model the small target, and the maximum gray value of the target is set to 120. Different target sizes including 3 × 3, 5 × 5, 7 × 7 and 9 × 9 are tested. The resolution of the simulated images is set to 320 × 256. The gray vale of the normal background is set to 100, particularly, in the left-up corner there is a small area (80 × 240) of high brightness background with gray value 200. Target is located at (156, 180) in the first frame, and moves from left to right one pixel per frame. Random noises with a standard variation of 20 are added to each simulated image.
Figure 6 gives the SCR value using different K for different target sizes, for each target size 20 frames are listed here. From Fig.6 it can be seen that for a smaller target, K=2 will be proper; for a larger target, K=4 will be proper.
Figure 6.SCR results before and after MRDLCM calculation using different
To verify this conclusion, three real sequences including Seq. 1, Seq. 2 and Seq. 3 are used for test, too. The results are shown in Fig.7, for each sequence 10 frames are listed here. It can be seen that in most cases the optimal value of K is 2 or 4, too. Thus, without loss of generality, 2 scales are used in this paper (i.e., L=2) and K1 is set to 2, K2 is set to 4.
Figure 7.SCR results before and after MRDLCM calculation using different
3.3 The detection results of the proposed algorithm
First, to verify the effectiveness of the proposed algorithm, the calculation results for different types of pixels are given in Fig.8. The same figure with Fig.1 and Fig.3 is used here.
Figure 8.Calculation results for different types of pixels using the proposed MRDLCM_RLD algorithm. (a) Different cases when the central pixel of
Comparing Fig.8(c) to Fig.1(b) it can be seen that real small dim target is enhanced using the proposed algorithm, while other interferences all been suppressed, which is consistent with the discussions in Section 1.4.
Figure 9 gives the detection results of different sequences using the proposed MRDLCM_RLD algorithm. Here K1 is set to 2, K2 is set to 4, and N is 9 × 9. The same samples with Fig.5 are used here.
Figure 9.From top to bottom: the detection results using the proposed MRDLCM_RLD algorithm for Seq. 1, Seq. 2, Seq. 3 and Seq. 4. (a) The raw IR image samples of the four sequences; (b) The DFLCM result; (c)The RFLCM result; (d)The MRDLCM result; (e) The RLD result; (f) The MRDLCM_RLD result; (g) The threshold operation results, each connected area is regarded as a target
It can be seen from Fig.9 that in the raw IR images, the targets are small and dim, and complex backgrounds and heavy clutters exist, see Fig.9(a). After the DFLCM calculation, the complex background can be suppressed to some extent, but the target is still weak, especially when the target is small and dim in the raw images, see Fig.9(b). In the RFLCM calculation results, the target is usually larger, see Fig.9(c), thus, in the results after MRDLCM calculation, the targets can be enhanced, while complex backgrounds and heavy clutters are suppressed further, see Fig.9(d). After the RLD calculation, the homogenous background area will get a large value, and the complex background area will get a small value, see Fig.9(e). Thus, when we weighted MRDLCM with RLD, the complex backgrounds and heavy clutters will be suppressed further, see Fig.9(f). At last, real small dim targets are output correctly for both single target cases and multi targets cases by the threshold operation, while other parts are discarded, see Fig.9(g). Only in Seq. 3 there is a false alarm since Seq. 3 has a very small dim target and very complex backgrounds (see Table 2).
To better explain the effectiveness of RLD, Fig.10 gives the detection result using only MRDLCM alone for Seq. 3. It can be seen that there will be more false alarms when RLD is removed, which proves from the opposite side that RLD can further suppress complex backgrounds.
Figure 10.Detection result using only MRDLCM alone for Seq. 3. (a) The raw IR image sample of Seq. 3; (b) The MRDLCM result; (c) The threshold operation result on MRDLCM, more false alarms emerge
3.4 Comparisons with other algorithms
To further illustrate the effectiveness of the proposed algorithm, six state-of-the-art algorithms are chosen for comparison, including DoG[
First, the single frame detection results using the comparison algorithms are shown in Fig.11, the same samples with Fig.9 are used here.
Figure 11.Comparisons of detection results between different algorithms, from top to down: the detection results of Seq. 1, Seq. 2, Seq. 3 and Seq. 4 using (a) DoG; (b) ILCM; (c) NLCM; (d) WLDM; (e) MPCM; and (f) RLCM
From Fig.11 it can be seen that:
(1) For Seq. 1, DoG, ILCM, NLCM and WLDM cannot detect the small target correctly but a lot of false alarms emerge; MPCM can detect the small target, but false alarm emerges too; only RLCM can detect the small target correctly without any false alarms.
(2) The case of Seq. 2 is similar to that of Seq. 1.
(3) For Seq. 3, even MPCM cannot detect the target successfully since the target is too dim and the backgrounds are too complex. RLCM, on the other hand, although can detect the small target, presents more false alarms than the proposed algorithm.
(4) For Seq. 4 which contains multi targets, since the targets are somewhat bright and the backgrounds are homogenous, almost all the algorithms can successfully detect the four targets without any false alarms, except DoG, which detects only three real targets but presents two false alarms.
Comparing Fig.11 to Fig.9, it is easy to get the conclusion that the proposed algorithm can achieve the best detection performance against other existing algorithms.
Then, the comparison results for the whole sequences using different algorithms are also given in Fig.12, here the Receiver Operating Characteristic (ROC) curves[
Figure 12.ROC curves of different algorithms for (a) Seq. 1, (b) Seq. 2, (c) Seq. 3 and (d) Seq. 4
(1) The detection performance of DoG is the worst, because DoG is a difference form local contrast algorithm, it cannot effectively enhance the small dim target. Besides, it doesnot utilize the directional information and cannot distinguish complex background edges.
(2) WLDM has a better performance than DoG, because WLDM is a ratio form local contrast method and can enhance the small dim targets effectively. However, it can’t eliminate high brightness backgrounds effectively.
(3) ILCM has a better performance than DoG and WLCM, because it is a ratio form local contrast method and takes DoG as preprocessing, so it can enhance small dim targets and suppress high brightness backgrounds simultaneously. Besides, the directional information is utilized in ILCM to better suppress the complex background edges.
(4) The case of NLCM is similar to ILCM, and their performances are similar too.
(5) MPCM adopts multi-scale detection can achieve a satisfied detection performance in Seq. 1, Seq. 2, and Seq. 4, but in Seq. 3, its performance is the worst, because the target is too small and too dim, and the backgrounds are too complex in Seq. 3. In other words, the robustness of MPCM is not good. The reason is that MPCM is a difference form local contrast algorithm, it cannot effectively enhance the small dim target.
(6) As a newly proposed multi-scale local contrast method, RLCM can achieve a better detection performance than other existing algorithms in all sequences since it utilizes both ratio operation and difference operation, but its performance in Seq. 3 is still not good.
(7) The proposed algorithm, which joints the ratio form local contrast and the difference form local contrast together to enhance real target and suppress interferences simultaneously, can achieve the best detection performance with good robustness in all the four sequences. Especially, for Seq. 3 in which real target is dim and backgrounds are complex, the detection performance of the proposed algorithm is greatly improved comparing to RLCM.
4 Conclusion
In this paper, a Multi-scale Ratio-Difference Local Contrast Measure (MRDLCM) weighted by Reversed Local Diversity (RLD) for IR small dim target detection is proposed. MRDLCM can combine the advantages of the ratio form methods and the difference form methods, so it can suppress all the types of interferences while enhancing different sizes of real targets, and do not need any preprocessing algorithms. RLD utilizes the local diversity information to suppress the complex backgrounds further. Four real IR sequences which contain different types of backgrounds and different sizes of targets are used for experiments, and the experimental results show the effectiveness and the robustness of the proposed MRDLCM_RLD algorithm in detection rate and false alarm rate against six existing state-of-the-art algorithms. Besides, the proposed algorithm has the potential of parallel processing, which is very useful for improving the detection speed.
References
[1] Mingming Fan, Shaoqing Tian, Kai Liu, et al. Infrared small target detection agorithm based on gradient direction consistency and eigendecomposition. Infrared and Laser Engineering, 49, 0126001(2020).
[2] M Nasiri, S Chehresa. Infrared small target enhancement based on variance difference. Infrared Phys Technol, 82, 107-119(2017).
[3] L Dong, B Wang, Z Ming, et al. Robust infrared maritime target detection based on visual attention and spatiotemporal filtering. IEEE Trans Geosci Remote Sens, 55, 3037-3050(2017).
[4] C Gao, D Meng, Y Yang, et al. Infrared patch-image model for small target detection in a single image. IEEE Trans Image Process, 22, 4996-5009(2013).
[5] Xiangyue Zhang, Qinghai Ding, Haibo Luo, et al. Infrared dim target detection algorithm based on improved LCM. Infrared and Laser Engineering, 46, 0726002(2017).
[6] C Gao, T Zhang, Q Li. Small infrared target detection using sparse ring representation. IEEE Aerosp Electron Syst Mag, 27, 21-30(2012).
[7] Kim S. Target attributebased false alarm rejection in small infrared target detection[C]Proc SPIE, 2012, 8537: 85370G.
[8] J Han, Y Ma, B Zhou, et al. A robust infrared small target detection algorithm based on human visual system. IEEE Geosci Remote Sensing Lett, 11, 2168-2172(2014).
[9] Deshpe S, Meng H, Venkateswarlu R. Maxmean maxmedian filters f detection of small targets[C]Proc SPIE, 1999, 3809: 7483.
[10] B Zhang, T Zhang, Z Cao, et al. Fast new small-target detection algorithm based on a modified partial differential equation in infrared clutter. Opt Eng, 46, 106401(2007).
[11] Gregis D J, Yu S K, Tritchew S, et al. Wavelet transfmbased filtering f the enhancement of dim targets in FLIR images[C]Proc SPIE, 1994, 2242: 573583.
[12] B Qi, T Wu, H He. Robust detection of small infrared objects in maritime scenarios using local minimum patterns and spatio-temporal context. Opt Eng, 51, 027205(2012).
[13] X Bai, F Zhou. Analysis of new top-hat transformation and the application for infrared dim small target detection. Pattern Recognit, 43, 2145-2156(2010).
[14] Y Cao, R Liu, J Yang. Small target detection using two-dimensional least mean square (TDLMS) filter based on neighborhood analysis. J Infrared Millim W, 29, 188-200(2008).
[15] H Ding, H Zhao. Adaptive method for the detection of infrared small target. Opt Eng, 54, 113107(2015).
[16] Z Wang, J Tian, J Liu, et al. Small infrared target fusion detection based on support vector machines in the wavelet domain. Opt Eng, 45, 076401(2006).
[17] Y Bi, X Bai, T Jin, et al. Multiple feature analysis for infrared small target detection. IEEE Geosci Remote Sensing Lett, 14, 1333-1337(2017).
[18] X Wang, Z Peng, D Kong, et al. Infrared dim target detection based on total variation regularization and principal component pursuit. Image Vision Comput, 63, 1-9(2017).
[19] C Wang, S Qin. Adaptive detection method of infrared small target based on target-background separation via robust principal component analysis. Infrared Phys Technol, 69, 123-135(2015).
[20] L Itti, C Koch, E Niebur. A model of saliency-based visual attention for rapid scene analysis. IEEE Trans Pattern Anal Mach Intell, 20, 1254-1259(1998).
[21] J Han, K Liang, B Zhou, et al. Infrared small target detection utilizing the multi-scale relative local contrast measure. IEEE Geosci Remote Sensing Lett, 15, 612-616(2018).
[22] S Kim, Y Yang, J Lee, et al. Small target detection utilizing robust methods of the human visual system for IRST. J Infrared Millim Terahz Waves, 30, 994-1011(2009).
[23] X Shao, H Fan, G Lu, et al. An improved infrared dim and small target detection algorithm based on the contrast mechanism of human visual system. Infrared Phys Technol, 55, 403-408(2012).
[24] X Wang, G Lv, L Xu. Infrared dim target detection based on visual attention. Infrared Phys Technol, 55, 513-521(2012).
[25] X Dong, X Huang, Y Zheng, et al. A novel infrared small moving target detection method based on tracking interest points under complicated background. Infrared Phys Technol, 65, 36-42(2014).
[26] X Dong, X Huang, Y Zheng, et al. Infrared dim and small target detecting and tracking method inspired by human visual system. Infrared Phys Technol, 62, 100-109(2014).
[27] J Han, Y Ma, J Huang, et al. An infrared small target detecting algorithm based on human visual system. IEEE Geosci Remote Sensing Lett, 13, 452-456(2016).
[28] K Xie, K Fu, T Zhou, et al. Small target detection based on accumulated center-surround difference measure. Infrared Phys Technol, 67, 229-236(2014).
[29] Y Wei, X You, H Li. Multiscale patch-based contrast measure for small infrared target detection. Pattern Recognit, 58, 216-226(2016).
[30] C L P Chen, H Li, Y Wei, et al. A local contrast method for small infrared target detection. IEEE Trans Geosci Remote Sens, 52, 574-581(2014).
[31] Y Qin, B Li. Effective infrared small target detection utilizing a novel local contrast method. IEEE Geosci Remote Sensing Lett, 13, 1890-1894(2016).
[32] H Deng, X Sun, M Liu, et al. Small infrared target detection based on weighted local difference measure. IEEE Trans Geosci Remote Sens, 54, 4204-4214(2016).
[33] H Deng, X Sun, M Liu, et al. Infrared small-target detection using multiscale gray difference weighted image entropy. IEEE Trans Aerosp Electron Syst, 52, 60-72(2016).
[34] T Xia, Y Y Tang. Biologically inspired small infrared target detection using local contrast mechanisms. Int J Wavelets Multiresolut Inf Process, 13, 1550025(2015).
[35] Z Cui, J Yang, J Li, et al. An infrared small target detection framework based on local contrast method. Measurement, 91, 405-413(2016).
[36] Z Cui, J Yang, S Jiang, et al. An infrared small target detection algorithm based on high-speed local contrast method. Infrared Phys Technol, 76, 474-481(2016).
[37] Y Chen, Y Xin. An efficient infrared small target detection method based on visual contrast mechanism. IEEE Geosci Remote Sensing Lett, 13, 962-966(2016).
[38] IEEE OTCBVS WS Series Bench, Rol Miezianko, Terravic Research Infrared Database[DBOL]. [2013] http:vciplokstate.gpbvsbenchindex.html.
[39] Zhang W, Cong M, Wang L. Algithms f optical weak small targets detection tracking: review[C]Proc IEEE Int Conf Neural w Signal Process, 2003, 12: 643647.
[40] Davis J, Goadrich M. The relationship between precisionrecall ROC curves[C]Proc 23rd Int Conf Mach Learn, 2006: 233240.

Set citation alerts for the article
Please enter your email address