Deep plug-and-play priors for spectral snapshot compressive imaging

Siming Zheng; Yang Liu; Ziyi Meng; Mu Qiao; Zhishen Tong; Xiaoyu Yang; Shensheng Han; Xin Yuan

doi:10.1364/PRJ.411745

Abstract

We propose a plug-and-play (PnP) method that uses deep-learning-based denoisers as regularization priors for spectral snapshot compressive imaging (SCI). Our method is efficient in terms of reconstruction quality and speed trade-off, and flexible enough to be ready to use for different compressive coding mechanisms. We demonstrate the efficiency and flexibility in both simulations and five different spectral SCI systems and show that the proposed deep PnP prior could achieve state-of-the-art results with a simple plug-in based on the optimization framework. This paves the way for capturing and recovering multi- or hyperspectral information in one snapshot, which might inspire intriguing applications in remote sensing, biomedical science, and material science. Our code is available at: https://github.com/zsm1211/PnP-CASSI.

1. INTRODUCTION

(x, y, λ)

(x, y, λ)

The underlying principle of the spectral SCI hardware is to modulate different bands (corresponding to different wavelengths) in the spectral data cube by different weights and then integrate the light to the sensor. To perform the modulation, which should be different for different spectral bands, various techniques have been used. The pioneer work of coded aperture snapshot spectral imaging (CASSI) [12] used a fixed mask (coded aperture) and two dispersers to implement the band-wise modulation, termed DD-CASSI; here DD means dual disperser. Following this, the single-disperser (SD) CASSI was developed [19], which achieves modulation by removing a disperser. Following CASSI, various spectral SCI systems have been built using disperser/prism and masks [20 - 24]. Recently, motivated by the spectral variant responses of other media, spatial light modulators [25], ground-glass-based light field modulation [26], and scatters [27] have also been employed for spectral SCI. In addition, some compact systems have also been built [28, 29].

Sign up for Photonics Research TOC. Get the latest issue of Photonics Research delivered right to you！Sign up now

The software decoder, i.e., the reconstruction algorithm, plays a pivotal role in spectral SCI as it outputs the desired data cube. At the beginning, optimization-based algorithms developed for inverse problems such as CS were employed. Since spectral SCI is an ill-posed problem, regularizers or priors are generally used, such as the sparsity [30] and total variation [15]. Later, the patch-based methods such as dictionary learning [25, 31] and Gaussian mixture models [32] were developed for the reconstruction of spectral SCI. Recently, by utilizing the nonlocal similarity in the spectral data cube, group sparsity [17] and low-rank models [16] have been developed to achieve state-of-the-art results. The main bottleneck of these high performance iterative optimization-based algorithms is the low reconstruction speed. Since the spectral data cube is usually large-scale, sometimes it needs hours to reconstruct a spectral data cube from a snapshot measurement. This precludes the real applications of spectral SCI systems.

To address the above speed issue in optimization algorithms, and inspired by the performance of deep-learning approaches for other inverse problems [33, 34], convolutional neural networks (CNNs) have been used to solve the inverse problem of spectral SCI for the sake of high speed [35 - 39]. These networks have led to better results than their optimization counterparts, given sufficient training data and time, which usually take days or weeks. After training, the network can output the reconstruction instantaneously and thus lead to end-to-end spectral SCI sampling and reconstruction [39]. However, these networks are usually system-specific. For example, different numbers of spectral bands exist in different spectral SCI systems. Further, due to the different designs of masks, the trained CNNs cannot be used in other systems, while retraining a new network from scratch would take a long time.

Bearing the above concerns in mind, i.e., optimization-based and deep-learning-based algorithms each have their own pros and cons, it is desirable to develop a fast, flexible, and high accuracy algorithm for spectral SCI. Fortunately, the plug-and-play (PnP) framework [40, 41] has been proposed for inverse problems with provable convergence [42, 43]. The idea of PnP is intuitive, since the goal is to use the state-of-the-art denoiser as a simple plug-in for recovery. The rationale here is to employ recent advanced deep denoisers [44 - 46] in the iterative optimization algorithm to speed up the reconstruction process. Since these denoisers are pretrained with a wide range of noise levels, the PnP algorithm is very efficient and usually only tens or hundreds of iterations would provide promising results [18]. More importantly, no training is required for different tasks and thus the same denoising network can be directly used in different systems. Therefore, PnP is a good trade-off for reconstruction quality, speed, and flexibility.

However, since most existing flexible denoising networks are designed for natural images, i.e., the gray-scale or RGB images, directly using these networks into spectral SCI systems would not lead to good results. To address this issue, in this paper, we propose training a flexible denoising network for multispectral/HSIs and then apply it to the PnP framework to solve the reconstruction problem of spectral SCI.

Our proposed approach enjoys the advantages of speed, flexibility, and high accuracy. We apply the proposed method in five different real systems (three SD-CASSI systems [39, 47, 48], one mutispectral endomicroscopy system [36], and one ghost imaging spectral system [26]) and all of them have achieved promising results. To compare with other state-of-the-art algorithms, simulations are also conducted to provide quantitative analysis. Spectral sensor design and fabrication [2, 4 - 8] may benefit from our method by taking inspiration from the coding mechanisms and the simple plug-in for recovery.

Note that the PnP framework has been used in other inverse problems such as video CS [18], which emphasized the theoretical analysis of PnP for SCI problems in general and used an off-the-shelf denoiser (FFDNet) [46] to demonstrate its capability in video SCI. No spectral SCI results have been shown therein because spectral SCI is more challenging in terms of its various coding mechanisms and no off-the-shelf denoiser could provide a fast, flexible, and high-accuracy solution. As a matter of fact, this observation serves as the initial motivation for this paper. Towards this end, the novelty of this paper is twofold. First, we propose a CNN-based deep spectral denoising network as the spatio-spectral prior, which is flexible in terms of data size and the input noise levels. Second, we summarize the image-plane and aperture-plane coding mechanisms for spectral SCI and use the PnP method combined with our proposed deep spectral denoising prior for both simulations and five different spectral SCI systems (including image-plane and aperture-plane coding-based ones).

The paper is organized as follows. Section 2 introduces different spectral SCI systems. The proposed PnP method is derived in Section 3 . Extensive results are shown in Section 4, and Section 5 concludes the entire paper.

2. SPECTRAL SCI

The basic idea of SCI is to encode 3D or multidimensional visual information onto 2D sensor measurement. For spectral SCI, a 3D spatio-spectral data cube is encoded to form a snapshot 2D measurement on the charge coupled device (CCD) or complementary metal oxide semiconductor (CMOS) sensor, as shown in Fig. 1 .

Generalized image formation (left) and the discrete matrix-form model (right) of spectral SCI. Here color denotes the corresponding spectral band.

Figure 1.Generalized image formation (left) and the discrete matrix-form model (right) of spectral SCI. Here color denotes the corresponding spectral band.

A. SCI Forward Model

X \in R^{W \times H \times B}

A

B. Spectral SCI Systems

To encode spectral information onto a single-shot measurement, the sensing matrix must be spectrally variant. To this end, spectral SCI systems need to involve spectral dispersion devices (dispersers), like prisms, diffraction gratings, or diffusers.

A

1. Image-Plane Coded Mask

For image-plane coding, the coded mask is typically located at the conjugate image plane of the sensor plane, where one spatio-spectral voxel is directly modulated by one pixel on the coded mask and then relayed to one pixel on the detector. Therefore, there is a voxel-to-pixel mapping between the scene and the corresponding column of the sensing matrix.

As mentioned before, CASSI [12, 19, 47, 48] was the first spectral SCI system, to the best of our knowledge. And CASSI systems can be categorized into image-plane coded masks, whether they use dual dispersers or a single disperser. The key success of CASSI is to use a coded mask for spatial coding and implement a spectral shearing with a disperser (a prism [12, 19, 27, 47, 48], a grating [20], or other spectrally variant devices like spatial light modulators (SLMs) [25, 49, 50]) to encode 3D spatio-spectral information onto a snapshot measurement on a 2D detector.

DD-CASSI [12] preshears the spectral cube of the scene via the first prism and then spatially encodes it using a coded mask at the image plane, where the coded spectral cube is finally unsheared to match the size of the original spectral cube via the second prism. Thereby, each voxel of the scene spectral cube would correspond to one element in the sensing matrix, and the encoded spectral cube is unsheared and thus has the same spatial size as the 2D measurement thanks to the usage of two complementary prisms, as shown in the first row of Fig. 2 . Single disperser, or SD-CASSI [19, 47] does not preshear the scene spectral cube and only performs the spatial coding and spectral shearing with a coded mask and a prism successively, as shown in the upper part of Fig. 3 . In this way, the encoded spectral cube is sheared and contains some zero rows along the shearing boundaries, as shown in the second row of Fig. 2 .

Figure 2.Comparison of image-plane coding (upper) and aperture-plane coding (lower) spectral SCI systems in terms of sensing matrix. Here each color block denotes the corresponding transport matrix at that spectral band.

Figure 3.Image formation process of a typical spectral SCI system, i.e., SD-CASSI and the reconstruction process using the proposed deep PnP prior algorithm.

A = [D_{1}, \dots, D_{B}],

2. Aperture-Plane Coded Mask

A A^{⊤}

There are two types of implementations for aperture-plane coding of a spectral SCI. The main difference is whether the point spread function (PSF) of each spatio-spectral voxel of the scene spectral cube is spatially invariant or not. Typical spatially invariant implementations are using speckles along with memory effect [52,53] and a diffractive optical element (DOE) [28] for spatially invariant PSFs, as shown in the third row of Fig. 2. Less calibration is involved for spatially invariant implementations, which would also suffer from this assumption mismatch. Spatially variant PSFs are more general, with a ghost imaging via sparsity constraints (GISC) spectral camera [26,54] and the compact prism-based spectral camera [29] as two representatives, as shown in last row of Fig. 2. We will talk about both the algorithm for aperture-coding-based spectral SCI (Section 3.A) and the experimental results on the GISC spectral camera [54] (Section 4.B.3) as well.

3. METHODS

Recovering 3D or multidimensional information from 2D SCI measurements is an ill-posed linear inverse problem. The main take-away from the CS [9, 10, 55, 56] community is that sub-Nyquist sampling and reliable recovery could be achieved by constraints of the sampling/sensing matrix [55, 57] and proper priors of the signal. The performance bound of the SCI-induced sensing matrix has been proved in Ref. [58]. And the fact ion that denoisers using deep neural networks could serve as the prior of natural images with certain constraints on the network training process is getting wide attention [43].

ℓ_{1}

We further use the PnP method [40, 41] based on the alternating direction method of multipliers (ADMM) [59] for image-plane coding and the two-step iterative shrinkage/thresholding (TwIST) [15] algorithm for aperture-plane coding to solve Eq. (5).

A. PnP Method

The basic idea of PnP method for inverse problems is to use a pretrained denoiser for the desired signal as a prior. It builds on the optimization-based recovery method, where the whole inverse problem is broken into easier subproblems by handling the forward-model (data-fidelity) term and the prior term separately [59] and alternating the solutions to subproblems in an iterative manner. This is why it is called the PnP method, since the denoiser could serve as a simple plug-in for the reconstruction process. Here, for spectral SCI, we use a pretrained HSI denoising network as the deep spectral prior and integrate it into an iterative optimization framework for reconstruction, as shown in the lower part of Fig. 3 . We will start with the PnP-ADMM method for spectral SCI with image-plane coding, and then substitute the ADMM projection with TwIST for aperture-plane coding. Note that the difference lies in the “Projection” step in Fig. 3 .

The proposed PnP method has guaranteed convergence for SCI with a bounded denoiser [42, 43] and the assumption of estimated noise levels in a nonincreasing order [18].

1. PnP–ADMM for Image-Plane Coding

x^{k + 1} = \arg {min}_{x} \frac{1}{2} {‖ A x - y ‖}_{2}^{2} + \frac{ρ}{2} {‖ x - (z^{k} - u^{k}) ‖}_{2}^{2},

A A^{⊤}

For spectral SCI, we use a deep spectral denoiser as the prior, as detailed in Section 3.B . This is very straightforward for DD-CASSI. However, for SD-CASSI, there are spatial shifts between adjacent spectral bands because the spectrum is not unsheared by another disperser. Pratically, we calibrate spatial shifts of all spectral bands or keep the same spatial shifts for all adjacent bands and calibrate the corresponding wavelengths. We take the spatial shifts into account by unshifting the spectral bands before applying denoising and then reshifting them back to match the forward model.

2. PnP–TwIST for Aperture-Plane Coding

A A^{⊤}

A^{⊤}

B. Deep Spectral Denoising Prior

From the idea of the PnP method for linear inverse problems, we can see that a proper denoiser could serve as a prior of optimization-based approaches, where a better denoiser would contribute to higher reconstruction quality. Deep-learning-based denoisers, especially those based on CNNs for images/videos are among the state of the art. A key challenge for using deep denoisers as priors is the flexibility in terms of data size and the input noise levels. According to Eq. (14) in PnP-ADMM and Eq. (17) in PnP-TwIST, the denoiser should be adapted to different input noise levels. Inspired by the recent advance of the fast and flexible denoising CNN (FFDNet) [46] and its success applied to video SCI [18], we propose using a deep spectral denoising network as the spatio-spectral prior, that is, the deep spectral denoising prior. The network structure of the deep spectral denoising prior is shown in Fig. 4 .

Figure 4.Network structure of the deep spectral denoising prior.

D_{σ} (v) = {prox}_{σ^{2} R} (v) = \arg {min}_{x} R (x) + \frac{1}{2 σ^{2}} {‖ x - v ‖}_{2}^{2},

W \times H

C. Training Details of Our Deep Spectral Image Denoising Network

512 \times 512

σ

4. RESULTS

λ

A. Simulations

Hereby, we verify the performance of PnP by simulation using different data sets of different sizes and compare it with other popular algorithms. For the simulation data, we generate measurements following the SD-CASSI framework, as shown in the second row of Fig. 2 .

1. Data Sets

1392 \times 1300

Figure 5.Test spectral data from (a) ICVL [69] and (b) KAIST [35] data sets used in simulation. The reference RGB images with pixel resolution $256 \times 256$ are shown here. We crop similar regions of the whole image for spatial sizes of $512 \times 512$ and $1024 \times 1024$ .

2. Competing Methods and Comparison Metrics

λ

3 \times 3

Both quantitative and qualitative metrics are used for comparison. The quantitative metrics are peak signal-to-noise ratio (PSNR) and structural similarity (SSIM) [71]. For qualitative comparison, we plot spectral frames along with spectral curves and compare them with the ground truth for visual verification. Additionally, we use Pearson correlation coefficient (corr) to assess the fidelity of recovered spectra.

3. Parameter Setting

{0,1}

For the proposed PnP algorithm, it usually needs a warm starting point to speed up the convergence. To address this, for the proposed PnP algorithm, we first run 80 iterations of GAP-TV. Since the only difference is the denoising algorithm, TV, or deep denoising, in each iteration, we only need to switch the denoising method in the flow chart, shown in Fig. 3 .

The other important parameter of PnP is the noise level in each iteration. One method is to estimate the noise level in each iteration. However, this will make it computationally extensive. Therefore, similar to other PnP methods [18], we set the noise level manually in each iteration. This is also the reason we train the HSI denoising network to a wide noise range. Specifically, we set the noise level in a decreasing manner. For instance, assuming that the range of each pixel is [0,255], we set the noise level to 25 for 20 iterations, followed by 15 for 20 iterations and then tune the noise level to be smaller during the last few iterations.

4. Simulation Results of Different Spatial Sizes

Table 1 summarizes the average results of the 16 scenes shown in Fig. 5 with different spatial sizes. It can be seen that in all these three spatial sizes, PnP always leads to the best results. In particular, PnP outperforms GAP-TV by at least 2 dB in PSNR, which is the best among other algorithms. What else stands out in the table is that AE does not perform as well as in the DD-CASSI system shown in Ref. [35]. We also tested all the above algorithms using DD-CASSI; AE can achieve better results than other algorithms except PnP. Table 1.

Average PSNR (in dB), SSIM, and Running Time (in Seconds) of 16 Simulation Scenes (8 from ICVL and 8 from KAIST) at Different Spatial Sizes Using Various Algorithms^a

Spatial Size	Data Set	TwIST			GAP-TV			AE			U-net			PnP
Spatial Size	Data Set	PSNR (dB)	SSIM	Running Time (s)	PSNR (dB)	SSIM	Running Time (s)	PSNR (dB)	SSIM	Running Time (s)	PSNR (dB)	SSIM	Running Time (s)	PSNR (dB)	SSIM	Running Time (s)
$256 \times 256$	ICVL	30.58	0.8731	156.3	32.57	0.8794	130.2	29.41	0.8711	144.2	31.13	0.8897	0.8	35.03	0.9274	132.7
$256 \times 256$	KAIST	27.32	0.8495	156.3	29.66	0.8584	130.2	26.79	0.8498	144.2	29.44	0.8941	0.8	33.21	0.9273	132.7
$512 \times 512$	ICVL	31.82	0.8955	1380.2	33.58	0.8965	399.1	31.22	0.8969	493.6	NA	NA	NA	35.68	0.9319	401.6
$512 \times 512$	KAIST	29.09	0.8944	1380.2	31.38	0.8993	399.1	29.28	0.8974	493.6	NA	NA	NA	34.29	0.9378	401.6
$1024 \times 1024$	ICVL	32.68	0.9159	3657.6	34.22	0.9157	1460.7	32.03	0.9158	2053.5	NA	NA	NA	36.21	0.9434	1453.6
$1024 \times 1024$	KAIST	31.64	0.9099	3657.6	33.66	0.9134	1460.7	31.05	0.9071	2053.5	NA	NA	NA	36.41	0.9433	1453.6

NA denotes not available.

256 \times 256

256 \times 256

Figure 6.Simulation results of color-checker with size of $256 \times 256$ from KAIST data set compared with the ground truth. PSNR and SSIM results are also shown for each algorithm.

Figure 7.Simulation results of exemplar scenes (top, ICVL; bottom, KAIST) with size of $256 \times 256$ compared with the ground truth. Spectral curves of selected regions are also plotted to compare with the ground truth.

512 \times 512

Figure 8.Simulation results of four selected scenes shown in sRGB and spectral curves with spatial size of $512 \times 512$ (shown in full size in the far left column). The spectra of the pinned (yellow) region of the close-up are shown on the right.

Figure 9.Simulation results of four selected scenes shown in sRGB and spectral curves with spatial size of $1024 \times 1024$ (shown in full size in the far left column). The spectra of the pinned (yellow) region of the close-up are shown on the right.

B. Real Data

In this section, we apply our proposed PnP algorithm into five real spectral SCI systems, namely, three SD-CASSI systems [39, 47, 48], one snapshot multispectral endomicroscopy [36], and a ghost spectral compressive imaging system [54]. Note that our PnP framework is using the pretrained HSI denoising network on the simulation data. Though these systems have different spatial and spectral resolutions, PnP can be used directly to all these systems. Due to the speed consideration, we only compare with TwIST and/or GAP-TV in these real data sets.

1. Single-Disperser CASSI

256 \times 210

Figure 10.Real data, object SD-CASSI data ( $256 \times 210 \times 33$ ).

Figure 11.Real data, bird SD-CASSI data ( $1021 \times 731 \times 33$ ).

Figure 12.Real data, Lego SD-CASSI data ( $660 \times 550 \times 28$ ).

Figure 13.Real data, plant SD-CASSI data ( $660 \times 550 \times 28$ ).

2. Snapshot Multispectral Endomicroscopy

660 \times 660

Figure 14.Real data, snapshot multispectral endomicroscopy data ( $660 \times 660 \times 24$ ).

3. Ghost Imaging Spectral Camera

330 \times 330 \times 16

Figure 15.Real data, GISC spectral camera data ( $330 \times 330 \times 16$ ).

5. CONCLUSION

We have developed a deep PnP algorithm for the reconstruction of spectral SCI. We trained a deep denoiser for hyper/multispectral images and plugged it to the ADMM and TwIST frameworks for different spectral CS systems. Importantly, a single pretrained denoiser can be applied to different systems with different settings. Therefore, our proposed algorithm is highly flexible and is ready to be used in different real applications. Extensive results on both simulation and real data captured by diverse systems have verified the performance of our proposed algorithm.

K

References

[1] B. E. Bayer. Color imaging array. U.S. patent(1976).

[2] B. Redding, S. F. Liew, R. Sarma, H. Cao. Compact spectrometer based on a disordered photonic chip. Nat. Photonics, 7, 746-751(2013).

[3] Z. Wang, Z. Yu. Spectral analysis based on compressive sensing in nanophotonic structures. Opt. Express, 22, 25608-25614(2014).

[4] J. Bao, M. G. Bawendi. A colloidal quantum dot spectrometer. Nature, 523, 67-70(2015).

[5] Z. Yang, T. Albrow-Owen, H. Cui, J. Alexander-Webber, F. Gu, X. Wang, T.-C. Wu, M. Zhuge, C. Williams, P. Wang, V. A. Zayats, W. Cai, L. Dai, S. Hofmann, M. Overend, L. Tong, Q. Yang, Z. Sun, T. Hasan. Single-nanowire spectrometers. Science, 365, 1017-1020(2019).

[6] Z. Wang, S. Yi, A. Chen, M. Zhou, T. S. Luk, A. James, J. Nogan, W. Ross, G. Joe, A. Shahsafi, K. X. Wang, M. A. Kats, Z. Yu. Single-shot on-chip spectral sensors based on photonic crystal slabs. Nat. Commun., 10, 1020(2019).

[7] A. McClung, S. Samudrala, M. Torfeh, M. Mansouree, A. Arbabi. Snapshot spectral imaging with parallel metasystems. Sci. Adv., 6, eabc7646(2020).

[8] Y. Kwak, S. M. Park, Z. Ku, A. Urbas, Y. L. Kim. A pearl spectrometer. Nano Lett.(2020).

[9] D. Donoho. Compressed sensing. IEEE Trans. Inf. Theory, 52, 1289-1306(2006).

[10] E. J. Candès, J. Romberg, T. Tao. Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information. IEEE Trans. Inf. Theory, 52, 489-509(2006).

[11] E. J. Candès, M. B. Wakin. An introduction to compressive sampling. IEEE Signal Process. Mag., 25, 21-30(2008).

[12] M. E. Gehm, R. John, D. J. Brady, R. M. Willett, T. J. Schulz. Single-shot compressive spectral imaging with a dual-disperser architecture. Opt. Express, 15, 14013-14027(2007).

[13] G. R. Arce, D. J. Brady, L. Carin, H. Arguello, D. S. Kittle. Compressive coded aperture spectral imaging: an introduction. IEEE Signal Process. Mag., 31, 105-115(2014).

[14] X. Cao, T. Yue, X. Lin, S. Lin, X. Yuan, Q. Dai, L. Carin, D. J. Brady. Computational snapshot multispectral cameras: toward dynamic capture of the spectral world. IEEE Signal Process. Mag., 33, 95-108(2016).

[15] J. Bioucas-Dias, M. Figueiredo. A new TwIST: two-step iterative shrinkage/thresholding algorithms for image restoration. IEEE Trans. Image Process., 16, 2992-3004(2007).

[16] Y. Liu, X. Yuan, J. Suo, D. J. Brady, Q. Dai. Rank minimization for snapshot compressive imaging. IEEE Trans. Pattern Anal. Mach. Intell., 41, 2990-3006(2019).

[17] L. Wang, Z. Xiong, G. Shi, F. Wu, W. Zeng. Adaptive nonlocal sparse representation for dual-camera compressive hyperspectral imaging. IEEE Trans. Pattern Anal. Mach. Intell., 39, 2104-2111(2017).

[18] X. Yuan, Y. Liu, J. Suo, Q. Dai. Plug-and-play algorithms for large-scale snapshot compressive imaging. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 1447-1457(2020).

[19] A. Wagadarikar, R. John, R. Willett, D. J. Brady. Single disperser design for coded aperture snapshot spectral imaging. Appl. Opt., 47, B44-B51(2008).

[20] X. Lin, Y. Liu, J. Wu, Q. Dai. Spatial-spectral encoded compressive hyperspectral imaging. ACM Trans. Graph., 33, 233(2014).

[21] X. Cao, H. Du, X. Tong, Q. Dai, S. Lin. A prism-mask system for multispectral video acquisition. IEEE Trans. Pattern Anal. Mach. Intell., 33, 2423-2435(2011).

[22] H. Arguello, H. Rueda, Y. Wu, D. W. Prather, G. R. Arce. Higher-order computational model for coded aperture spectral imaging. Appl. Opt., 52, D12-D21(2013).

[23] L. Wang, Z. Xiong, D. Gao, G. Shi, F. Wu. Dual-camera design for coded aperture snapshot spectral imaging. Appl. Opt., 54, 848-858(2015).

[24] C. V. Correa, H. Arguello, G. R. Arce. Snapshot colored compressive spectral imager. J. Opt. Soc. Am. A, 32, 1754-1763(2015).

[25] X. Yuan, T.-H. Tsai, R. Zhu, P. Llull, D. J. Brady, L. Carin. Compressive hyperspectral imaging with side information. IEEE J. Sel. Top. Signal Process., 9, 964-976(2015).

[26] Z. Liu, S. Tan, J. Wu, E. Li, X. Shen, S. Han. Spectral camera based on ghost imaging via sparsity constraints. Sci. Rep., 6, 25718(2016).

[27] X. Li, J. A. Greenberg, M. E. Gehm. Single-shot multispectral imaging through a thin scatterer. Optica, 6, 864-871(2019).

[28] D. S. Jeon, S.-H. Baek, S. Yi, Q. Fu, X. Dun, W. Heidrich, M. H. Kim. Compact snapshot hyperspectral imaging with diffracted rotation. ACM Trans. Graph., 38, 117(2019).

[29] S.-H. Baek, I. Kim, D. Gutierrez, M. H. Kim. Compact single-shot hyperspectral imaging using a prism. ACM Trans. Graph., 36(2017).

[30] M. A. T. Figueiredo, R. D. Nowak, S. J. Wright. Gradient projection for sparse reconstruction: application to compressed sensing and other inverse problems. IEEE J. Sel. Top. Signal Process., 1, 586-597(2007).

[31] M. Aharon, M. Elad, A. Bruckstein. K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans. Signal Process., 54, 4311-4322(2006).

[32] J. Yang, X. Liao, X. Yuan, P. Llull, D. J. Brady, G. Sapiro, L. Carin. Compressive sensing by learning a Gaussian mixture model from measurements. IEEE Trans. Image Process., 24, 106-119(2015).

[33] G. Barbastathis, A. Ozcan, G. Situ. On the use of deep learning for computational imaging. Optica, 6, 921-943(2019).

[34] X. Yuan, Y. Pu. Parallel lensless compressive imaging via deep convolutional neural networks. Opt. Express, 26, 1962-1977(2018).

[35] I. Choi, D. S. Jeon, G. Nam, D. Gutierrez, M. H. Kim. High-quality hyperspectral reconstruction using a spectral prior. ACM Trans. Graph., 36, 218(2017).

[36] Z. Meng, M. Qiao, J. Ma, Z. Yu, K. Xu, X. Yuan. Snapshot multispectral endomicroscopy. Opt. Lett., 45, 3897-3900(2020).

[37] X. Miao, X. Yuan, Y. Pu, V. Athitsos. λ-net: reconstruct hyperspectral images from a snapshot measurement. IEEE/CVF Conference on Computer Vision (ICCV), 4058-4068(2019).

[38] L. Wang, C. Sun, Y. Fu, M. H. Kim, H. Huang. Hyperspectral image reconstruction using a deep spatial-spectral prior. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 8024-8033(2019).

[39] Z. Meng, J. Ma, X. Yuan. End-to-end low cost compressive spectral imaging with spatial-spectral self-attention. European Conference on Computer Vision (ECCV), 187-204(2020).

[40] S. V. Venkatakrishnan, C. A. Bouman, B. Wohlberg. Plug-and-play priors for model based reconstruction. IEEE Global Conference on Signal and Information Processing, 945-948(2013).

[41] S. Sreehari, S. V. Venkatakrishnan, B. Wohlberg, G. T. Buzzard, L. F. Drummy, J. P. Simmons, C. A. Bouman. Plug-and-play priors for bright field electron tomography and sparse interpolation. IEEE Trans. Comput. Imaging, 2, 408-423(2016).

[42] S. H. Chan, X. Wang, O. A. Elgendy. Plug-and-play ADMM for image restoration: fixed-point convergence and applications. IEEE Trans. Comput. Imaging, 3, 84-98(2017).

[43] E. K. Ryu, J. Liu, S. Wang, X. Chen, Z. Wang, W. Yin. Plug-and-play methods provably converge with properly trained denoisers(2019).

[44] L. Zhang, W. Zuo. Image restoration: from sparse and low-rank priors to deep priors. IEEE Signal Process. Mag., 34, 172-179(2017).

[45] K. Zhang, W. Zuo, Y. Chen, D. Meng, L. Zhang. Beyond a Gaussian denoiser: residual learning of deep CNN for image denoising. IEEE Trans. Image Process., 26, 3142-3155(2017).

[46] K. Zhang, W. Zuo, L. Zhang. FFDNet: toward a fast and flexible solution for CNN-based image denoising. IEEE Trans. Image Process., 27, 4608-4622(2018).

[47] A. A. Wagadarikar, N. P. Pitsianis, X. Sun, D. J. Brady. Video rate spectral imaging using a coded aperture snapshot spectral imager. Opt. Express, 17, 6368-6388(2009).

[48] D. Kittle, K. Choi, A. Wagadarikar, D. J. Brady. Multiframe image estimation for coded aperture snapshot spectral imagers. Appl. Opt., 49, 6824-6833(2010).

[49] R. Zhu, T. Tsai, D. J. Brady. Coded aperture snapshot spectral imager based on liquid crystal spatial light modulator. Frontiers in Optics, FW1D-4(2013).

[50] T.-H. Tsai, X. Yuan, D. J. Brady. Spatial light modulator based color polarization imaging. Opt. Express, 23, 11912-11926(2015).

[51] X. Yuan. Generalized alternating projection based total variation minimization for compressive sensing. IEEE International Conference on Image Processing (ICIP), 2539-2543(2016).

[52] S. K. Sahoo, D. Tang, C. Dang. Single-shot multispectral imaging with a monochromatic camera. Optica, 4, 1209-1213(2017).

[53] K. Monakhova, K. Yanny, N. Aggarwal, L. Waller. Spectral diffusercam: lensless snapshot hyperspectral imaging with a spectral filter array. Optica, 7, 1298-1307(2020).

[54] J. Wu, E. Li, X. Shen, S. Yao, Z. Tong, C. Hu, Z. Liu, S. Liu, S. Tan, S. Han. Experimental results of the balloon-borne spectral camera based on ghost imaging via sparsity constraints. IEEE Access, 6, 68740-68748(2018).

[55] E. Candès, J. Romberg, T. Tao. Stable signal recovery from incomplete and inaccurate measurements. Commun. Pure Appl. Math., 59, 1207-1223(2006).

[56] E. J. Candès, T. Tao. Near-optimal signal recovery from random projections: universal encoding strategies?. IEEE Trans. Inf. Theory, 52, 5406-5425(2006).

[57] E. J. Candès. The restricted isometry property and its implications for compressed sensing. C.R. Math., 346, 589-592(2008).

[58] S. Jalali, X. Yuan. Snapshot compressed sensing: performance bounds and algorithms. IEEE Trans. Inf. Theory, 65, 8005-8024(2019).

[59] S. Boyd, N. Parikh, E. Chu, B. Peleato, J. Eckstein. Distributed optimization and statistical learning via the alternating direction method of multipliers. Found. Trends Mach. Learn., 3, 1-122(2011).

[60] N. Parikh, S. Boyd. Proximal algorithms. Found. Trends Optim., 1, 127-239(2014).

[61] W. W. Hager. Updating the inverse of a matrix. SIAM Rev., 31, 221-239(1989).

[62] I. Daubechies, M. Defrise, C. De Mol. An iterative thresholding algorithm for linear inverse problems with a sparsity constraint. Commun. Pure Appl. Math., 57, 1413-1457(2004).

[63] M. Gharbi, G. Chaurasia, S. Paris, F. Durand. Deep joint demosaicking and denoising. ACM Trans. Graph., 35(2016).

[64] M. Tassano, J. Delon, T. Veit. FastDVDnet: towards real-time deep video denoising without flow estimation. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 1354-1363(2020).

[65] A. Maffei, J. M. Haut, M. E. Paoletti, J. Plaza, L. Bruzzone, A. Plaza. A single model CNN for hyperspectral image denoising. IEEE Trans. Geosci. Remote Sens., 58, 2516-2529(2020).

[66] F. Yasuma, T. Mitsunaga, D. Iso, S. K. Nayar. Generalized assorted pixel camera: postcapture control of resolution, dynamic range, and spectrum. IEEE Trans. Image Process., 19, 2241-2253(2010).

[67] A. Paszke, H. Wallach, S. Gross, H. Larochelle, A. Beygelzimer, F. Massa, F. d’ Alché-Buc, A. Lerer, J. Bradbury, E. Fox, G. Chanan, R. Garnett, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga, A. Desmaison, A. Kopf, E. Yang, Z. DeVito, M. Raison, A. Tejani, S. Chilamkurthy, B. Steiner, L. Fang, J. Bai, S. Chintala. PyTorch: an imperative style, high-performance deep learning library. Advances in Neural Information Processing Systems 32, 8024-8035(2019).

[68] D. P. Kingma, J. Ba. Adam: a method for stochastic optimization(2014).

[69] B. Arad, O. Ben-Shahar. Sparse recovery of hyperspectral signal from natural RGB images. European Conference on Computer Vision, 19-34(2016).

[70] O. Ronneberger, P. Fischer, T. Brox. U-net: convolutional networks for biomedical image segmentation. International Conference on Medical Image Computing and Computer-Assisted Intervention, 234-241(2015).

[71] Z. Wang, A. C. Bovik, H. R. Sheikh, E. P. Simoncelli. Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process., 13, 600-612(2004).

[72] T. Smith, J. Guild. The C.I.E. colorimetric standards and their use. Trans. Opt. Soc., 33, 73-134(1931).

微信扫一扫：分享

微信扫一扫：分享