Massively parallel universal linear transformations using a wavelength-multiplexed diffractive optical network

Jingxi Li; Tianyi Gan; Bijie Bai; Yi Luo; Mona Jarrahi; Aydogan Ozcan

doi:10.1117/1.AP.5.1.016003

Journals >Advanced Photonics >Volume 5 >Issue 1 >Page 016003 > Article

Advanced Photonics
Vol. 5, Issue 1, 016003 (2023)

Massively parallel universal linear transformations using a wavelength-multiplexed diffractive optical network

Jingxi Li^1、2、3, Tianyi Gan^1、3, Bijie Bai^1、2、3, Yi Luo^1、2、3, Mona Jarrahi^1、3, and Aydogan Ozcan^{1、2、3、*}

Author Affiliations

¹University of California, Electrical and Computer Engineering Department, Los Angeles, California, United States

²University of California, Bioengineering Department, Los Angeles, California, United States

³University of California, California NanoSystems Institute, Los Angeles, California, United States

show less

DOI: 10.1117/1.AP.5.1.016003 Cite this Article Set citation alerts

Jingxi Li, Tianyi Gan, Bijie Bai, Yi Luo, Mona Jarrahi, Aydogan Ozcan. Massively parallel universal linear transformations using a wavelength-multiplexed diffractive optical network[J]. Advanced Photonics, 2023, 5(1): 016003 Copy Citation Text

EndNote(RIS)

BibTex

Plain Text

show less

$Schematic of massively parallel, wavelength-multiplexed diffractive optical computing. Optical layout of the wavelength-multiplexed diffractive neural network, where the diffractive layers are jointly trained to perform Nw different arbitrarily selected, complex-valued linear transformations between the input field i and the output field o′ using wavelength multiplexing. The optical fields at the input FOV, i1,i2,…,iNw, are encoded at a predetermined set of distinct wavelengths λ1,λ2,…,λNw, respectively, using a wavelength multiplexing (“MUX”) scheme. At the output FOV of the broadband diffractive network, wavelength demultiplexing (“DEMUX”) is performed to extract the diffractive output fields o1′,o2′,…,oNw′ at the corresponding wavelengths λ1,λ2,…,λNw, respectively, which represent the all-optical estimates of the target output fields o1,o2,…,oNw, corresponding to the target linear transformations (A1,A2,…,ANw). Through this diffractive architecture, Nw different arbitrarily selected complex-valued linear transformations can be all-optically performed at distinct wavelengths, running in parallel channels of the broadband diffractive processor.$

Fig. 1. Schematic of massively parallel, wavelength-multiplexed diffractive optical computing. Optical layout of the wavelength-multiplexed diffractive neural network, where the diffractive layers are jointly trained to perform

N_{w}

different arbitrarily selected, complex-valued linear transformations between the input field

i

and the output field

o^{'}

using wavelength multiplexing. The optical fields at the input FOV,

i_{1}, i_{2}, \dots, i_{N_{w}}

, are encoded at a predetermined set of distinct wavelengths

λ_{1}, λ_{2}, \dots, λ_{N_{w}}

, respectively, using a wavelength multiplexing (“MUX”) scheme. At the output FOV of the broadband diffractive network, wavelength demultiplexing (“DEMUX”) is performed to extract the diffractive output fields

o_{1}^{'}, o_{2}^{'}, \dots, o_{N_{w}}^{'}

at the corresponding wavelengths

λ_{1}, λ_{2}, \dots, λ_{N_{w}}

, respectively, which represent the all-optical estimates of the target output fields

o_{1}, o_{2}, \dots, o_{N_{w}}

, corresponding to the target linear transformations (

A_{1}, A_{2}, \dots, A_{N_{w}}

). Through this diffractive architecture,

N_{w}

different arbitrarily selected complex-valued linear transformations can be all-optically performed at distinct wavelengths, running in parallel channels of the broadband diffractive processor.

Download full size | View in the Article

$All-optical transformation performances of broadband diffractive networks using different numbers of wavelength channels. (a) As examples, we show the amplitude and phase of the first eight matrices in {A1,A2,…,A32} that were randomly generated, serving as the ground truth (target) for the diffractive all-optical transformations. See Fig. S1 in the Supplementary Material for the cosine similarity values calculated between any two combinations of these 32 target linear transformation matrices. (b) The mean values of the normalized MSE between the ground-truth transformation matrices (Aw) and the corresponding all-optical transforms (Aw′) across different wavelength channels are reported as a function of the number of diffractive neurons N. The results of the diffractive networks using different numbers of wavelength channels (Nw) are encoded with different colors, and the space between the simulation data points is linearly interpolated. Nw ∈ {1, 2, 4, 8, 16, and 32}, N ∈ {3.9k, 8.2k, 16.9k, 32.8k, 64.8k, 131.1k, 265.0k} and Ni=No=82. (c) Same as (b) but the cosine similarity values between the all-optical transforms and their ground truth are reported. (d) Same as (b) but the MSE values between the diffractive network output fields and the ground-truth output fields are reported.$

Fig. 2. All-optical transformation performances of broadband diffractive networks using different numbers of wavelength channels. (a) As examples, we show the amplitude and phase of the first eight matrices in

{A_{1}, A_{2}, \dots, A_{32}}

that were randomly generated, serving as the ground truth (target) for the diffractive all-optical transformations. See Fig. S1 in the Supplementary Material for the cosine similarity values calculated between any two combinations of these 32 target linear transformation matrices. (b) The mean values of the normalized MSE between the ground-truth transformation matrices (

A_{w}

) and the corresponding all-optical transforms (

A_{w}^{'}

) across different wavelength channels are reported as a function of the number of diffractive neurons

N

. The results of the diffractive networks using different numbers of wavelength channels

(N_{w})

are encoded with different colors, and the space between the simulation data points is linearly interpolated.

N_{w}

∈ {1, 2, 4, 8, 16, and 32},

N

∈ {3.9k, 8.2k, 16.9k, 32.8k, 64.8k, 131.1k, 265.0k} and

N_{i} = N_{o} = 8^{2}

. (c) Same as (b) but the cosine similarity values between the all-optical transforms and their ground truth are reported. (d) Same as (b) but the MSE values between the diffractive network output fields and the ground-truth output fields are reported.

Download full size | View in the Article

$All-optical transformation performances of the individual wavelength channels in broadband diffractive network designs with N≈2NwNiNo and Ni=No=82. The output field errors (MSEOutput) for the all-optical linear transforms achieved by the wavelength-multiplexed diffractive network models with (a) 2-channel wavelength multiplexing (Nw=2), N≈4NiNo; (b) 4-channel wavelength multiplexing (Nw=4), N≈8NiNo; (c) 8-channel wavelength multiplexing (Nw=8), N≈16NiNo; (d) 16-channel wavelength multiplexing (Nw=16), N≈32NiNo; and (e) 32-channel wavelength multiplexing (Nw=32), N≈64NiNo. The standard deviations (error bars) of these metrics are calculated across the entire testing data set.$

Fig. 3. All-optical transformation performances of the individual wavelength channels in broadband diffractive network designs with

N \approx 2 N_{w} N_{i} N_{o}

and

N_{i} = N_{o} = 8^{2}

. The output field errors

({MSE}_{O utput})

for the all-optical linear transforms achieved by the wavelength-multiplexed diffractive network models with (a) 2-channel wavelength multiplexing (

N_{w} = 2

N \approx 4 N_{i} N_{o}

; (b) 4-channel wavelength multiplexing (

N_{w} = 4

N \approx 8 N_{i} N_{o}

; (c) 8-channel wavelength multiplexing (

N_{w} = 8

N \approx 16 N_{i} N_{o}

; (d) 16-channel wavelength multiplexing (

N_{w} = 16

N \approx 32 N_{i} N_{o}

; and (e) 32-channel wavelength multiplexing (

N_{w} = 32

N \approx 64 N_{i} N_{o}

. The standard deviations (error bars) of these metrics are calculated across the entire testing data set.

Download full size | View in the Article

$All-optical transformation matrices estimated by two different wavelength-multiplexed broadband diffractive networks with Nw=8 and Ni=No=82. The first broadband diffractive network has N≈2NwNiNo=16NiNo=64,800 trainable diffractive neurons. The second broadband diffractive network has N≈4NwNiNo=32NiNo=131,100 trainable diffractive neurons. The absolute differences between these all-optical transformation matrices and the corresponding ground-truth (target) matrices are also shown in each case. N=131,100 diffractive design achieves a much smaller and negligible absolute error due to the increased degrees of freedom.$

Fig. 4. All-optical transformation matrices estimated by two different wavelength-multiplexed broadband diffractive networks with

N_{w} = 8

and

N_{i} = N_{o} = 8^{2}

. The first broadband diffractive network has

N \approx 2 N_{w} N_{i} N_{o} = 16 N_{i} N_{o} = 64,800

trainable diffractive neurons. The second broadband diffractive network has

N \approx 4 N_{w} N_{i} N_{o} = 32 N_{i} N_{o} = 131,100

trainable diffractive neurons. The absolute differences between these all-optical transformation matrices and the corresponding ground-truth (target) matrices are also shown in each case.

N = 131,100

diffractive design achieves a much smaller and negligible absolute error due to the increased degrees of freedom.

Download full size | View in the Article

$Examples of the input/output complex fields for the ground-truth (target) transformations along with the all-optical output fields resulting from the 8-channel wavelength-multiplexed diffractive design using N≈4NwNiNo=32NiNo=131,100. Absolute errors between the ground-truth output fields and the all-optical diffractive network output fields are negligible. Note that |∠o−∠o′^|π indicates the wrapped phase difference between the ground-truth output field o and the normalized diffractive network output field o′^.$

Fig. 5. Examples of the input/output complex fields for the ground-truth (target) transformations along with the all-optical output fields resulting from the 8-channel wavelength-multiplexed diffractive design using

N \approx 4 N_{w} N_{i} N_{o} = 32 N_{i} N_{o} = 131,100

. Absolute errors between the ground-truth output fields and the all-optical diffractive network output fields are negligible. Note that

{| ∠ o - ∠ \hat{o^{'}} |}_{π}

indicates the wrapped phase difference between the ground-truth output field

o

and the normalized diffractive network output field

\hat{o^{'}}

Download full size | View in the Article

$Exploration of the limits of the number of wavelength channels (Nw) that can be implemented in a broadband diffractive network. (a) The mean values of the normalized MSE between the ground-truth transformation matrices (Aw) and the all-optical transforms (Aw′) across different wavelength channels are reported as a function of Nw∈{1,2,4,8,16,32,64,128,184}. The results of the broadband diffractive networks using different numbers of diffractive neurons (N) are presented with different colors: N∈{1.5NwNiNo,2NwNiNo,3NwNiNo}. Dotted lines are fitted based on the data points whose diffractive networks share the same N. (b) Same as (a) but the cosine similarity values between the all-optical transforms and their ground truth are reported. (c) Same as (a) but the MSE values between the diffractive network output fields and the ground-truth output fields are reported. Ni=No=52.$

Fig. 6. Exploration of the limits of the number of wavelength channels

(N_{w})

that can be implemented in a broadband diffractive network. (a) The mean values of the normalized MSE between the ground-truth transformation matrices (

A_{w}

) and the all-optical transforms (

A_{w}^{'}

) across different wavelength channels are reported as a function of

N_{w} \in {1, 2, 4, 8, 16, 32, 64, 128, 184}

. The results of the broadband diffractive networks using different numbers of diffractive neurons (

N

) are presented with different colors:

N \in {1.5 N_{w} N_{i} N_{o}, 2 N_{w} N_{i} N_{o}, 3 N_{w} N_{i} N_{o}}

. Dotted lines are fitted based on the data points whose diffractive networks share the same

N

N_{i} = N_{o} = 5^{2}

Download full size | View in the Article

$The impact of material dispersion and losses on the all-optical transformation performance of wavelength-multiplexed broadband diffractive networks. (a) The mean values of the normalized MSE between the ground-truth transformation matrices (Aw) and the all-optical transforms (Aw′) across different wavelength channels are reported as a function of the material of the diffractive layers. The results of the diffractive networks trained with and without diffraction efficiency penalty are presented in yellow and purple colors, respectively. Nw=128, N=3NwNiNo, and Ni=No=52. (b) Same as (a) but the cosine similarity values between the all-optical transforms and their ground truth are reported. (c) Same as (a) but the MSE values between the diffractive network output fields and the ground-truth fields are reported. (d) The mean diffraction efficiencies of the presented diffractive models across all the wavelength channels. (e) Diffraction efficiency of the individual wavelength channels for the broadband diffractive network model presented in (a)–(d) that uses the dielectric material without the diffraction efficiency-related penalty term in its loss function. (f) Same as (e), but the diffractive network was trained using a loss function with the diffraction efficiency-related penalty term.$

Fig. 7. The impact of material dispersion and losses on the all-optical transformation performance of wavelength-multiplexed broadband diffractive networks. (a) The mean values of the normalized MSE between the ground-truth transformation matrices (

A_{w}

) and the all-optical transforms (

A_{w}^{'}

) across different wavelength channels are reported as a function of the material of the diffractive layers. The results of the diffractive networks trained with and without diffraction efficiency penalty are presented in yellow and purple colors, respectively.

N_{w} = 128

N = 3 N_{w} N_{i} N_{o}

, and

N_{i} = N_{o} = 5^{2}

. (b) Same as (a) but the cosine similarity values between the all-optical transforms and their ground truth are reported. (c) Same as (a) but the MSE values between the diffractive network output fields and the ground-truth fields are reported. (d) The mean diffraction efficiencies of the presented diffractive models across all the wavelength channels. (e) Diffraction efficiency of the individual wavelength channels for the broadband diffractive network model presented in (a)–(d) that uses the dielectric material without the diffraction efficiency-related penalty term in its loss function. (f) Same as (e), but the diffractive network was trained using a loss function with the diffraction efficiency-related penalty term.

Download full size | View in the Article

$All-optical transformation performance of broadband diffractive network designs with Nw=184, reported as a function of N and the bit depth of the diffractive neurons. (a) The mean values of normalized MSE between the ground-truth transformation matrices (Aw) and the all-optical transforms (Aw′) across different wavelength channels are reported as a function of N. The results of the diffractive networks using different bit depths of the diffractive neurons, including 4, 8, 12, and 32, are encoded with different colors, and the space between the data points is linearly interpolated. N∈{0.5NwNiNo=56,000,NwNiNo=115.000,2NwNiNo=231,000,4NwNiNo=461,000}, and Ni=No=52. (b) Same as (a) but the cosine similarity values between the all-optical transforms and their ground truth are reported. (c) Same as (a) but the MSE values between the diffractive network output fields and the ground-truth output fields are reported.$

Fig. 8. All-optical transformation performance of broadband diffractive network designs with

N_{w} = 184

, reported as a function of

N

and the bit depth of the diffractive neurons. (a) The mean values of normalized MSE between the ground-truth transformation matrices (

A_{w}

) and the all-optical transforms (

A_{w}^{'}

) across different wavelength channels are reported as a function of

N

. The results of the diffractive networks using different bit depths of the diffractive neurons, including 4, 8, 12, and 32, are encoded with different colors, and the space between the data points is linearly interpolated.

N \in {0.5 N_{w} N_{i} N_{o} = 56,000, N_{w} N_{i} N_{o} = 115.000, 2 N_{w} N_{i} N_{o} = 231,000, 4 N_{w} N_{i} N_{o} = 461,000}

, and

N_{i} = N_{o} = 5^{2}

Download full size | View in the Article

$The impact of the encoding wavelength error on the all-optical linear transformation performance of a wavelength-multiplexed broadband diffractive network; Nw=4, N≈2NwNiNo=8NiNo, and Ni=No=82. (a) The normalized MSE values between the ground-truth transformation matrices (Aw) and the all-optical transforms (Aw′) for the four different wavelength channels are reported as a function of the wavelengths used during the testing. The results of the different wavelength channels are shown with different colors, and the space between the simulation data points is linearly interpolated. (b) Same as (a) but the cosine similarity values between the all-optical transforms and their ground truth are reported. (c) Same as (a) but the MSE values between the diffractive network output fields and the ground-truth output fields are reported. The shaded areas indicate the standard deviation values calculated based on all the samples in the testing data set.$

Fig. 9. The impact of the encoding wavelength error on the all-optical linear transformation performance of a wavelength-multiplexed broadband diffractive network;

N_{w} = 4

N \approx 2 N_{w} N_{i} N_{o} = 8 N_{i} N_{o}

, and

N_{i} = N_{o} = 8^{2}

. (a) The normalized MSE values between the ground-truth transformation matrices (

A_{w}

) and the all-optical transforms (

A_{w}^{'}

) for the four different wavelength channels are reported as a function of the wavelengths used during the testing. The results of the different wavelength channels are shown with different colors, and the space between the simulation data points is linearly interpolated. (b) Same as (a) but the cosine similarity values between the all-optical transforms and their ground truth are reported. (c) Same as (a) but the MSE values between the diffractive network output fields and the ground-truth output fields are reported. The shaded areas indicate the standard deviation values calculated based on all the samples in the testing data set.

Download full size | View in the Article

$An example of a wavelength-multiplexed diffractive network (Nw=8, N≈2NwNiNo=16NiNo=64,800) that all-optically performs eight different permutation (encoding) operations between its input and output FOVs, with each target permutation matrix assigned to a unique wavelength. (a) Input/output examples. Each one of the Nw=8 wavelength channels in the diffractive processor is assigned to a different permutation matrix Pw. The absolute differences between the diffractive network output fields and the ground-truth (target) permuted (encoded) output fields are also shown in the last column. (b) Arbitrarily generated permutation matrices P1,P2,…,P8 that serve as the ground truth (target) for the wavelength-multiplexed diffractive permutation transformations shown in (a).$

Fig. 10. An example of a wavelength-multiplexed diffractive network (

N_{w} = 8

N \approx 2 N_{w} N_{i} N_{o} = 16 N_{i} N_{o} = 64,800

) that all-optically performs eight different permutation (encoding) operations between its input and output FOVs, with each target permutation matrix assigned to a unique wavelength. (a) Input/output examples. Each one of the

N_{w} = 8

wavelength channels in the diffractive processor is assigned to a different permutation matrix

P_{w}

. The absolute differences between the diffractive network output fields and the ground-truth (target) permuted (encoded) output fields are also shown in the last column. (b) Arbitrarily generated permutation matrices

P_{1}, P_{2}, \dots, P_{8}

that serve as the ground truth (target) for the wavelength-multiplexed diffractive permutation transformations shown in (a).

Download full size | View in the Article

$Experimental validation of a wavelength-multiplexed diffractive network with Nw=2 and Ni=No=32. (a) Photograph of the experimental setup, including the schematic of the THz setup. (b) The fabricated wavelength-multiplexed diffractive processor. (c) The learned thickness profiles of the diffractive layers. (d) Photographs of the 3D-printed diffractive layers. (e) Experimental results of the diffractive processor for the two wavelength channels λ1=0.667 mm and λ2=0.698 mm using the fabricated diffractive layers, which reveal a good agreement with their numerical counterparts and the ground truth. λm=(λ1+λ2)/2=0.6825 mm.$

Fig. 11. Experimental validation of a wavelength-multiplexed diffractive network with

N_{w} = 2

and

N_{i} = N_{o} = 3^{2}

. (a) Photograph of the experimental setup, including the schematic of the THz setup. (b) The fabricated wavelength-multiplexed diffractive processor. (c) The learned thickness profiles of the diffractive layers. (d) Photographs of the 3D-printed diffractive layers. (e) Experimental results of the diffractive processor for the two wavelength channels