
- Chinese Optics Letters
- Vol. 22, Issue 10, 102201 (2024)
Abstract
1. Introduction
Optical neural networks (ONNs), including holographic neural networks[1], large-scale photonic recurrent neural networks[2], integrated photonic neural networks[3], and diffractive neural networks (DNNs) at terahertz and near-infrared wavelengths[4,5], have been an intense research field for deep learning. Specifically, ONNs use photons instead of electrons for computation, offering high-speed optical communication (speed of light in a medium) and massive parallelism of optical signals (multiplexing in time, space, wavelength, polarization, orbital angular momentum, etc.). Among all these ONNs, the free-space DNNs exploit the optical inference induced by the multilayer diffraction plates (amplitude or phase plates) and have demonstrated their outstanding advantages, such as high speed (the speed of light), lower power consumption, and high neuron density in applications including medical image analysis[6,7], image recognition[8–10], and classification[11–13].
Recent progress in DNNs has demonstrated the requirement to fabricate DNNs with smaller device footprints[5,14], pushing the working wavelengths from terahertz to the near-infrared region, then further to visible wavelengths, so that the compact DNN devices can be integrated with microscopic systems, realizing the DNNs for lab-on-chip applications[15–17]. Additionally, the biological samples are commonly labeled by probes fluorescing at visible wavelengths[18,19]. However, DNNs working at the visible wavelength fabricated by electron beam lithography (EBL) are limited to the two-dimensional (2D)[20]. This large-sized DNN at the visible wavelength is not suitable for applications in microscopic imaging systems. Therefore, the development of three-dimensional (3D) micrometer-scale DNNs working at the visible wavelength is of great significance for applications in microscopic imaging systems.
This work presents the investigation of 3D micrometer-scale DNNs functioning at the visible wavelength manufactured by the two-photon nanolithography (TPN) technique[21–26]. Two types of diffraction elements, i.e., cylindrical and cubical diffraction neurons, are studied for their amplitude and phase modulation properties, and finite-difference time-domain (FDTD) simulations demonstrated that cubical diffraction neurons provide an average of 50% higher optical amplitude and phase modulation efficiency compared with the cylindrical diffraction neurons. Based on the theoretical analysis of DNNs for handwritten digit classification[20], the performance of micrometer-scale DNNs at the visible wavelength is further studied as a function of the sizes of the detecting area, and the number of diffraction layers for circularly distributed detector patterns. Experimentally, the single-layer and two-layer DNNs with
Sign up for Chinese Optics Letters TOC. Get the latest issue of Chinese Optics Letters delivered right to you!Sign up now
2. Results
The schematic concept of DNN is shown in Fig. 1(A), where the DNN exploits the nature of light propagation in free space and the interaction of light fields (amplitude and phase) with thin diffraction layers to perform passive matrix multiplication operations in the optical domain. For DNNs performing image classification, the input image is projected to the first layer, where every neuron on the first layer transmits input light to other neurons on the following layers according to Huygens’ principle and the Rayleigh–Sommerfeld diffraction equation[27]. A neuron of a given DNN layer can be considered as a secondary wave source that waves propagate in the following optical mode:
Figure 1.Concept of the micrometer-scale DNN at the visible wavelength and the comparison of cubical neurons and cylindrical neurons. (A) The DNN for image classification is based on light propagation according to Huygens’ principle and the Rayleigh–Sommerfeld diffraction theory. (B) Cylindrical diffraction neurons with different heights; (C) cubical diffraction neurons with different heights; (D) phase modulation as a function of heights for cubical neurons (squares) and cylindrical neurons (dots) simulated by FDTD, where the red dashed line represents the results of the OPD formula; (E) cubical neurons maintain a higher amplitude modulation efficiency than cylindrical neurons.
Therefore, a neuron establishes connections with neurons in subsequent layers via a secondary wave that undergoes amplitude and phase modulation influenced by the input interference pattern from preceding layers and the neuron’s own modulation abilities. Defining several detector areas on the output layer of DNN, the classification criterion is to find the detector area with the strongest energy distribution.
To realize micrometer-scale DNNs at the visible wavelength, two typically used diffraction elements, i.e., cylindrical and cubical diffraction neurons, are studied for their amplitude and phase modulation properties[5,28]. Theoretically, the optical properties of the cylindrical and cubical diffraction neurons can be understood, in the first approximation, within the Fresnel diffraction picture: the light transmitted through the neurons can be divided into two parts, one propagating in the neurons and the other in the surrounding medium (air). The FDTD method is used to further investigate the amplitude and phase modulation characteristics of both cylindrical and cubical neurons.
First, it is revealed that cubical diffraction neurons demonstrated a more effective phase modulation capability than cylindrical neurons. Compared with the optical path difference (OPD) formula (shown in Fig. 1(D) red dashed line),
In terms of amplitude modulation capability, cubical diffraction neurons demonstrate a higher amplitude modulation efficiency than cylindrical diffraction neurons. As is demonstrated in Fig. 1(E), cubical neurons maintain a higher amplitude modulation efficiency than cylindrical neurons, where the efficiency is defined as the ratio between the energy transmitted through the neuron and the energy leaked into the spacing between the neurons, which results from the excitation of high-order modes due to the spacings between cylindrical neurons[5,29]. In our simulation, diffraction neurons with the size of 0.8 µm, and four diffraction neurons modulating the phase change from 0, π/4, π/2, and
Furthermore, it is found that the size of the detector area is an important factor influencing the performance of DNNs for image classification. First, we designed and investigated a single-layer DNN for two types of image classification. Given a single-layer DNN with a total size of 84 neurons by 84 neurons, the classification accuracy is a function of the size of the detector area, i.e., the number of neurons in the detector area. The numerical simulation results are shown in Fig. 2(A), where classification accuracy is plotted as a function of varied sizes of the detector area. With the decrease of the size of the detector area from
Figure 2.Analysis of the influence of the detector area sizes on the classification accuracy of the DNN. (A) Classification accuracy of the single-layer two-classifier DNN as a function of detector area size; (B) numerically calculated phase distribution for the single-layer two-classifier DNN with the detector area size of 6 neurons × 6 neurons; (C) classification accuracy of the two-layer 10-classifier DNN as a function of detector area size; (D) numerically calculated phase distribution for the two-layer 10-classifier DNN with the detector area size of 6 neurons × 6 neurons.
In addition, the accuracy of a two-layer DNN for 10 types of image classification demonstrates a similar relationship in Fig. 2(C). It is easy to understand that the smaller detector area provides a smaller region for the previous layer for the convergence of energy, which leads to an area with more light energy, thus increasing the classification accuracy. The corresponding phase distribution of the two-layer DNN is plotted in Fig. 2(D).
This numerical simulation was performed by using Google’s TensorFlow (v2.7.0, Google Inc., USA) framework on a computer with a GeForce RTX 3060 graphical processing unit (GPU) and an IntelR CoreTM i5-12490F central processing unit (CPU) @3.00 GHz and 32 GB of random access memory (RAM), running the Windows 11 operating system (Microsoft Corporation, USA). Phase-only modulation two-classifier DNN was designed by training a diffraction layer with 8090 images (720 validation images) from the Modified National Institute of Standards and Technology (MNIST) handwritten digit database[30]. For the two-layer 10-classifier DNN, 42,000 training images and 28,000 validation images are also from the MNIST handwritten digit database. Both the single-layer DNN and two-layer DNN are simulated with a size of
Based on the numerical simulation above, single-layer and two-layer DNNs functioning at the visible wavelength were experimentally fabricated using a home-built 3D DLW system. The system consists of a femtosecond laser at a wavelength of 532 nm and a 3D piezo nanotranslation stage (detailed system setup is shown in Section 4), where the arbitrary 3D microstructures can be fabricated through the accurate movement of the nanotranslation stage. To characterize the ability to classify and quantify the performance of single-layer DNN, we used the characterization setup depicted in Section 4. The input images of the handwritten digits were generated by spatially modulating the light from a 532 nm laser source using a spatial light modulator (SLM) and illuminating on the DNN using a 4f system and objective to resize the illuminated digit. The detector area on the output layer was imaged through an objective and detected using a charge-coupled device (CCD) camera.
The fabricated single-layer two-classifier DNN presents a good performance in the classification of 0 and 1. A CAD model of single-layer DNN in Fig. 3(A) realizes the process of the classification of numbers 0 and 1. Figure 3(B) shows the phase map of a single-layer two-classifier DNN and its scanning electron microscope (SEM) image. In Fig. 3(C), we report the characterization of single-layer two-classifier DNN illuminated by handwritten digits 0 and 1. For the 56 digits generated by the SLM, the experimental blind testing accuracy was 89.3%, close to the simulated result of 99.98%. To further evaluate the performance of the single-layer two-classifier DNN, we calculated and normalized the energy distribution of two detector areas. The normalized energy distribution of two detector areas of the experimental results conforms to the simulated results.
Figure 3.Single-layer two-classifier DNN for digit classification. (A) CAD model of the single-layer two-classifier DNN for digit classification; (B) left is the numerical phase distribution map of the single-layer two-classifier DNN, and right is the SEM image of the fabricated height map transformed from the phase distribution map (scale bar is 20 µm). (C) Experimental classification results of the single-layer two-classifier DNN and normalized energy distribution compared with numerical results.
Furthermore, a DNN with ten detector areas and two layers was developed and investigated. The results are presented in Fig. 4. The CAD model of the two-layer 10-classifier DNN is depicted in Fig. 4(A), where the optimized parameters for the distance between the layers and the size of the neurons were determined through the iterative training, where the layer distance is 40 µm, diffraction layer size is
Figure 4.Design and fabrication results of the two-layer 10-classifier DNN. (A) CAD model of the two-layer 10-classifier DNN; (B) numerical phase distribution map of the neurons on layer 1 and layer 2; (C) perspective SEM image of the fabricated two-layer 10-classifier DNN with a tilting angle of 25° (scale bar is 20 µm); (D) SEM images of the fabricated height maps of the neurons on layer 1 and layer 2 transformed from the phase distribution map in (B).
The experimental characterization of a two-layer 10-classifier DNN was achieved using a benchtop optical setup, which is discussed in the previous section of this paper. Optical images of handwritten digits ranging from 0 to 9 were projected on the first layer of the two-layer 10-classifier DNN, the position of which was precisely aligned by a combination of a 3D microtranslation stage and a 2D microtranslation stage under an objective with a magnification of
Figure 5.Experimental classification and simulated results of the two-layer 10-classifier DNN. Output images of handwritten digits ranging from 0 to 9 (the square is the detector area) and the histograms of normalized energy distribution, where orange represents the normalized experimental energy distribution and green represents the normalized numerical energy distribution.
As is shown in Fig. 5, the measured normalized intensity distributions demonstrate a consistent digit classification capability of the fabricated two-layer 10-classifier. For instance, in Fig. 5A(1), the optical image of digit 0 was incident onto the first layer of the DNN, and the diffraction energy distribution captured on the output position shows the strongest energy distribution at the detector area-0, while other detector areas have much less energy. This result is confirmed by both the energy distribution image and the histogram results in Fig. 5A(1). Therefore, the experimental results of all handwritten digits from 0 to 9 illustrated that the fabricated two-layer 10-classifier has an outstanding capability of image classification. For the 150 digits generated by the SLM, the experimental blind testing accuracy was 83.3%, close to the simulated result of 92.51%. The details of the theoretical calculation results and the experimental data, including the confusion matrix and the energy distribution, are shown in Section 4.
3. Conclusion
In conclusion, we designed and investigated the micrometer-scale DNNs at the visible wavelength. Different from DNNs working at the visible wavelength fabricated by EBL[20], the 3D DLW technique was used in this work to fabricate an ultracompact micrometer-scale DNN. By utilizing the layer-by-layer slicing method to fabricate cubical neurons, the cross talk caused by the gap between adjacent neurons is mitigated. Theoretical investigations on the performance of micrometer-scale DNNs with both one and two diffraction layers are conducted, taking into account the size of the detecting areas and the number of diffraction layers. The single-layer DNN for the two-classifier demonstrated an experimental classification accuracy of 89.3% when the detector area measured
4. Materials
4.1. Nanolithographic system and photoresist
As shown in Fig. 6, a femtosecond fiber laser (Coherent Axon) provides laser light at a wavelength of 532 nm. The laser pulses with a width of less than 150 fs and a repetition rate of 80 MHz are steered into a 1.4 NA
Figure 6.Home-built 3D TPN system.
4.2. Characterization system and confusion matrix
The characterization system is shown in Fig. 7(A). The input images of handwritten digits are generated by the SLM using a laser source at the wavelength of 532 nm, and the experimental results are captured by a CCD camera. The classification results are shown in Fig. 7(B), where the vertical axis represents the classification results of the DNN, and the horizontal axis represents the true labels of input digits. The classification accuracy achieved 89.3% for the single-layer two-classifier DNN and 83.3% for the two-layer 10-classifier DNN. The test data are not used in the training of the DNNs.
Figure 7.Design and results of characterization experiment. (A) Schematic of characterization system. (B) Experimentally obtained confusion matrix for classification results.
To accurately align the sample and the input images, a border-only images (shown in the figure below) with the same size as the test images are loaded onto the SLM, where the size of the border-only image is adjusted to adapt to the size of the DNN sample. This process enables the accurate generation of test images with the same size of the sample. At the same time, we used a 3D micrometer translation stage to align the sample with image at the focusing plane of the objective. Therefore, these methods ensure that in the

Set citation alerts for the article
Please enter your email address