• Photonics Research
  • Vol. 12, Issue 4, 682 (2024)
Wanxin Shi1、2、†, Xi Jiang3、†, Zheng Huang1, Xue Li3, Yuyang Han1, Sigang Yang1, Haizheng Zhong3, and Hongwei Chen1、*
Author Affiliations
  • 1Beijing National Research Center for Information Science and Technology (BNRist), Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
  • 2China Mobile Research Institute, Beijing 100053, China
  • 3MIIT Key Laboratory for Low-dimensional Quantum Structure and Devices, School of Materials Sciences & Engineering, Beijing Institute of Technology, Beijing 100081, China
  • show less
    DOI: 10.1364/PRJ.515349 Cite this Article Set citation alerts
    Wanxin Shi, Xi Jiang, Zheng Huang, Xue Li, Yuyang Han, Sigang Yang, Haizheng Zhong, Hongwei Chen. Lensless opto-electronic neural network with quantum dot nonlinear activation[J]. Photonics Research, 2024, 12(4): 682 Copy Citation Text show less

    Abstract

    With the swift advancement of neural networks and their expanding applications in many fields, optical neural networks have gradually become a feasible alternative to electrical neural networks due to their parallelism, high speed, low latency, and power consumption. Nonetheless, optical nonlinearity is hard to realize in free-space optics, which restricts the potential of the architecture. To harness the benefits of optical parallelism while ensuring compatibility with natural light scenes, it becomes essential to implement two-dimensional spatial nonlinearity within an incoherent light environment. Here, we demonstrate a lensless opto-electrical neural network that incorporates optical nonlinearity, capable of performing convolution calculations and achieving nonlinear activation via a quantum dot film, all without an external power supply. Through simulation and experiments, the proposed nonlinear system can enhance the accuracy of image classification tasks, yielding a maximum improvement of 5.88% over linear models. The scheme shows a facile implementation of passive incoherent two-dimensional nonlinearities, paving the way for the applications of multilayer incoherent optical neural networks in the future.

    1. INTRODUCTION

    Over the past decade, deep learning anchored in neural networks [1] has ushered in transformative changes across a myriad of fields, including computational imaging [2,3], autonomous driving [4], natural language processing [5], and numerous other intelligent applications [68]. However, the escalating demand for even faster and more energy-efficient systems has imposed constraints on the further development of electrical neural networks. In response to this challenge, researchers have turned to the realm of optical neural networks (ONNs) [913], harnessing their exceptional computing speed, massive parallelism, and low latency as promising alternative solutions. Considering compatibility with existing machine vision systems, such as the applications in practical natural scenarios, the emergence of hybrid opto-electric neural networks (OENNs) [1416] also offers a potential avenue.

    In most ONNs and OENNs, optics only completes linear operations, while the nonlinear activation is implemented electronically. This creates a significant barrier to implementing spatial nonlinear activation functions—a small but essential component of ONNs. Recently, some all-optical multilayer neural network schemes have been proposed to increase the accuracy of tasks. Ideally, when there is no nonlinearity between network layers, multilayer operations could be equivalent to single-layer matrix calculations. Correspondingly, experimental data indicates that an all-optical multilayer neural network, which lacks optical nonlinearity between layers, exhibits limited enhancements in task accuracy. Thus, its applicability in other visual task scenarios is also limited.

    In the pursuit of advancing optical neural networks, researchers have been exploring methods to achieve optical nonlinear activation functions with modulations. Previous research has demonstrated various approaches to realizing such nonlinearities, including the utilization of electro-optic modulators such as Mach–Zehnder interferometers (MZIs) or microrings [17,18]. Moreover, some strategies have been proposed to achieve optical nonlinear activation functions in all-optical neural networks with the aim of mitigating energy loss during photoelectric conversion. Among them, optical nonlinearity can be categorized into one-dimensional (1D) and two-dimensional (2D) spatial nonlinearity. The 1D nonlinearities refer to the realization of optical single-point nonlinear activation through specific materials or systems. At present, phase change materials (PCMs) [19] and saturable absorbers (SAs) [20,21] are commonly used to realize nonlinearity. PCMs exhibit two states—amorphous and crystalline—with high transmittance in the amorphous state and low transmittance in the crystalline state. In spiking neurosynaptic networks, when the input light intensity surpasses a certain threshold, PCMs change state and generate output pulses, effectively emulating the behavior of the ReLU function. Similarly, SAs are often integrated into lasers to facilitate nonlinear modulation. As the light intensity increases, the SA progressively reaches a saturated state, eventually transmitting light completely. Likewise, other materials such as periodically poled thin-film lithium niobate nanophotonic waveguides [22,23] and germanium silicon hybrid asymmetric couplers [24], can also implement optical point-to-point nonlinear activation. These aforementioned methods generally necessitate a coherent light source, which makes them difficult to use in natural scenes.

    Similar to 1D nonlinearity, each point in two-dimensional space has the same nonlinear modulation function in 2D nonlinearity. Two-dimensional graphene [25] materials have garnered considerable attention as a potential means of achieving optical nonlinearity. Graphene exhibits unique characteristics, such as low threshold, easy excitation, and strong nonlinear effects. When the light intensity exceeds the threshold, the transmittance varies with the input power, representing a nonlinear process. Besides, Ryou et al. [26] presented a free-space optical artificial neural network (ANN) with nonlinear activation facilitated by the saturable absorption of thermal atoms. Another noteworthy work by Zuo et al. [27] illustrated an all-optical neural network in which linear calculations are programmed by SLM and Fourier lenses, while nonlinear optical activation is realized by laser-cooled atoms based on electromagnetically induced transparency. In addition, vertical-cavity surface-emitting lasers (VCSELs) have been explored as nonlinear elements in photonic reservoir computing implementations [28]. With a single VCSEL neuron, a spiking neural network is built, which has a turn-on threshold to output pulses. Different from the methods above that were based on a coherent laser source to achieve nonlinear modulation, researchers have demonstrated the feasibility of using a semiconductor saturable absorber mirror (SESAM) to realize 2D spatial nonlinear modulation under an incoherent light source [29]. Nevertheless, this system requires complex devices such as a superluminescent diode (SLED), optical fibers, erbium doped fiber amplifiers (EDFAs), and Mach–Zehnder interferometers (MZIs) to ensure stable nonlinearity. In contrast, quantum dots (QDs) have proven to be an effective alternative for incoherent nonlinear implementations in recent years. Miscuglio et al. [30] set up a silicon photonic waveguide system with QD embedded, while the nonlinear modulation range is a function of input power density. Although the approach was still based on coherent light, it has experimentally substantiated the capability of quantum dots in achieving optical nonlinear mappings. Furthermore, Wang et al. [31] have proposed a multilayer nonlinear neural network based on a commercial image intensifier tube for image sensing. When light irradiates the photocathode of the image intensifier, it is converted into photoelectrons. These electrons undergo high-rate multiplication through the action of an electric field and are subsequently converted back into photons through a fluorescent quantum dot screen. Although this system can operate in natural light conditions, it necessitates using an image intensifier, along with a power supplement.

    In recent years, significant progress has been made in developing optical domain nonlinear methods. However, it is evident that fulfilling the requirements of passive low energy consumption, natural light operation, spatial two-dimensional nonlinearity, and a simple, miniaturized system concurrently remains a formidable challenge. In this paper, we demonstrate a lensless opto-electronic neural network with quantum dot (QD) nonlinear activation for visual tasks in natural scenes. Remarkably, the system does not need an additional power supply, while the convolution and nonlinear activation operations are realized only by an optical mask and QD film. This approach not only simplifies the system’s architecture but also enhances its compactness and energy efficiency. The introduction of optical nonlinearity into our opto-electronic neural network plays a role in improving the accuracy of the corresponding visual tasks.

    2. NONLINEAR OPTICAL ACTIVATION FUNCTION

    Quantum dots are semiconductor nanocrystals, typically measuring between 2 and 10 nm in size. During the photoluminescence process of quantum dots, external photons excite the electrons in the quantum dots. The electrons can directly jump to the valence band and then recombine with holes to emit light. Alternatively, electrons might first form excitons by combining with holes and then transition to the ground state, resulting in radiative luminescence. During this process, defects, impurities, and lattice thermal vibrations will increase nonradiative recombination and reduce luminous efficiency.

    The nonlinear optical properties of perovskite materials are generally reflected in phenomena such as saturation absorption and two-photon (multi-photon) absorption [32,33]. They have a variety of forms, including single crystals, polycrystalline films, quantum dots, etc. This article proposes a nonlinear optical system based on perovskite quantum dots, in which the relationship between excitation power and quantum dot luminescence intensity is similar to the ReLU function. When the excitation power surpasses a certain value, the detector begins to respond to the luminous intensity of the material. After that, as the excitation power increases, the quantum dot photoluminescent light intensity increases correspondingly.

    The excitons formed after material excitation have exciton binding energies close to room temperature thermal energy, and they are easily dissociated into free carriers. Since the electron–hole recombination rate of perovskite is slow, the excited carriers may first be captured by nonradiative recombination centers. Meanwhile, as the excitation power increases, the carrier density is higher, and radiative recombination becomes dominant. In carrier recombination dynamics, the quantum efficiency gradually increases to saturation as the excitation light intensity increases, and the increase process may correspond to a change in excitation intensity of 2–3 orders of magnitude [34]. Limited by the luminous intensity of the material and the sensitivity of the photodetectors, the detectors cannot respond to luminescence when the excitation power is low. The above reasons may jointly lead to the nonlinear process of quantum dot luminescence.

    3. MEASUREMENT OF NONLINEAR ACTIVATION CURVE

    The quantum dots material we use in the experiment is (C9NH20)7(ZnCl2)2(Pb3Cl11)9, which comprises metal halide quantum dots with a perovskite zero-dimensional structure, and its crystal structure is similar to perovskite (hereafter referred to as perovskite quantum dots).

    The quantum dot luminescence spectrum is shown in Fig. 1. In order to distinguish the excitation light and emitted light, the wavelength of excitation light is set as 365 nm. Meanwhile, in the experiment, the center wavelength of excitation light is 405 nm, and the perovskite quantum dots emit green light with a center wavelength of 512 nm.

    Photoluminescence of (C9NH20)7(ZnCl2)2(Pb3Cl11)9 QD film. (a) Photoluminescence emission spectra. 1:9:2 and 1:9:0.5 are the molar ratios of perovskite quantum dot components. (b), (c) Comparison of 1:9:2 (b) and 1:9:0.5 (c) quantum dot films under sunlight and UV light irradiation.

    Figure 1.Photoluminescence of (C9NH20)7(ZnCl2)2(Pb3Cl11)9 QD film. (a) Photoluminescence emission spectra. 1:9:2 and 1:9:0.5 are the molar ratios of perovskite quantum dot components. (b), (c) Comparison of 1:9:2 (b) and 1:9:0.5 (c) quantum dot films under sunlight and UV light irradiation.

    In the experiment, we use an LED array with 405 nm center wavelength to measure the nonlinear curve and project images of visual tasks. The pixel size of the array is 32×32, and the luminous intensity of each source point on the LED array is adjustable. First, light up a single source point on the LED array. The quantum dots absorb the purple light emitted by the source point, and green fluorescence is excited. The purple light intensity captured by the sensor is used as the excitation light intensity (input light intensity of the curve), while the measured green fluorescent light intensity is used as the luminous intensity (output light intensity of the curve). The fitting function relationship between the input and output light intensity is the activation curve of the quantum dot thin film.

    The activation curves of the quantum dots are shown in Fig. 2, formulated with component molar ratio of PbCl2, C9H20ClN, ZnCl2 at 1:9:2 and 1:9:0.5, respectively. As we can see in Fig. 2, the fitting function relationship of the quantum dots is nonlinear, which is consistent with the principle discussed above. At lower input light intensities, the quantum dots remain virtually nonluminescent on a macroscopic scale. However, once the input light intensity crosses a specific threshold, the excitation light intensity of the quantum dots demonstrates an approximately linear relationship with the input light intensity.

    Fitting nonlinear curves of quantum dots with different component molar ratios. (a) 1:9:2. (b) 1:9:0.5.

    Figure 2.Fitting nonlinear curves of quantum dots with different component molar ratios. (a) 1:9:2. (b) 1:9:0.5.

    4. JOINT OPTIMZATION OF THE OPTO-ELECTRONIC NEURAL NETWORK WITH QUANTUM DOT NONLINEARITY

    In electrical neural networks, nonlinear activation functions can make the network more powerful. Beyond accelerating convergence and improving task accuracy, they augment the network’s capacity to learn intricate data and represent sophisticated nonlinear mappings between inputs and outputs. As such, nonlinear activation functions play a crucial and indispensable role in neural networks.

    The opto-electronic neural network architecture based on quantum dot nonlinearity is shown in Fig. 3. In our previous research, we introduced the concept of the feature size Δ, defining it as the spatial size of a single pixel on the optical mask [35]. It has been verified that when the size of the light source, the feature size, and the distance between the light source, mask, and detector satisfy a certain relationship, the convolution operation can be completed in the optical domain. At this time, the image captured by the sensor is the feature map after the convolution calculation. When the quantum dot film is close to the optical mask in the above linear system, the convolution result of the light source and the mask will be transformed nonlinearly through the quantum dot film. Consequently, the light intensity distribution captured by the sensor is the convolution result following nonlinear activation mapping.

    Prototype for the lensless opto-electrical neural network system with optical nonlinearity. (a) Optical path diagram of the system. (b) Corresponding network structure diagram of the system.

    Figure 3.Prototype for the lensless opto-electrical neural network system with optical nonlinearity. (a) Optical path diagram of the system. (b) Corresponding network structure diagram of the system.

    The joint optimization process of the whole system link is shown in Fig. 4. The entire link consists of four components: the optical mask, which completes the convolution calculation, the quantum dots utilized for nonlinear activation, the sensor capturing the feature map following convolution and nonlinear calculation, and a digital processor executing subsequent electrical network operations. In pursuit of maximizing the overall network classification accuracy, the whole link is optimized in an end-to-end fashion. The nonlinear activation curve is used to be a fixed nonlinear activation module between the optical convolution part and the electrical neural network (suffix network). Subsequently, the loss function is calculated based on the captured nonlinearly activated feature map. All network parameters, except the nonlinear layer parameters, are simultaneously optimized through the loss function in the joint optimization process. Finally, the optical convolution kernel pattern is obtained.

    Joint optimization process based on quantum dot nonlinearity.

    Figure 4.Joint optimization process based on quantum dot nonlinearity.

    5. LENSLESS OPTO-ELECTRONIC NEURAL NETWORK WITH NONLINEARITY FOR APPLICATION

    A. Machine Learning Visual Tasks

    In this section, we numerically characterized the performance of the quantum dot nonlinearity on the visual tasks. We first chose the classification of the hand-drawn images. To this end, we opted for 10 classes from the QuickDraw image dataset. As illustrated in Fig. 5(b), the input image, sized at 28×28 pixels, was projected onto an LED array positioned before the optical mask. After the optical convolution calculation, the results were nonlinearly activated by quantum dot film, and the nonlinear feature map was captured by the sensor. The pixel size of the input for the electrical network was 26×26, while the electrical network was a fully convolutional architecture. The network contained three 5×5 convolution layers, one 1×1 convolution layer, and a maximum pooling layer, and the output of the network is 10 units, aligning with the 10 selected classes for classification.

    Visual tasks and the corresponding networks used in the experiment. Among them, the left side of (a)–(c) is part of the visual task dataset, the middle is the optical mask pattern optimized for the corresponding task, and the right side is the neural network architecture of the corresponding visual task. (a) Hand sign classification. (b) Hand drawn image classification. (c) Traffic sign classification.

    Figure 5.Visual tasks and the corresponding networks used in the experiment. Among them, the left side of (a)–(c) is part of the visual task dataset, the middle is the optical mask pattern optimized for the corresponding task, and the right side is the neural network architecture of the corresponding visual task. (a) Hand sign classification. (b) Hand drawn image classification. (c) Traffic sign classification.

    In addition to the hand-drawn image classification, we also conducted experiments for two other visual tasks: hand sign and traffic sign classification, as depicted in Fig. 5. Unlike the previous task, these images are grayscale representations captured from actual scenes, making their classification more challenging. As for the hand sign classification task in Fig. 5(a), the pixel size of the images was reshaped to 28×28. The first 3×3 convolution layer and nonlinear activation were operated on the light field, while the suffix electrical network consisted of 676 input vectors and a linear activation for the output of 4 units. Meanwhile, for the traffic sign classification task in Fig. 5(c), the input images were resized to 20×20. After passing through the optical convolution layer and the quantum dot nonlinear activation layer, the input size of the electrical network was 18×18. The architecture included three 5×5 convolution layers and a maximum pooling layer, and finally output 10 classification results through a 1×1 convolution layer.

    B. Nonlinear Activation Opto-Electrical Neural Network Experiment

    As shown in the system prototype in Fig. 3, the images were displayed on the LED array, situated 10–20 cm from the optical mask. The QD film was positioned close to the mask, maintaining a minute separation of approximately 1 mm. The sensor was also positioned about 1 mm from the QD film. Inspired by our previous work, the image signal processing (ISP) process includes many links such as demosaic, white balance, color correction, and tone mapping. However, only a few parameters can be adjusted in the process, and these processes are primarily optimized for human perception rather than task-oriented optimization. So it can be concluded that the ISP process of the sensor is not necessary for the whole pipeline. Therefore, we directly collected raw data without any demosaic processing, using the G channel of the raw data for subsequent analysis. A few images were rendered for human comprehension, as depicted in Fig. 6(a). We qualitatively compared the rendered images before and after adding nonlinearity from raw data. It can be seen in Fig. 6(b), when adding QD optical nonlinearity, the relative intensity of the original high light intensity part in the image does not change much, while the gray value of the original low intensity part decreases significantly. The phenomenon above is consistent with the measured nonlinear curve of QD in Fig. 2. As for quantitative comparison in Fig. 6(b), although the overall trend remained consistent, there existed a certain discrepancy between the image produced by directly adding the measured nonlinearity and the image obtained from the nonlinear system. For example, the “Nonlinear_exp” images measured have higher peak widths than the “Nonlinear_sim” images, and the trough values do not drop to 0. This discrepancy can be attributed to the noise of the imaging sensor, the blur caused by the quantum dot scattering process, system correction error, and minute variations in nonlinearity across different positions.

    Captured images. (a) Captured images of three visual tasks, which were rendered from raw data. (b) Comparison of images between linear and nonlinear systems. “Linear_exp” refers to the images captured in the linear system experiment, “Nonliear_sim” refers to the result of adding the measured QD nonlinearity to the images captured in the linear system experiment, and “Nonlinear_exp” refers to the images captured in the nonlinear system experiment. The red box in the figure is the enlarged part; the enlarged image is in the bottom-right corner. The “comparison” figures on the right side represent the comparisons of pixel values at the yellow line position of the corresponding three images.

    Figure 6.Captured images. (a) Captured images of three visual tasks, which were rendered from raw data. (b) Comparison of images between linear and nonlinear systems. “Linear_exp” refers to the images captured in the linear system experiment, “Nonliear_sim” refers to the result of adding the measured QD nonlinearity to the images captured in the linear system experiment, and “Nonlinear_exp” refers to the images captured in the nonlinear system experiment. The red box in the figure is the enlarged part; the enlarged image is in the bottom-right corner. The “comparison” figures on the right side represent the comparisons of pixel values at the yellow line position of the corresponding three images.

    The image classification results of the opto-electrical neural network with nonlinear activation are presented in Fig. 7. Throughout the simulation of three visual tasks, the incorporation of the quantum dot nonlinear layer led to an average increase in classification accuracy by 4.43%. Both the linear and nonlinear networks utilized a joint optimization strategy to enhance task accuracy. In our previous work [35], we demonstrated LOEN architecture, which can complete optical convolution and some image classification tasks. Comparatively, the classification accuracy achieved in the single-kernel optical convolution experiment was slightly lower than the simulation, with an average difference of 2%. Therefore, the linear system experiment was not repeated in this paper, and we directly referred to the simulation data and the conclusion of the accuracy difference between the simulation and experiment in LOEN [35].

    Results for optical nonlinear system. (a) Experimental recognition accuracy of hand sign, hand drawn image, and traffic sign classification tasks. (b) Confusion matrices of the three visual tasks based on experimental raw data. The abscissa is “predicted labels,” and the ordinate is “true labels.”

    Figure 7.Results for optical nonlinear system. (a) Experimental recognition accuracy of hand sign, hand drawn image, and traffic sign classification tasks. (b) Confusion matrices of the three visual tasks based on experimental raw data. The abscissa is “predicted labels,” and the ordinate is “true labels.”

    In the subsequent phase, we utilized the optical mask optimized by the entire neural network to construct the system and conduct the nonlinear experiments. The results of the experiment are shown in Fig. 7. The classification accuracy of hand sign, hand drawn images, and traffic sign tasks can reach 86.6%, 78.6%, and 84.74%, respectively, representing improvements of 2.16%, 3.75%, and 5.88% over the corresponding linear system simulation accuracy. If the gap between linear system simulation and experiment is taken into consideration, the accuracy improvement of the nonlinear system will be higher.

    The confusion matrices corresponding to the task classification results are shown in Fig. 7(b). It can be seen from the results that the addition of the QD nonlinear layer effectively improves the classification accuracy of each category. The average accuracy difference between the nonlinear experiments and simulations for the three visual tasks was measured at 1.46%, similar to the average difference observed between the linear system simulations and experiments. Several factors may contribute to this discrepancy. First, the inhomogeneity of the quantum dot film can yield disparities in the nonlinear activation functions of neurons at different positions within the network. Second, the quantum dot scattering process is relatively complex, potentially leading to a certain degree of blurring effect on the results of optical convolution, thereby influencing the accuracy of visual tasks.

    6. DISCUSSION

    In this study, we demonstrated a novel nonlinear opto-electrical neural network capable of efficiently handling various visual tasks. Unlike traditional approaches that rely solely on electrical processors for nonlinear activation, our architecture utilized the quantum dot film to achieve nonlinear activation directly on the feature map. Meanwhile, the convolution operation is efficiently performed by the optical mask. Compared to the other optical neural networks with optical nonlinearity, the method allows us to achieve passive spatial nonlinear activation, eliminating the need for additional electrical calculations and maintaining a lensless system. As a result, the overall size and energy consumption of the system are significantly reduced.

    Similar to our previous work, the entire pipeline is simplified without imaging and activation, and the whole architecture, including the optical and electrical parts, is task-oriented jointly optimized. The joint optimization strategy enhances the overall performance of the opto-electronic neural network, making it more effective in real-world applications. When considering the entire pipeline, it is essential to evaluate the energy demands not only of the subsequent electrical neural network computations but also of the sensor detection phase. This latter segment is predominantly governed by the energy costs of photosensitive detection and the ISP. The two parts have comparable energy cost. Due to the whole pipeline being optimized facing the task accuracy, the ISP process was removed from the experiments, which resulted in a nearly 50% reduction in energy consumption in the sensor detection part.

    The exploration in this paper is focused on a specific type of perovskite quantum dot as a nonlinear material. Compared with other optical nonlinear implementation methods, the main advantages of this method are that no additional power supply is required, the system is simple and miniaturized, and the light source is incoherent light, making it suitable for use in low-power edge devices. Future investigations should diversify by evaluating the nonlinear characteristics of other materials to enhance the accuracy of various machine vision tasks. On the other hand, it is necessary to improve the luminous conversion efficiency of quantum dots as much as possible to reduce energy loss and optimize the overall energy efficiency of the opto-electrical neural network. As for the multilayer nonlinearity cascade, it can be realized in the following ways in the future: adding other quantum dot materials [36], adjusting the size of the quantum dot material to change the band gap to further expand the photoluminescence wavelength range [37], and changing the composition ratio of the same quantum dot components to adjust the photoluminescence properties [38]. It is worth noting that because the light intensity is attenuated during transmission and there is no power supply in the system architecture, the system cannot complete light regeneration or light amplification between layers. The above factors will limit the number of cascadable quantum dot nonlinear layers.

    The opto-electrical neural network with only one optical convolution layer and one passive optical nonlinear activation layer was demonstrated. Similar to other optical neural networks [39,40], only a single nonlinear layer has limited improvement in the accuracy of visual tasks. However, optical nonlinearity provides ideas and methods for all-optical multilayer neural networks. Efficient all-optical multilayer neural networks are more promising when combined with optical nonlinearity. Apart from the quantum dot films used in the experiment, as long as the device has an optical-to-electrical-to-optical (OEO) or optical-to-optical (O-O) conversion function, it can be used to complete nonlinear modulation and cascade the next layer of the optical network. Considering the attenuation during light transmission, a power supply can be added to achieve light amplification or light regeneration. The power supply can further increase the number of layers of the nonlinear optical neural networks and help them be applied to various complex visual scenes. By expanding the optical nonlinear model, it becomes possible to propose a multilayer opto-electronic neural network with nonlinear activation, paving the way for the realization of an all-optical neural network with interlayer nonlinearity. In the future, we would focus on developing efficient multilayer optical neural networks, leveraging the power of optical nonlinearity to perform complex computations without relying on traditional electrical components. The all-optical neural network can complete specific tasks under natural incoherent light. There is theoretically no power consumption in the architecture, while the system size and weight can be minimized. The architecture is highly attractive for applications in autonomous driving, smart homes, and other real-world scenarios. With reduced energy consumption and compact design, the all-optical neural network can revolutionize various industries, offering a wide range of application scenarios and contributing to developing more energy-efficient and powerful artificial intelligence systems.

    7. MATERIALS AND METHODS

    A. Preparation of Quantum Dot Film

    The (C9NH20)7(ZnCl2)2(Pb3Cl11)9 quantum dot films were fabricated following the in situ formation process. The fabrication of (C9NH20)7(ZnCl2)2(Pb3Cl11)9/PVDF was as follows. A precursor solution was prepared by mixing 0.25 mmol PbCl2 (lead chloride, 99.99%, Meryer Shanghai), 2.25 mmol C9H20ClN (1-butyl-1-methylpyrrolidine chloride, 99%, Meryer Shanghai), 0.5 mmol or 0.125 mmol ZnCl2 (zinc chloride, 99.95%, Meryer Shanghai), and 0.5 g PVDF [250,000 (molecular mass), Meryer Shanghai] powder in 4.5 mL DMSO. The mixtures were stirred for 4 h at 70°C until completely dissolved, and the (C9NH20)7(ZnCl2)2(Pb3Cl11)9 precursor solution was obtained. The precursor solution was spin coated on a glass lens substrate at 2000 r/min (1000 r/min) for 30 s and baked at 130°C for 5 min to fabricate (C9NH20)7(ZnCl2)2(Pb3Cl11)9 quantum dot films.

    B. Mask Fabrication

    The mask is fabricated by photolithography on a chrome-coated glass substrate. This procedure encompasses photolithography, development, etching, and demolding, among other steps. The pattern of the optical mask is fixed while it is obtained by the optimization of the corresponding network. The reasons for choosing the fixed mask are as follows. Primarily, the compact size of the mask ensures a lensless system. Second, the fixed mask has higher contrast compared with the dynamic mask using a spatial light modulator. This not only reduces energy loss in the system but also heightens the accuracy for specific tasks. The detailed discussion has already been made in our previous work [35].

    C. Dataset Processing and Neural Network Training

    All the images of the experiments are grayscale and resized to match the corresponding network. Converting these images to binary might further enhance the accuracy for specific tasks. The networks are trained and tested on a work-station with a 2-GHz Intel Xeon Gold 6138 central processing unit (CPU) (32 GB RAM) and one Nvidia GeForce RTX 3090 GPU while using the PyTorch framework.

    References

    [1] Y. LeCun, Y. Bengio, G. Hinton. Deep learning. Nature, 521, 436-444(2015).

    [2] G. Barbastathis, A. Ozcan, G. Situ. On the use of deep learning for computational imaging. Optica, 6, 921-943(2019).

    [3] A. Sinha, J. Lee, S. Li. Lensless computational imaging through deep learning. Optica, 4, 1117-1125(2017).

    [4] M. Al-Qizwini, I. Barjasteh, H. Al-Qassab. Deep learning algorithm for autonomous driving using googlenet. IEEE Intelligent Vehicles Symposium (IV), 89-96(2017).

    [5] T. Young, D. Hazarika, S. Poria. Recent trends in deep learning based natural language processing. IEEE Comput. Intell. Mag., 13, 55-75(2018).

    [6] Q. Shi, Z. Zhang, T. He. Deep learning enabled smart mats as a scalable floor monitoring system. Nat. Commun., 11, 4609(2020).

    [7] H. Y. Xiong, B. Alipanahi, L. J. Lee. The human splicing code reveals new insights into the genetic determinants of disease. Science, 347, 1254806(2015).

    [8] M. Helmstaedter, K. L. Briggman, S. C. Turaga. Connectomic reconstruction of the inner plexiform layer in the mouse retina. Nature, 500, 168-174(2013).

    [9] G. Wetzstein, A. Ozcan, S. Gigan. Inference in artificial intelligence with deep optics and photonics. Nature, 588, 39-47(2020).

    [10] J. Li, D. Mengu, Y. Luo. Class-specific differential detection in diffractive optical neural networks improves inference accuracy. Adv. Photonics, 1, 046001(2019).

    [11] M. S. S. Rahman, J. Li, D. Mengu. Ensemble learning of diffractive optical networks. Light Sci. Appl., 10, 14(2021).

    [12] Y. Shen, N. C. Harris, S. Skirlo. Deep learning with coherent nanophotonic circuits. Nat. Photonics, 11, 441-446(2017).

    [13] X. Xu, M. Tan, B. Corcoran. 11 tops photonic convolutional accelerator for optical neural networks. Nature, 589, 44-51(2021).

    [14] J. Chang, V. Sitzmann, X. Dun. Hybrid optical-electronic convolutional neural networks with optimized diffractive optics for image classification. Sci. Rep., 8, 12324(2018).

    [15] T. Zhou, X. Lin, J. Wu. Large-scale neuromorphic optoelectronic computing with a reconfigurable diffractive processing unit. Nat. Photonics, 15, 367-373(2021).

    [16] M. Miscuglio, Z. Hu, S. Li. Massively parallel amplitude-only fourier neural network. Optica, 7, 1812-1819(2020).

    [17] M. M. P. Fard, I. A. Williamson, M. Edwards. Experimental realization of arbitrary activation functions for optical neural networks. Opt. Express, 28, 12138-12148(2020).

    [18] M. S. Nezami, T. F. de Lima, M. Mitchell. Packaging and interconnect considerations in neuromorphic photonic accelerators. IEEE J. Sel. Top. Quantum. Electron., 29, 6100311(2022).

    [19] J. Feldmann, N. Youngblood, C. D. Wright. All-optical spiking neurosynaptic networks with self-learning capabilities. Nature, 569, 208-214(2019).

    [20] M. A. Nahmias, B. J. Shastri, A. N. Tait. A leaky integrate-and-fire laser neuron for ultrafast cognitive computing. IEEE J. Sel. Top. Quantum. Electron., 19, 1800212(2013).

    [21] C. Teng, J. Zou, X. Tang. MOS2 as nonlinear optical material for optical neural networks. IEEE J. Sel. Top. Quantum. Electron., 29, 5101007(2023).

    [22] G. H. Li, R. Sekine, R. Nehra. All-optical ultrafast relu function for energy-efficient nanophotonic deep learning. Nanophotonics, 12, 847-855(2022).

    [23] Q. Guo, R. Sekine, L. Ledezma. Femtojoule femtosecond all-optical switching in lithium niobate nanophotonics. Nat. Photonics, 16, 625-631(2022).

    [24] H. Li, B. Wu, W. Tong. All-optical nonlinear activation function based on germanium silicon hybrid asymmetric coupler. IEEE J. Sel. Top. Quantum. Electron., 29, 8300106(2022).

    [25] G.-K. Lim, Z.-L. Chen, J. Clark. Giant broadband nonlinear optical absorption response in dispersed graphene single sheets. Nat. Photonics, 5, 554-560(2011).

    [26] A. Ryou, J. Whitehead, M. Zhelyeznyakov. Free-space optical neural network based on thermal atomic nonlinearity. Photonics Res., 9, B128-B134(2021).

    [27] Y. Zuo, B. Li, Y. Zhao. All-optical neural network with nonlinear activation functions. Optica, 6, 1132-1137(2019).

    [28] D. Owen-Newns, J. Robertson, M. Hejda. Ghz rate neuromorphic photonic spiking neural network with a single vertical-cavity surface-emitting laser (VCSEL). IEEE J. Sel. Top. Quantum. Electron., 29, 1500110(2022).

    [29] A. Dejonckheere, F. Duport, A. Smerieri. All-optical reservoir computer based on saturation of absorption. Opt. Express, 22, 10868-10881(2014).

    [30] M. Miscuglio, A. Mehrabian, Z. Hu. All-optical nonlinear activation function for photonic neural networks. Opt. Mater. Express, 8, 3851-3863(2018).

    [31] T. Wang, M. M. Sohoni, L. G. Wright. Image sensing with multilayer nonlinear optical neural networks. Nat. Photonics, 17, 408-415(2023).

    [32] R. Zhang, J. Fan, X. Zhang. Nonlinear optical response of organic–inorganic halide perovskites. ACS Photonics, 3, 371-377(2016).

    [33] W. Chen, S. Bhaumik, S. A. Veldhuis. Giant five-photon absorption from multidimensional core-shell halide perovskite colloidal nanocrystals. Nat. Commun, 8, 15198(2017).

    [34] X.-K. Liu, W. Xu, S. Bai. Metal halide perovskites for light-emitting diodes. Nat. Mater., 20, 10-21(2021).

    [35] W. Shi, Z. Huang, H. Huang. Loen: lensless opto-electronic neural network empowered machine vision. Light Sci. Appl., 11, 121(2022).

    [36] D. Li, E. Kou, W. Li. Oxidation-induced quenching mechanism of ultrabright red carbon dots and application in antioxidant RCDS/PVA film. Chem. Eng. J., 425, 131653(2021).

    [37] F. P. García de Arquer, D. V. Talapin, V. I. Klimov. Semiconductor quantum dots: technological progress and future challenges. Science, 373, eaaz8541(2021).

    [38] L. N. Quan, F. P. García de Arquer, R. P. Sabatini. Perovskites for light emission. Adv. Mater., 30, 1801996(2018).

    [39] S. Colburn, Y. Chu, E. Shilzerman. Optical frontend for a convolutional neural network. Appl. Opt., 58, 3179-3186(2019).

    [40] L. Huang, Q. A. Tanguy, J. E. Fröch. Photonic advantage of optical encoders. Nanophotonics(2023).

    Wanxin Shi, Xi Jiang, Zheng Huang, Xue Li, Yuyang Han, Sigang Yang, Haizheng Zhong, Hongwei Chen. Lensless opto-electronic neural network with quantum dot nonlinear activation[J]. Photonics Research, 2024, 12(4): 682
    Download Citation