Genetic-algorithm-based deep neural networks for highly efficient photonic device design

Yangming Ren; Lingxuan Zhang; Weiqiang Wang; Xinyu Wang; Yufang Lei; Yulong Xue; Xiaochen Sun; Wenfu Zhang

doi:10.1364/PRJ.416294

Abstract

While deep learning has demonstrated tremendous potential for photonic device design, it often demands a large amount of labeled data to train these deep neural network models. Preparing these data requires high-resolution numerical simulations or experimental measurements and cost significant, if not prohibitive, time and resources. In this work, we present a highly efficient inverse design method that combines deep neural networks with a genetic algorithm to optimize the geometry of photonic devices in the polar coordinate system. The method requires significantly less training data compared with previous inverse design methods. We implement this method to design several ultra-compact silicon photonics devices with challenging properties including power splitters with uncommon splitting ratios, a TE mode converter, and a broadband power splitter. These devices are free of the features beyond the capability of photolithography and generally in compliance with silicon photonics fabrication design rules.

1. INTRODUCTION

The advent of photonic integrated circuits (PICs) is expected to revolutionize many industries such as optical interconnects [1 - 3], optical sensing [3 - 6], and optical computing [7, 8]. The progress of PICs demands compact photonic devices with a variety of properties to increasingly integrate more functionality on a chip. The property of a photonic device is highly determined by its geometry, which is usually designed and optimized by an experienced photonic designer based on intuition and numerical simulations. This traditional method is limited by designers’ experience and usually only applies to the designs with simple geometric structures. However, complex geometries are often required for realizing ultra-compact photonic devices with challenging properties. The design of these complex structures with a large number of parameters is beyond the capability of human intuition and demands more efficient methods. The brute force optimization method is impractical, as the numerical simulation algorithms, such as the finite-difference time-domain (FDTD) method, are very computation intensive and time consuming. Traditional regression-based optimization methods face difficulties in convergence due to the high-dimensional parameter space. A variety of inverse design methods such as level set methods, adjoint method, local-optimization techniques, and direct-binary search [9 - 19] have been developed to tackle this issue. These methods expand the design space and realize many photonic devices with complex and nonintuitive structures that present extraordinary properties. Fundamentally, these algorithms are rule-based approaches containing iterative searching steps in a case-by-case manner, often relying on numerical simulations in each step to produce intermediate results that help to modify the searching strategy. Such stochastic algorithms are limited by their random-search nature and are insufficient for complex designs in a multi-constrained problem [20 - 22].

This challenge may be addressed by deep learning (DL) and more specifically deep neutral networks (DNNs), which use many processing layers to learn the representations of data with multiple levels of abstraction [23]. DNN has been revolutionizing many fields, including image analysis [24], natural language processing [25], materials science [26], quantum physics [27], and so on. Neural network (NN)-based methods have previously been adopted in studying photonic devices such as phase delay structures [28], photonic crystal nanocavities [29], and nanophotonic particles [30]. However, applying a DNN to photonic device design faces a practical challenge of preparing a large training dataset [20]. For example, Ma et al. used an NN model trained on 30,000 device designs for a chiral metamaterial inverse design [31]; Tahersima et al. used an NN model trained on 20,000 designs to realize a power splitter with specific splitting ratios [32]. These large amounts of design data are generated by high-resolution 3D FDTD simulations and may take days to complete on a typical workstation.

2.5 μm \times 1.5 μm

Sign up for Photonics Research TOC. Get the latest issue of Photonics Research delivered right to you！Sign up now

2. ALGORITHM

The GDNN-based inverse design algorithm developed in this work includes encoding, selection, and DNN-based forward and inverse design processes. The workflow of the algorithm is exhibited in Fig. 1 . First, the device models and numerical simulation results of the initial device designs are encoded into a dataset called population (step A of Fig. 1). In mimicking the natural selection process, the population is selected with respect to an objective function (step B of Fig. 1). The whole population is also used to generate the weight parameters in the DNN models of both the forward design and the inverse design [step C (1) of Fig. 1]. The selection process produces an adaptive (good performance) generation as the parents to reproduce their offspring through the inverse design process [step C (2) of Fig. 1]. These offspring are then added to the initial population (step D of Fig. 1) and update the weights in the DNN models to start the next iteration. This process is repeated until the DNN model meets the required optimization target to the investigated design problem.

Workflow of the GDNN algorithm developed in this paper.

Figure 1.Workflow of the GDNN algorithm developed in this paper.

A. Encoding

R_{i}^{1} = {R_{0}^{1}, R_{1}^{1}, \dots, R_{n}^{1}}, R_{i}^{2} = {R_{0}^{2}, R_{1}^{2}, \dots, R_{n}^{2}}

Figure 2.Encoding process that uses polar vectors and design rule constrains as a parameter vector to describe the design of a given photonic device.

B. Selection and DNN-Based Evolution

The population selection process is a prerequisite for DNN-based inversed design. It evaluates the FOM of each individual in the current population and selects a subset of the population with good values to be the parents that “reproduce” the next generation of device designs. Due to the “genetic” correlation between the parents and the offspring connected by the neural networks, additional factors can be added to the selection process to intentionally guide the direction of certain features in the evolution. For example, a punishment factor on the device area can be added to guide the algorithm to favor more compact designs.

A genetic algorithm mainly includes three operations: selection, crossover, and mutation. Crossover and mutation are the key steps to produce excellent offspring. In the GDNN algorithm, we use the DNN model to replace the crossover and mutation operations of the genetic algorithm. The DNN model can be inferred in both forward and inverse ways. The forward design process can obtain the device FOM and network weight parameters in the form of a given encoded design parameter vector. The network weight parameters are the key settings for enabling the reverse design process. The inverse design process after obtaining the network weight parameters utilizes the FOM data associated with wavelength as the input to produce offspring device designs that are correlated but different than the parent. The forward and the inverse design processes are schematically shown in Fig. 3 . A fully connected multilayer DNN is used with the sigmoid activation function and back-propagation method for weight updates.

Figure 3.Schematic drawing of the DNN models of the forward and inverse design processes.

S = {x_{1}, \dots, x_{n}}

3. SIMULATION RESULTS

In order to study its capability and performance, we applied the GDNN algorithm to design several very compact Si photonics devices as examples, including low-loss power splitters with specific unequal output powers, a TE mode converter, and a broadband power splitter. All of these devices are designed based on common Si photonics platforms with silicon-on-insulator (SOI) substrates with 220 nm device silicon and a 2-3 μm buried oxide (BOX) layer. The waveguide is 500 nm in width and operated at (or around) 1550 nm wavelength. All the devices are designed according to the process design rule that no line or space feature size is smaller than 0.2 μm.

> 80 %

Figure 4.Design analyses of a power splitter with splitting ratio of 2:3: (a) the evolution of the qualified population proportion; (b) and (c) the FDTD simulation result of the best devices in the initial population and the final population; (d) the distribution of optical transmission of the initial population.

2.2 μm \times 1.8 μm

Figure 5.GDNN design examples with transmission spectrum and FDTD simulation results: (a) a 1:2 power splitter, (b) a 1:1 power splitter, (c) a TE mode converter, and (d) a broadband power splitter.

4. DISCUSSIONS AND CONCLUSION

In this work, the DNN was implemented by using PyTorch, an open source machine learning framework. The same GDNN architecture is used for all the designs. The architecture includes 10 hidden layers with the dimension of 100. There are 1000 trained data for the initial population and 50 reproduced offspring for each iteration. The algorithm is implemented on a workstation with an AMD Ryzen 3700X CPU and a Nvidia GeForce RTX 2070 GPU. It takes about 6-10 h to complete the whole design including the FDTD simulation of the 1000 initial population data and the whole genetic optimization process.

DNN has been proved to be a powerful tool for designing optical devices using an inverse design process. Previous DNN-based inverse design algorithms usually require 20,000 to 40,000 training data [31, 32], which take the majority of the design time due to the computation intensive high-resolution FDTD simulations. The use of genetic optimization reduces the number of the training data by an order of magnitude. In addition, unlike many inverse design algorithms, our GDNN algorithm does not require the initial data set to include a large amount of good performance designs because the quality of the population is iteratively improved during the evolution. In fact, we intentionally include some device designs, violating design rules that serve for regulating the constraints of the inverse design. As shown in Table 1, the GDNN uses 1000 sets of data for the initial training of the model. After that, 50 sets of offspring will be added to participate in the training of the model. It is worth mentioning that the model design with similar structural design uses the same set of initial datasets (1000 sets of data), for instance, power splitter (1:1) and broadband splitter, power splitter (1:2) and power splitter (2:3). Our GDNN algorithm completes a design task with less than 3000 design data including the 1000 initial population and the offspring of all iterations. This is an order of magnitude less than the training data for a typical previous inverse design algorithm. Table 1.

Training Data Summary of the Designs in This Work

Device Designs	Initial Data	Number of Iterations	Number of Offspring	Total Data
Power splitter (1:1)	1000	35	50	2750
Power splitter (1:2)	1000	28	50	2400
Power splitter (2:3)	1000	32	50	2600
TE mode converter	1000	30	50	2500
Broadband splitter	1000	23	50	2150

In comparison, we have investigated using an efficiency-focused generative adversarial network (GAN) model to design the same photonic devices (2:3 power splitter). The results (Fig. 6) show that greater than 10,000 device design training data are required to achieve designs with comparable performance.

Figure 6.Comparison of GAN and GDNN design results.

In conclusion, by combining DNN with genetic evolution, our GDNN method exhibits great flexibility and efficiency in designing ultra-compact photonic devices with challenging properties. Compared with previous optimization algorithms, the GDNN algorithm is more capable of complex multi-constraint and multi-objective optimization. Without relying on pixelized features in a predefined geometry and grid, our method allows design of more flexible device structures while obeying fabrication design rules. The genetic evolution nature of the algorithm greatly increases computation efficiency and reduces FDTD training data by an order of magnitude compared with previous DNN-based inverse design methods. In addition, the genetic selection process allows introduction of biases to guide certain design directions such as a small footprint. To exhibit the capability of the method, we design several Si photonics devices including power splitters with uncommon splitting ratios, a TE mode converter, and a broadband power splitter. These devices are free of the features beyond the capability of photolithography and generally in compliance with silicon photonics fabrication design rules. We believe the presented GDNN algorithm can be widely implemented in designing many complex micro- and nanophotonic structures that could not be easily realized before.

Acknowledgment

Acknowledgment. The authors thank Dr. Shiyue Hua for helpful discussion.

References

[1] M. J. R. Heck, H. W. Chen, A. W. Fang, B. R. Koch, D. Liang, H. Park, M. N. Sysak, J. E. Bowers. Hybrid silicon photonics for optical interconnects. IEEE J. Sel. Top. Quantum Electron., 17, 333-346(2011).

[2] Y. Urino, T. Usuki, J. Fujikata, M. Ishizaka, K. Yamada, T. Horikawa, T. Nakamura, Y. Arakawa. High-density and wide-bandwidth optical interconnects with silicon optical interposers [Invited]. Photon. Res., 2, A1-A7(2014).

[3] Y. Li, Y. Zhang, L. Zhang, A. W. Poon. Silicon and hybrid silicon photonic devices for intra-datacenter applications: state of the art and perspectives. Photon. Res., 3, B10-B27(2015).

[4] J. Sun, E. Timurdogan, A. Yaacobi, E. S. Hosseini, M. R. Watts. Large-scale nanophotonic phased array. Nature, 493, 195-199(2013).

[5] X. Sun, L. Zhang, Q. Zhang, W. Zhang. Si photonics for practical LiDAR solutions. Appl. Sci., 9, 4225(2019).

[6] C. V. Poulton, M. J. Byrd, M. Raval, Z. Su, N. Li, E. Timurdogan, D. Coolbaugh, D. Vermeulen, M. R. Watts. Large-scale silicon nitride nanophotonic phased arrays at infrared and visible wavelengths. Opt. Lett., 42, 21-24(2017).

[7] J. Feldmann, N. Youngblood, C. D. Wright, H. Bhaskaran, W. H. P. Pernice. All-optical spiking neurosynaptic networks with self-learning capabilities. Nature, 569, 208-214(2019).

[8] K. Vandoorne, P. Mechet, T. Van Vaerenbergh, M. Fiers, G. Morthier, D. Verstraeten, B. Schrauwen, J. Dambre, P. Bienstman. Experimental demonstration of reservoir computing on a silicon photonics chip. Nat. Commun., 5, 3541(2014).

[9] K. Xu, L. Liu, X. Wen, W. Sun, N. Zhang, N. Yi, S. Sun, S. Xiao, Q. Song. Integrated photonic power divider with arbitrary power ratios. Opt. Lett., 42, 855-858(2017).

[10] E. Khoram, X. Qian, M. Yuan, Z. Yu. Controlling the minimal feature sizes in adjoint optimization of nanophotonic devices using b-spline surfaces. Opt. Express, 28, 7060-7069(2020).

[11] J. Lu, J. Vučković. Nanophotonic computational design. Opt. Express, 21, 13351-13367(2013).

[12] C. M. Lalau-Keraly, S. Bhargava, O. D. Miller, E. Yablonovitch. Adjoint shape optimization applied to electromagnetic design. Opt. Express, 21, 21693-21701(2013).

[13] B. Shen, P. Wang, R. Polson, R. Menon. An integrated-nanophotonics polarization beamsplitter with 2.4 × 2.4 μm² footprint. Nat. Photonics, 9, 378-382(2015).

[14] D. Vercruysse, N. V. Sapra, L. Su, R. Trivedi, J. Vučković. Analytical level set fabrication constraints for inverse design. Sci. Rep., 9, 8999(2019).

[15] A. Y. Piggott, E. Y. Ma, L. Su, G. H. Ahn, N. V. Sapra, D. Vercruysse, A. M. Netherton, A. S. P. Khope, J. E. Bowers, J. Vučković. Inverse-designed photonics for semiconductor foundries. ACS Photon., 7, 569-575(2020).

[16] A. Y. Piggott, J. Lu, K. G. Lagoudakis, J. Petykiewicz, T. M. Babinec, J. Vucković. Inverse design and demonstration of a compact and broadband on-chip wavelength demultiplexer. Nat. Photonics, 9, 374-377(2015).

[17] C. Y. Kao, S. Osher, E. Yablonovitch. Maximizing band gaps in two-dimensional photonic crystals by using level set methods. Appl. Phys. B, 81, 235-244(2005).

[18] K. Wang, X. Ren, W. Chang, L. Lu, D. Liu, M. Zhang. Inverse design of digital nanophotonic devices using the adjoint method. Photon. Res., 8, 528-533(2020).

[19] J. C. C. Mak, C. Sideris, J. Jeong, A. Hajimiri, J. K. S. Poon. Binary particle swarm optimized 2 × 2 power splitters in a standard foundry silicon photonic platform. Opt. Lett., 41, 3868-3871(2016).

[20] W. Ma, Z. Liu, Z. A. Kudyshev, A. Boltasseva, W. Cai, Y. Liu. Deep learning for the design of photonic structures. Nat. Photonics, 15, 77-90(2020).

[21] W. Li, F. Meng, Y. Chen, Y. Fan Li, X. Huang. Topology optimization of photonic and photonic crystals and metamaterials: a review. Adv. Theor. Simul., 2, 1900017(2019).

[22] S. Molesky, Z. Lin, A. Y. Piggott, W. Jin, J. Vucković, A. W. Rodriguez. Inverse design in nanophotonics. Nat. Photonics, 12, 659-670(2018).

[23] Y. Lecun, Y. Bengio, G. Hinton. Deep learning. Nature, 521, 436-444(2015).

[24] B. A. Krizhevsky, I. Sutskever, G. E. Hinton. Image net classification with deep convolutional neural networks. Commun. ACM, 60, 84-90(2012).

[25] T. Young, D. Hazarika, S. Poria, E. Cambria. Recent trends in deep learning based natural language processing. IEEE Comput. Intell. Mag., 13, 55-75(2018).

[26] Y. Mao, Q. He, X. Zhao. Designing complex architectured materials with generative adversarial networks. Sci. Adv., 6, eaaz4169(2020).

[27] G. Carleo, M. Troyer. Solving the quantum many-body problem with artificial Neural networks. Science, 355, 602-606(2007).

[28] D. Liu, Y. Tan, E. Khoram, Z. Yu. Training deep neural networks for the inverse design of nanophotonic structures. ACS Photon., 5, 1365-1369(2018).

[29] T. Asano, S. Noda. Optimization of photonic crystal nanocavities based on deep learning. Opt. Express, 26, 32704-32717(2018).

[30] J. Peurifoy, Y. Shen, L. Jing, Y. Yang, F. Cano-Renteria, B. Delacy, M. Tegmark, J. D. Joannopoulos, M. Soljacić. Nanophotonic particle simulation and inverse design using artificial neural networks. Sci. Adv., 4, eaar4206(2018).

[31] W. Ma, F. Cheng, Y. Liu. Deep-learning-enabled on-demand design of chiral metamaterials. ACS Nano, 12, 6326-6334(2018).

[32] M. H. Tahersima, K. Kojima, T. Koike-Akino, D. Jha, B. Wang, C. Lin, K. Parsons. Deep neural network inverse design of integrated photonic power splitters. Sci. Rep., 9, 1368(2019).

[33] D. Dai, J. E. Bowers. Novel concept for ultracompact polarization splitter-rotator based on silicon nanowires. Opt. Express, 19, 10940-10949(2011).

[34] Y. Xiong, D.-X. Xu, J. H. Schmid, P. Cheben, S. Janz, W. N. Ye. Fabrication tolerant and broadband polarization splitter and rotator based on a taper-etched directional coupler. Opt. Express, 22, 17458-17465(2014).

微信扫一扫：分享

微信扫一扫：分享