Neural network modeling and prediction of HfO2 thin film properties tuned by thermal annealing

Min Gao; Chaoyi Yin; Jianda Shao; Meiping Zhu

doi:10.1017/hpl.2024.6

Abstract

Plasma-enhanced atomic layer deposition (PEALD) is gaining interest in thin films for laser applications, and post-annealing treatments are often used to improve thin film properties. However, research to improve thin film properties is often based on an expensive and time-consuming trial-and-error process. In this study, PEALD-HfO₂ thin film samples were deposited and treated under different annealing atmospheres and temperatures. The samples were characterized in terms of their refractive indices, layer thicknesses and O/Hf ratios. The collected data were split into training and validation sets and fed to multiple back-propagation neural networks with different hidden layers to determine the best way to construct the process–performance relationship. The results showed that the three-hidden-layer back-propagation neural network (THL-BPNN) achieved stable and accurate fitting. For the refractive index, layer thickness and O/Hf ratio, the THL-BPNN model achieved accuracy values of 0.99, 0.94 and 0.94, respectively, on the training set and 0.99, 0.91 and 0.90, respectively, on the validation set. The THL-BPNN model was further used to predict the laser-induced damage threshold of PEALD-HfO₂ thin films and the properties of the PEALD-SiO₂ thin films, both showing high accuracy. This study not only provides quantitative guidance for the improvement of thin film properties but also proposes a general model that can be applied to predict the properties of different types of laser thin films, saving experimental costs for process optimization.

Keywords

laser thin film laser-induced damage threshold neural network plasma-enhanced atomic layer deposition

1 Introduction

Optical thin films are key components of laser systems, and their optical properties and laser-induced damage threshold (LIDT) directly affect their output energy^[1^–3^]. Traditional preparation methods for laser thin films include electron-beam evaporation^[4^–6^] and ion-beam sputtering^[7^]. Recently, plasma-enhanced atomic layer deposition (PEALD) has attracted attention because of its precise thickness controllability^[8^], excellent conformality^[9^], low-temperature growth properties^[10^] and high LIDT^[11^]. Furthermore, post-treatment annealing improves the properties of thin films grown via PEALD^[12^]. However, owing to the diversity and wide range of process parameters, process optimization and thin film performance improvement often require extensive, expensive and time-consuming experiments.

Back-propagation neural networks (BPNNs), a subset of machine learning, have shown potential for mapping the relationship between experimental parameters and material properties^[13^,14^]. This approach can identify underlying regularities in the training data by updating the internal weight parameters^[15^,16^]. In recent years, researchers have begun to study the application of neural networks in the field of thin films to predict the growth rate^[17^–20^], hydrophobicity^[21^], permeate flux and foulant rejection^[22^]. Although these reports demonstrate the application of BPNNs in various thin films, studies on the properties of laser thin films are lacking. Furthermore, the adopted models were mainly shallow structures with single or double hidden layers. Shallow-structure neural networks can meet most modeling and prediction needs but may require a large number of neurons to accurately represent the relationship between the input and output^[23^], which increases the likelihood of errors in models^[22^]. In 2022, Mengu et al.^[24^], while studying the emerging symbiotic relationship between deep learning and optics, reported the advantages of deep neural networks with three or more hidden layers in terms of approximation and generalization capability. However, as the number of hidden layers increases, deep neural networks may suffer from poor performance or training failure owing to issues such as vanishing/exploding gradients^[25^]. Therefore, it is necessary to determine the optimal number of hidden layers for solving a special task.

In this study, we employ several BPNN models to establish the relationship between the annealing process and the properties of PEALD-HfO₂ thin films for laser applications. Firstly, comparing the performance of BPNN models with different numbers of hidden layers, it is deduced that the three-hidden-layer back-propagation neural network (THL-BPNN) performs best. The THL-BPNN model was then used to model and predict the relationship between the annealing process and the PEALD-HfO₂ thin film properties and was compared with the other two models. Finally, the LIDT of the PEALD-grown thin films and the properties of the PEALD-SiO₂ thin films were predicted using the THL-BPNN model, and the applicability of the THL-BPNN model was verified. We believe that the THL-BPNN model can help predict the properties of other laser thin films.

Sign up for High Power Laser Science and Engineering TOC. Get the latest issue of High Power Laser Science and Engineering delivered right to you！Sign up now

2 Materials and methods

2.1 Data preparation

The HfO₂ thin films used to construct the annealing process–thin film property relationship were grown on Si substrates using a commercial PEALD device (Picosun Advanced R200, Finland) with an integrated remote plasma source. HfO₂ thin films were grown by alternating exposure to the precursor tetrakis-ethylmethylamino hafnium (Hf(N(CH₃)(CH₂CH₃))₄, TEMAH) and O₂/Ar gas mixture plasma reactant at a deposition temperature of 150°C. The number of deposition cycles was 500, and the pulse sequence for each HfO₂ growth cycle was as follows: TEMAH feeding (1.6 s), N₂ purging (19 s), Ar/O₂ mixture feeding (11 s) and Ar purging (10 s). The samples were then annealed in quartz tube annealing equipment (RS 80/300/11, Nabertherm) for 3 h. The annealing process included a combination of three atmospheres (vacuum, O₂ and N₂) and six annealing temperatures (300°C to 800°C in 100°C increments). For vacuum annealing, the pressure in the tubular annealing chamber was approximately 1 × 10^–4 Pa. For O₂ and N₂ atmosphere annealing, the gas flow rate was 150 SCCM for both O₂ and N₂. The HfO₂ thin films were measured using an ellipsometer (Horiba Uvisel 2), and the thicknesses and refractive indices were extracted using the Tauc-Lorentz model in DeltaPsi2 software, neglecting the extinction coefficient (k). The O/Hf ratio of the HfO₂ thin films was analyzed using X-ray photoelectron spectroscopy (XPS) (Thermo Scientific) with a monochromatic Al Kα (1486.6 eV) X-ray source. The data used to construct the annealing process–thin film property relationship consisted of 19 samples, including 1 as-deposited sample and 18 annealed samples.

The HfO₂ thin film data used for LIDT modeling and prediction come from Ref. [26], including 12 samples treated by different annealing process parameters. Among them, six samples were annealed in an O₂ atmosphere, and the other six samples were annealed in a N₂ atmosphere. The annealing temperature ranged from 300°C to 800°C.

The SiO₂ thin film data used for property modeling and prediction come from Ref. [27], including 10 samples grown by different deposition process parameters. Among them, four samples were grown at different temperatures ranging from 50°C to 200°C, and six samples were grown with different precursor source exposure times ranging from 0.2 to 0.7 s.

Table 1 lists the detailed parameters of the datasets used to model and predict the properties of HfO₂ and SiO₂ thin films, including the refractive index, thickness and stoichiometric ratio. As the annealing temperature increases, the thickness of the HfO₂ thin film decreases and the refractive index increases. In a vacuum environment, O₂ environment and N₂ environment, the thickness of HfO₂ thin films annealed at different temperatures changes in the range of 34.7–42.7, 38.5–49.1 and 36.3–46.7 nm, respectively, while the refractive index (at 355 nm) of HfO₂ thin films annealed at different temperatures changes in the range of 1.99–2.24, 1.83–1.97 and 1.88–2.00, respectively. This means that the packing density of the HfO₂ thin film increases with increasing annealing temperature^[26^]. In addition, the O/Hf ratio of HfO₂ thin films annealed in an O₂ environment fluctuates slightly around the ideal value of 2.0. However, the O/Hf ratio of HfO₂ thin films annealed in vacuum and N₂ environments decreases with increasing annealing temperature.


	HfO₂ thin films		SiO₂ thin films

	Variables	Range	Variables	Range
Input	Annealing atmosphere^*	0–3	Deposition temperature (°C)	50–200
	Annealing temperature (°C)	0–800	Precursor exposure time (s)	0.1–0.7

Output	Refractive index(at 355 nm)	1.83–2.24	Refractive index(at 355 nm)	1.48–1.49
	Thickness (nm)	34.7–50.3	Thickness (nm)	69.0–88.1
	O/Hf ratio	1.80–2.04	O/Si ratio	1.94–2.01

Table 1. Datasets for property prediction of HfO₂ and SiO₂ thin films.

View all Tables

Table 2 lists the detailed parameters of the datasets used for LIDT modeling and prediction. Compared with PEALD-HfO₂ thin films, PEALD-SiO₂ thin films have lower absorption and impurity content. Furthermore, properties such as absorption, impurity content and stoichiometric ratio influence each other. Detailed relationships are described in Refs. [26,27]. The LIDT was tested in one-on-one mode according to ISO 21254 using a Gaussian-shape 3ω neodymium-doped yttrium aluminum garnet (Nd:YAG) laser (355 nm, 7.8 ns). The LIDT test was performed under normal incidence, and the maximum laser fluence with zero damage probability was determined as the LIDT. It is worth mentioning that the LIDT of HfO₂ thin films is lower than that of SiO₂ thin films, which is attributed to the fact that the bandgap of HfO₂ is lower than that of SiO₂.


		Range

	Variables	HfO₂	SiO₂
		thin films	thin films
Input	Type^*	1	2
	Total impurity content	5.4–13.5	0.6–1.1
	(%, atomic fraction)
	Absorption (ppm)	211–10892	3.8–5.8
	Stoichiometric ratio	1.81–2.06	1.94–2.01
Output	LIDT (J/cm²)	1.2–6.3	22.0–39.4

Table 2. Datasets for LIDT prediction of HfO₂ and SiO₂ thin films.

View all Tables

2.2 Models

Six models, namely four BPNN models with different numbers of hidden layers (single-hidden-layer BPNN, double-hidden-layer BPNN, three-hidden-layer BPNN and four-hidden-layer BPNN), a support vector machine regression (SVR) model^[28^] using a Gaussian kernel function and a linear regression (LR) model^[29^], were used to establish the correlation between the annealing process and the refractive index, layer thickness and O/Hf ratio of PEALD-HfO₂ thin films. Except for the LR model, which belongs to the category of linear regression fitting, the other models belong to the category of nonlinear regression fitting. All models performed regression fitting by training on a training set, tuning the modeling parameters to achieve the highest accuracy (i.e., lowest error) and then validating on a validation set. When constructing the relationship between the annealing process and the properties of the PEALD-HfO₂ thin films, 6 samples were randomly selected as the validation set, and the remaining 13 samples (12 annealed samples and 1 as-deposited sample) were used as the training set. When predicting the LIDT of PEALD-grown thin films, 6 samples (3 HfO₂ samples and 3 SiO₂ samples) were randomly selected as the validation set, and the remaining 16 samples (9 HfO₂ samples and 7 SiO₂ samples) were used as the training set. When predicting the properties of PEALD-SiO₂ thin films, the leave-one-out cross-validation method was adopted owing to limited data. For each test, one sample was used as a validation set, and the remaining samples were used as a training set until every sample was used as a validation set. Subsequently, the average performance deviation was calculated for each model.

Figure 1 shows a schematic of the THL-BPNN model, including an input layer (layer 0), three hidden layers (layers 1–3) and an output layer (layer 4), with each layer containing one or more neurons. The number of neurons in the input and output layers was determined by the number of input and output variables in the dataset, whereas the number of neurons in the hidden layers was initially determined using Equation (1) (an empirical formula) and finally determined by a global traversal search:

$$\begin{align}l=\sqrt{u+v}+a,\end{align}$$  (1)

where u, v and l are the numbers of neurons in the input, output and hidden layers, respectively, and a is a random number between 1 and 10.

THL-BPNN model with all neurons in adjacent layers connected, where x = [x1; x2], y1 and hij represent the input, output and intermediate processing signals, respectively.

Figure 1.THL-BPNN model with all neurons in adjacent layers connected, where x = [x₁; x₂], y₁ and h_ij represent the input, output and intermediate processing signals, respectively.

The neurons receive input signals from the previous layer and generate output signals for the next layer^[30^,31^]. For example, the first neuron in layer 1 (from top to bottom), the circle where h₁₁ is located, receives input signals, x = [x₁; x₂], from layer 0. Then x undergoes linear transformation to get the weighted sum, z, which is expressed as follows:

$$\begin{align}z={\boldsymbol{w}}^{\mathrm{T}}\boldsymbol{x}+b,\end{align}$$  (2)

where w = [w₁; w₂] $\in$ R is a weight vector between the neurons, and b

$\in$

R is a bias.

Subsequently, z passes through a nonlinear activation function $f\left(\cdotp \right)$ ^[32^], and the output signal h₁₁ is generated as follows:

$$\begin{align}{h}_{11}=f(z).\end{align}$$  (3)

These processes were performed for each neuron in each layer to form the final output signal, y₁^[33^]. Obviously, mapping from the input space to the output space is initially established through layer-by-layer information transfer.

To further improve the mapping accuracy, a training loss was constructed in the output layer, and an appropriate training algorithm is selected to update the relevant parameters (weights w and bias b) in combination with the chain rule^[34^] until the loss or the number of iterations reaches the preset threshold^[35^]. The Levenberg–Marquardt algorithm^[36^] was used to solve the nonlinear least squares problem. The hyperbolic tangent function was selected as the activation function for all hidden layers. The initialization state of each run was fixed to avoid interference from other factors.

2.3 Model specification and evaluation

2.3.1 Variable scaling

Considering that different distribution ranges of the input and output values may lead to biased assessments, Equation (4) is used to scale the input and output of the data to [–1, 1]:

$$\begin{align}{X}_\mathrm{norm}=\frac{\left({Y}_\mathrm{max}-{Y}_\mathrm{min}\right)\left(X-{X}_\mathrm{min}\right)}{X_\mathrm{max}-{X}_\mathrm{min}}+{Y}_\mathrm{min},\end{align}$$  (4)

where X is the input or output vector; X_max and X_min are the maximum and minimum values of the input or output vector, respectively; and Y_max and Y_min are the maximum and minimum values after normalization, respectively.

2.3.2 Model evaluation metrics

The coefficient of determination (R²)^[37^] was used to evaluate the overall performance of each model. The average accuracy (AA) was used to evaluate the performance of each model on a validation set with only a single sample. The root mean square error (RMSE)^[38^] was used to measure the deviation between the predicted and measured values:

$$\begin{align}{R}^2&=1-{\frac{\sum \left({Y}_i-{T}_i\right)}{\sum {\left({Y}_i-\overline{Y}\right)}^2}}^2,\end{align}$$  (5)

$$\begin{align}\mathrm{AA}&=\frac{1}{n}\sum \limits_{i=1}^n\left(1-\frac{\left|{Y}_i-{T}_i\right|}{\left|{Y}_i\right|}\right),\end{align}$$  (6)

$$\begin{align}\mathrm{RMSE}&={\left[\frac{1}{n}\sum \limits_{i=1}^n{\left({Y}_i-{T}_i\right)}^2\right]}^{1/2},\end{align}$$  (7)

where n is the size of the dataset; Y_i and T_i are the measured and predicted values of the ith sample in the dataset, respectively; and $\overline{Y}$ is the average of the measured values. A lower RMSE (close to 0) and higher R² and AA (close to 1) indicate smaller differences between the measured and predicted values.

3 Results and discussion

3.1 Analysis of the number of hidden layers of the BPNN model

The influence of the number of hidden layers in the BPNN model on the modeling accuracy was studied using the measured data of the refractive index, layer thickness and O/Hf ratio of the PEALD-HfO₂ thin films treated with different annealing process parameters. The optimal number of neurons in each hidden layer was determined by a global traversal search on the training set corresponding to the lowest mean absolute error, and then the optimal model was applied to the validation set. For the refractive index and layer thickness datasets, the total number of neurons in the BPNN model with multiple hidden layers was consistent with that of the single-hidden-layer BPNN model. For the O/Hf ratio dataset, because the optimal number of neurons in the single-hidden-layer BPNN model is only five, this value is set as the maximum number of neurons in each hidden layer in the BPNN model with multiple hidden layers. The modeling and prediction accuracies are shown in Figure 2. Overall, as the number of hidden layers increased from one to three, the difference between the R² and RMSE in the training and validation sets decreased, indicating that the model moved from inexact to exact fitting. However, as the number of hidden layers was further increased to four, the difference between the R² and RMSE in the training and validation sets increased. This may be due to the fact that the combination of neurons in each layer grows exponentially with the number of hidden layers, which introduces the risk of overfitting while potentially obtaining better solutions. The only exception is the modeling of the refractive index, where a single-hidden-layer BPNN also exhibits good performance, which could be attributed to the small variation in the properties and the uncomplicated relationship between the input and output. With the three-hidden-layer BPNN model, the R² values of the refractive index, layer thickness and O/Hf ratio were higher than 0.90 in both the training and validation sets. The THL-BPNN model was selected for the follow-up study.

$Accuracy of BPNNs with one to four hidden layers based on (a) the refractive index (at 355 nm), (b) layer thickness and (c) O/Hf ratio of PEALD-HfO2. The four columns in each subgraph represent the R2 values of the model in the training and validation sets and the RMSE values in the training and validation sets, respectively. The table indicates the number of neurons in each hidden layer of each model.$

Figure 2.Accuracy of BPNNs with one to four hidden layers based on (a) the refractive index (at 355 nm), (b) layer thickness and (c) O/Hf ratio of PEALD-HfO₂. The four columns in each subgraph represent the R² values of the model in the training and validation sets and the RMSE values in the training and validation sets, respectively. The table indicates the number of neurons in each hidden layer of each model.

3.2 Comparison of the THL-BPNN model with other models

The performance of the THL-BPNN model was further evaluated and compared with the LR and SVR models. The refractive index, layer thickness and O/Hf ratio of the HfO₂ thin films predicted by the three models were compared with the measured values, as shown in Figure 3 and Table 3. As shown in Figures 3(a), 3(d) and 3(g), the poor performance of the LR model on all three datasets indicates a nonlinear relationship between the annealing process and the thin film properties. As shown in Figures 3(b), 3(e) and 3(h), the SVR model obtains a better fit than the LR model on the layer thickness and O/Hf ratio datasets, but it still does not perform well enough on the refractive index dataset. As shown in Figures 3(c), 3(f) and 3(i), the predicted and measured values of most samples are in good agreement, particularly for the refractive index dataset, indicating that the THL-BPNN model has a high accuracy in modeling and predicting the relationship between the annealing process parameters and HfO₂ thin film properties.

$Measured and predicted (a)–(c) refractive index, (d)–(f) layer thickness and (g)–(i) O/Hf ratio of HfO2 thin films. The data in the left-hand, middle and right-hand columns are predicted by the LR model, SVR model and THL-BPNN model, respectively. The blue line (with a slope of 1) serves as a guideline for perfect prediction.$

Figure 3.Measured and predicted (a)–(c) refractive index, (d)–(f) layer thickness and (g)–(i) O/Hf ratio of HfO₂ thin films. The data in the left-hand, middle and right-hand columns are predicted by the LR model, SVR model and THL-BPNN model, respectively. The blue line (with a slope of 1) serves as a guideline for perfect prediction.


	Refractive index				Layer thickness				O/Hf ratio

		Training data		Validation data		Training data		Validation data		Training data		Validation data

	R²	RMSE	R²	RMSE	R²	RMSE	R²	RMSE	R²	RMSE	R²	RMSE
LR	0.72	0.06	0.66	0.08	0.74	2.24	0.48	2.88	0.43	0.08	0.48	0.08
SVR	0.71	0.06	0.52	0.10	0.75	2.22	0.56	2.64	0.84	0.04	0.74	0.05
THL-BPNN	0.99	0.01	0.99	0.01	0.94	1.08	0.91	1.18	0.94	0.03	0.90	0.03

Table 3. Evaluation of the LR, SVR and THL-BPNN models.

View all Tables

Table 3 lists the specific performance of all models on the training and validation sets. The THL-BPNN model performs best among the three regression models, with R² values not lower than 0.90 for the refractive index, layer thickness and O/Hf ratio datasets. High R² values and low RMSE values indicate that the THL-BPNN model can capture the patterns and extend them to unknown data. In short, the THL-BPNN model shows good stability in constructing the relationship between the annealing process and HfO₂ thin film properties under several conditions.

3.3 Evaluation of the THL-BPNN model for other thin film applications

3.3.1 Prediction of the LIDT of PEALD-HfO₂ and PEALD-SiO₂ thin films

The LIDT value is a key specification for thin films used in laser systems^[39^,40^]. Firstly, we analyzed the main factors affecting the LIDT. According to Ref. [26], the main factors affecting the LIDT of HfO₂ thin films are the C impurity content, N impurity content, absorption and O/Hf ratio. Pearson’s correlation coefficient was used to further analyze the correlation between the main influencing factors and the LIDT. The results shown in Figure 4 indicate that, except for the O/Hf ratio, which is positively correlated with the LIDT, all other parameters are negatively correlated with the LIDT. The change in the C and N impurity contents can be represented by the total impurity content. Likewise, for SiO₂ thin films, factors affecting the LIDT include the total impurity contents, absorption and O/Si ratio. Then, we applied the THL-BPNN to the quantitative prediction of the LIDT based on these factors. The total impurity contents, absorption, stoichiometric ratio and type of thin film were fed into the THL-BPNN as input variables, and the LIDT was derived as the output variable.

Figure 4.Correlations between properties of HfO₂ thin films used in this section. Blue indicates a negative correlation, whereas red indicates a positive correlation. Darker colors and larger circles indicate higher correlations. The numbers inside the circles indicate the corresponding correlation coefficients of the two features.

Furthermore, the predicted LIDT and measured LIDT of each sample are shown in Figure 5. It is observed that the THL-BPNN model performs well in both training and validation sets with high accuracy and low error, which is smaller than the relative error of the LIDT. The relative error of damage probability is about ±15%, mainly due to the uncertainty of the nonuniformity among the samples (3%), the measurement of the laser spot area (5%) and the fluctuation of laser energy (5%)^[41^]. For the training set and validation set, the R² values are 1.00 and 0.97, respectively, and the RMSE values are 0.48 and 2.32, respectively. The results show that the THL-BPNN model is effective for predicting LIDT values of HfO₂ and SiO₂ thin films.

Figure 5.Comparison of measured and predicted LIDT values on the (a) training set and (b) validation set.

3.3.2 Prediction of other properties of PEALD-SiO₂ thin films

SiO₂ is the most common low-refractive-index material used for laser thin films in the ultraviolet to near-infrared wavelength region. It is of great significance to study the correlation between the properties of SiO₂ thin films and the deposition parameters. Therefore, we applied the THL-BPNN model to evaluate the relationship between the deposition parameters and the properties of PEALD-SiO₂ thin films. Figure 6 shows the excellent performance of the THL-BPNN model in predicting the properties of PEALD-SiO₂ thin films on the validation set, including the refractive index, layer thickness and O/Si ratio. For most samples, the prediction deviation was smaller than the measurement error.

$Comparison of measured and predicted values of (a) the refractive index (at 355 nm), (b) the layer thickness and (c) the O/Si ratio for SiO2 thin films in the validation set.$

Figure 6.Comparison of measured and predicted values of (a) the refractive index (at 355 nm), (b) the layer thickness and (c) the O/Si ratio for SiO₂ thin films in the validation set.

Table 4 lists the R², AA and RMSE values of the THL-BPNN model for SiO₂ thin film properties. Except for the average R² value of the O/Si ratio on the training set of 0.81, the other values, including the average R² value of the refractive index and layer thickness in the training set and the AA values of the three properties in the validation set, are higher than 0.98. Although the THL-BPNN model did not perform sufficiently well on the O/Si ratio training set, it still provided accurate predictions on the corresponding validation set. This could be attributed to the successful learning of correlations by the THL-BPNN model through training. Therefore, the THL-BPNN model can be used to construct the relationship between the deposition parameters and PEALD-SiO₂ thin film properties, thus proving the universality of the THL-BPNN model in studying the nonlinear relationship between the process parameters and thin film properties.


	Training data		Validation data

	R²	RMSE	AA	RMSE
Refractive index	0.99	0.00	1.00	0.00
Layer thickness	0.99	0.43	0.98	1.72
O/Si ratio	0.81	0.01	0.99	0.03

Table 4. Evaluation of the THL-BPNN model for SiO₂ thin film properties.

View all Tables

4 Conclusions

In this study, BPNN models with different numbers of hidden layers were used to establish the correlation between the properties of PEALD-HfO₂ thin films and annealing parameters. For modeling, the annealing parameters, including the annealing atmosphere and temperature, were used as inputs, and measured thin film properties, including the refractive index, layer thickness and O/Hf ratio, were used as outputs. The data were split into two categories: a training set and a validation set. Firstly, BPNN models with different numbers of hidden layers were compared. The results demonstrated that as the number of hidden layers was increased to achieve higher accuracy on the training sets, the risk of overfitting also increased. Considering the fitting accuracy and model stability, the THL-BPNN model was adopted in a follow-up study. The performance of the THL-BPNN model was then compared with that of the LR and SVR models. The poor performance of the LR model on most datasets indicated that the effect of the two input features on the dependent output variable was nonlinear. The THL-BPNN model achieved a high accuracy of not less than 0.90 on all training and validation datasets, confirming that the THL-BPNN model outperforms the SVR model, which also belongs to the category of nonlinear regression fitting. Finally, the THL-BPNN model was used to predict the LIDT of PEALD-HfO₂ and PEALD-SiO₂ thin films, and the mapping relationship between deposition parameters and PEALD-SiO₂ thin film properties was constructed. The modeling results showed that the predicted values are consistent with the measured values, proving that the THL-BPNN model is a reliable predictive learning-based model. We believe that the THL-BPNN model can be used to predict the properties of different types of thin films, thereby reducing the experimental cost of process optimization.

References

[1] N. Xu, M. Zhu, Y. Chai, B. Roshanzadeh, S. T. P. Boyd, W. Rudolph, Y. Zhao, R. Chen, J. Shao. Opt. Lett., 43, 4538(2018).

[2] B. Ma, J. Q. Han, J. Li, K. Wang, S. Guan, X. S. Niu, H. R. Li, J. L. Zhang, H. F. Jiao, X. B. Cheng, Z. S. Wang. Chin. Opt. Lett., 19, 081403(2021).

[3] Z. Xing, W. Fan, D. Huang, H. Cheng, T. Du. High Power Laser Sci. Eng., 10, e35(2022).

[4] E. S. Field, B. Galloway, D. Kletecka, P. Rambo, I. Smith, V. E. Gruzdev, C. W. Carr, D. Ristau, C. S. Menoni. Proc. SPIE, 11173, 1117314(2019).

[5] K. Ye, T. Xu, Q. Zhong, Y. Dong, S. Zheng, Z. Xu, T. Hu. Opt. Express, 30, 24852(2022).

[6] K. Shuai, X. Liu, Y. Zhao, K. Qiu, D. Li, H. Gong, J. Sun, L. Zhou, Y. Jiang, Y. Dai, J. Shao, Z. Xia. High Power Laser Sci. Eng., 10, e42(2022).

[7] S. Malobabic, M. Jupe, D. Ristau. Light Sci. Appl., 5, e16044(2016).

[8] C. Mahata, Y. C. Byun, C. H. An, S. Choi, Y. An, H. Kim. ACS Appl. Mater. Interfaces, 5, 4195(2013).

[9] T. Faraz, H. C. M. Knoops, M. A. Verheijen, C. A. A. van Helvoirt, S. Karwal, A. Sharma, V. Beladiya, A. Szeghalmi, D. M. Hausmann, J. Henri, M. Creatore, W. M. M. Kessels. ACS Appl. Mater. Interfaces, 10, 13158(2018).

[10] L. H. Kim, K. Kim, S. Park, Y. J. Jeong, H. Kim, D. S. Chung, S. H. Kim, C. E. Park. ACS Appl. Mater. Interfaces, 6, 6731(2014).

[11] H. Liu, L. Jensen, P. Ma, D. Ristau. Appl. Surf. Sci., 476, 521(2019).

[12] G. Abromavičius, S. Kičas, R. Buzelis. Opt. Mater., 95, 109245(2019).

[13] W. Li, P. Chen, B. Xiong, G. Liu, S. Dou, Y. Zhan, Z. Zhu, T. Chu, Y. Li, W. Ma. J. Phys. Mater., 5, 014003(2022).

[14] E. J. Liu, Z. M. Yu, Z. Q. Wan, L. Shu, K. X. Sun, L. L. Gui, K. Xu. Chin. Opt. Lett., 19, 113901(2021).

[15] A. Lininger, M. Hinczewski, G. Strangi. ACS Photonics, 8, 3641(2021).

[16] L. Xia, Y. Z. Hu, W. Y. Chen, X. G. Li. High Power Laser Sci. Eng., 8, e28(2020).

[17] G. Kimaev, L. A. Ricardez-Sandoval. J. Phys. Chem. C, 124, 18615(2020).

[18] Y. D. Ko, P. Moon, C. E. Kim, M. H. Ham, M. K. Jeong, A. Garcia-Diaz, J. M. Myoung, I. Yun. Surf. Interface Anal., 45, 1334(2013).

[19] Y. D. Ko, P. Moon, C. E. Kim, M. H. Ham, J. M. Myoung, I. Yun. Expert Syst. Appl., 36, 4061(2009).

[20] A. Bahramian. Surf. Interface Anal., 45, 1727(2013).

[21] M. Jafari Gukeh, S. Moitra, A. N. Ibrahim, S. Derrible, C. M. Megaridis. ACS Appl. Mater. Interfaces, 13, 46171(2021).

[22] M. Fetanat, M. Keshtiara, R. Keyikoglu, A. Khataee, R. Daiyan, A. Razmjou. Sep. Purif. Technol., 270, 118383(2021).

[23] G. F. Montufar. Neural Comput., 26, 1386(2014).

[24] D. Mengu, M. S. Sakib Rahman, Y. Luo, J. Li, O. Kulce, A. Ozcan. Adv. Opt. Photonics, 14, 209(2022).

[25] M. Liu, L. Chen, X. Du, L. Jin, M. Shang. IEEE Trans. Neural Network Learn. Syst., 34, 2156(2023).

[26] Z. Lin, M. Zhu, C. Song, T. Liu, C. Yin, T. Zeng, J. Shao. J. Alloys Compd., 946, 169443(2023).

[27] C. Yin, M. Zhu, T. Zeng, C. Song, Y. Chai, Y. Shao, R. Zhang, J. Zhao, D. Li, J. Shao. J. Alloys Compd., 859, 157875(2021).

[28] C. Cortes, V. Vapnik. Mach. Learn., 20, 273(1995).

[29] Y. Wang, W. Wu, X. Zheng, Y. Zeng, M. Ding, C. Zhang. J. Therm. Spray Technol., 20, 1177(2011).

[30] Y. Xu, X. Zhang, Y. Fu, Y. Liu. Photonics Res., 9, B135(2021).

[31] L. Ma, J. Li, Z. Liu, Y. Zhang, N. Zhang, S. Zheng, C. Lu. Chin. Opt. Lett., 19, 011301(2021).

[32] X. Qiu. Neural Networks and Deep Learning(2020).

[33] P. R. Wiecha, A. Arbouet, C. Girard, O. L. Muskens. Photonics Res., 9, B182(2021).

[34] Y. LeCun, Y. Bengio, G. Hinton. Nature, 521, 436(2015).

[35] X. Guo, T. D. Barrett, Z. M. Wang, A. I. Lvovsky. Photonics Res., 9, B71(2021).

[36] M. T. Hagan, M. B. Menhaj. IEEE Trans. Neural Netw., 5, 989(1994).

[37] D. Chicco, M. J. Warrens, G. Jurman. PeerJ Comput. Sci., 7, e623(2021).

[38] T. Chai, R. R. Draxler. Geosci. Model Dev., 7, 1247(2014).

[39] T. Y. Pu, W. W. Liu, Y. L. Wang, X. M. Pan, L. Q. Chen, X. F. Liu. High Power Laser Sci. Eng., 9, e19(2021).

[40] W. Du, M. Zhu, J. Shi, T. Liu, J. Sun, K. Yi, J. Shao. High Power Laser Sci. Eng., 11, e61(2023).

[41] W. Liu, C. Wei, J. Wu, Z. Yu, H. Cui, K. Yi, J. Shao. Opt. Express, 21, 22476(2013).