
- High Power Laser and Particle Beams
- Vol. 36, Issue 12, 122004 (2024)
High-resolution flow field data play an important role in fluid mechanics research, engineering design, and technological innovation, and acquiring and analyzing such data accurately is essential for understanding flow phenomena. There are two main ways to obtain high-resolution flow field data: experimental measurement, such as Particle Image Velocimetry (PIV)[1], and Computational Fluid Dynamics (CFD)[2], such as Direct Numerical Simulation (DNS)[3-5]. However, both routes are costly or time-consuming, so new methods need to be developed.
Artificial intelligence is developing at an unprecedented pace[6]. Convolutional Neural Networks (CNNs)[7], one of its principal methods, have been widely used in many fields and have driven considerable innovation and progress. A CNN is a deep learning model inspired by the biological visual system, and its development has passed several milestones. Early CNN models date back to the 1990s, such as LeNet[8], proposed by LeCun for handwritten digit recognition. In the 2012 ImageNet Large-Scale Visual Recognition Challenge, AlexNet[9] achieved breakthrough results by using a deeper network structure and a large amount of image data for training, which greatly accelerated the development of CNNs. Since then, various improved CNN models have emerged, such as VGGNet[10], GoogLeNet[11], and ResNet[12], which have achieved excellent performance in different computer vision tasks.
The rapid development of CNNs has spread to many fields, including fluid mechanics, and the two have become closely combined in recent years. In 2018, Jin et al. designed a fused CNN model[13] that predicts the velocity field around a cylinder from the pressure fluctuations on the cylinder alone; the model performs well at different Reynolds numbers. In 2019, Sekar et al. first used a CNN to extract the geometric features of an airfoil[14] and then predicted the flow field around the airfoil through fully connected layers with the help of other state information. For high-resolution reconstruction of flow fields, Fukami et al. proposed a hybrid Downsampled Skip-Connection Multi-Scale model[15] in 2019, which performs well with only dozens of training samples. In the same year, Deng et al. constructed two models based on generative adversarial networks to enhance the spatial resolution of the complex wake behind two side-by-side cylinders[16]. In 2020, Liu et al. developed the multi-time-path CNN model for high-resolution reconstruction of turbulence[17]. In 2021, Zhou et al. used the coarse velocity field and the geometric parameters of the pore structure as the input of a CNN model to achieve super-resolution reconstruction of the pore flow field in porous media[18]. In 2022, Jagodinski et al. designed a three-dimensional CNN model to identify significant structures related to ejection events in wall-bounded turbulent flows[19].
Flow field data for the ablative Rayleigh–Taylor instability (ARTI)[20] are mainly obtained by numerical simulations and experiments. High-precision simulation requires a fine mesh, which makes it costly and time-consuming. Low-precision simulation is fast, but it cannot describe the physical characteristics of the flow field in detail. We therefore need a new method that can obtain high-precision ARTI flow field data at a relatively small computational cost. In this study, two different CNN models are presented for high-resolution reconstruction. Both models quickly transform low-resolution data into high-resolution data, which allows high-resolution ARTI flow field data to be obtained rapidly.
1 Methods
1.1 ARTI and DNS
The Rayleigh–Taylor instability (RTI) occurs at the perturbed interface between two fluids of different densities, where the light fluid accelerates or supports the heavy fluid[21]. In inertial confinement fusion (ICF), RTI can break the symmetry of implosions by destroying the integrity of the spherical ablator-fuel shell. Predicting the growth of RTI is therefore crucial for improving the success probability of ICF. When the effects of thermal conduction are included, a different hydrodynamic instability arises, identified as ARTI, which occurs between the internal and external layers during the implosion phase of ICF[22].
The governing equations of ARTI in a constantly accelerating 2D reference frame are as follows
where
The data used in this work are derived from the DNS of ARTI in Ref.[4], and an Euler code commonly applied in high-energy-density physics is used in our study. Before the formal training of the CNN model
1.2 Two CNN models
In this paper, we discuss the application of an ordinary CNN model and a multi-time-path CNN model, and compare their results with those of the BiCubic interpolation method[23].
The ordinary CNN model consists of three convolutional layers and an up-sampling layer, as shown in Fig.1. The input of the model is the low-resolution flow field data, and the output is the high-resolution flow field data at the same moment. The convolutional layers extract the features of the flow field data. The size of the convolution kernel is
Figure 1.Schematic diagram of ordinary CNN structure
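As a rough illustration of the structure in Fig.1, the following PyTorch sketch stacks three convolutional layers followed by one up-sampling layer. The kernel size, the channel widths, the use of sub-pixel convolution (`nn.PixelShuffle`) for up-sampling, and the Kaiming initialization are assumptions made here for illustration, not values taken from the paper. The class name `OrdinaryCNN` is likewise hypothetical.

```python
import torch
from torch import nn


class OrdinaryCNN(nn.Module):
    """Illustrative sketch: three convolutional layers plus one up-sampling layer."""

    def __init__(self, channels=1, features=64, kernel_size=3, r=4):
        super().__init__()
        pad = kernel_size // 2  # keeps the spatial size unchanged through each convolution
        self.body = nn.Sequential(
            nn.Conv2d(channels, features, kernel_size, padding=pad),
            nn.ReLU(),
            nn.Conv2d(features, features, kernel_size, padding=pad),
            nn.ReLU(),
            # No activation after the last convolution: reconstruction is a regression task.
            nn.Conv2d(features, channels * r * r, kernel_size, padding=pad),
        )
        # Assumed up-sampling layer: rearranges r*r channel groups into an r-times finer grid.
        self.upsample = nn.PixelShuffle(r)
        # Assumed ReLU-oriented weight initialization (Kaiming normal).
        for module in self.modules():
            if isinstance(module, nn.Conv2d):
                nn.init.kaiming_normal_(module.weight, nonlinearity="relu")
                nn.init.zeros_(module.bias)

    def forward(self, x):
        return self.upsample(self.body(x))
```

With `r=4`, a low-resolution input of shape `(batch, channels, 16, 16)` is mapped to an output of shape `(batch, channels, 64, 64)`.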
The second model is a multi-time-path CNN. The schematic diagram of the model is shown in Fig.2.
Figure 2.Schematic diagram of the structure of a multi-time-path CNN
The input of the multi-time-path CNN is the low-resolution flow field data of the five moments at
Compared with the ordinary CNN, the multi-time-path CNN has more input data and a more complex network structure, so it is expected to give better results. Notably, the multi-time-path CNN takes a time series as its input and can therefore exploit the temporal evolution of the flow field, an advantage that the ordinary CNN does not have.
1.3 Model training
After a series of data processing, we randomly divided
Before training, the input data are normalized, which makes the model converge faster. We select ReLU, which is widely used in neural networks, as the activation function. Because super-resolution reconstruction is in essence a regression task, no activation function is applied after the last convolutional layer. For weight initialization, we use the initialization method that corresponds to the ReLU function.
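The paper does not state which normalization is applied, so the sketch below assumes a simple min-max scaling whose bounds are computed on the training data only and then reused for any other data. The function names `fit_minmax`, `normalize`, and `denormalize` are hypothetical.

```python
import numpy as np


def fit_minmax(train):
    """Return the (min, max) of the training data; these bounds are reused for all other data."""
    return float(np.min(train)), float(np.max(train))


def normalize(field, lo, hi):
    """Scale a field so that the training range [lo, hi] maps to [0, 1]."""
    if hi == lo:
        raise ValueError("training data are constant; min-max scaling is undefined")
    return (np.asarray(field, dtype=float) - lo) / (hi - lo)


def denormalize(field_norm, lo, hi):
    """Map a normalized field back to physical units."""
    return np.asarray(field_norm, dtype=float) * (hi - lo) + lo
```

Network predictions made on normalized inputs can be passed through `denormalize` to recover physical values before they are compared with the reference data.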
The loss function in the above two models is written as
$$L = \frac{1}{N}\sum_{i=1}^{N}\left(H_i - P_i\right)^2$$
where L is the Mean Square Error (MSE), N is the number of data points, and H_i and P_i are the real value and the predicted value at point i, respectively.
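The mean square error described above can be evaluated with a few lines of NumPy. This is a minimal sketch; the function name `mse_loss` and the sample values are illustrative only.

```python
import numpy as np


def mse_loss(H, P):
    """Mean square error between real values H and predicted values P."""
    H = np.asarray(H, dtype=float)
    P = np.asarray(P, dtype=float)
    if H.shape != P.shape:
        raise ValueError("H and P must have the same shape")
    # Average of the squared point-wise differences over all N data points.
    return float(np.mean((H - P) ** 2))


# Three data points with one mismatch of 2: (0 + 0 + 2**2) / 3
example = mse_loss([1.0, 2.0, 3.0], [1.0, 2.0, 5.0])
```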
All the models in this paper are implemented in Python, using PyTorch to construct the CNN models. For gradient updates, the Adam[26] (Adaptive Moment Estimation) optimization algorithm is used to adaptively adjust the parameters of the neural networks, which helps training converge quickly. The learning rate of Adam is set to 0.0001, and each model is trained for 2000 epochs. Training takes 4 h 2 min for the ordinary CNN and 4 h 23 min for the multi-time-path CNN; the latter has a more complex structure and more parameters, and thus requires slightly longer computation time. Once training is completed, the high-resolution reconstruction task can be carried out in a few seconds.
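A training loop with the stated settings (Adam, learning rate 0.0001, MSE loss) might look like the sketch below. The function name `train`, the toy model, the random tensors, and the full-batch update are assumptions for illustration; the paper does not give its batch size or data loading.

```python
import torch
from torch import nn


def train(model, lowres, highres, epochs=2000, lr=1e-4):
    """Full-batch training with Adam and MSE loss; returns the loss at each epoch."""
    optimizer = torch.optim.Adam(model.parameters(), lr=lr)
    criterion = nn.MSELoss()
    history = []
    for _ in range(epochs):
        optimizer.zero_grad()
        loss = criterion(model(lowres), highres)
        loss.backward()
        optimizer.step()
        history.append(loss.item())
    return history


# Toy usage with random data (not ARTI data) and only a few epochs.
torch.manual_seed(0)
toy_model = nn.Sequential(nn.Conv2d(1, 16, 3, padding=1), nn.PixelShuffle(4))
toy_lowres = torch.rand(8, 1, 8, 8)      # eight low-resolution samples
toy_highres = torch.rand(8, 1, 32, 32)   # matching targets at 4x resolution
losses = train(toy_model, toy_lowres, toy_highres, epochs=5)
```

In practice the data would normally be split into mini-batches and the loss on a held-out test set tracked alongside the training loss.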
2 Results and analysis
The DNS data used here are ARTI flow field data with a disturbance wavelength
Figure 3.Error maps of two convolutional neural network models
In Fig.3, CNN denotes the ordinary CNN, MTPC denotes the multi-time-path CNN, training loss refers to the error on the training set, and testing loss refers to the error on the test set. The error of the multi-time-path CNN is nearly 10 times smaller than that of the ordinary CNN, which indicates that the multi-time-path CNN has better reconstruction performance.
To show the superiority of the multi-time-path CNN more clearly, we compare the reconstructed results of the two models. All the data shown in Fig.4 and Fig.5 come from the same flow field and are the flow field velocity data at the same position and the same time.
Figure 4.Comparison of reconstructed results with average pooling (
Figure 5.Comparison of reconstructed results with maximum pooling (
Fig.4 compares the reconstruction results when the input is the flow field velocity data in the x and y directions after average pooling, and Fig.5 compares the results when the input is the velocity data in the two directions after maximum pooling. The reconstruction magnification r is 4 in both figures, and the
Furthermore, the density of the flow field is also used as a separate input for training the model. To demonstrate the generality of the model, we analyze the flow field density data of four cases (r=4). The first case is the density data of the weak nonlinear stage with ablation (disturbance wavelength=12 μm), as shown in Fig.6.
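The low-resolution inputs discussed here are obtained by pooling high-resolution data. A minimal NumPy sketch of r×r average and maximum pooling is given below. The function name `pool2d` and the toy 4×4 field are illustrative assumptions and are not taken from the authors' code.

```python
import numpy as np


def pool2d(field, r, mode="average"):
    """Downsample a 2D field by a factor r using non-overlapping r x r blocks."""
    field = np.asarray(field, dtype=float)
    h, w = field.shape
    if h % r or w % r:
        raise ValueError("field dimensions must be divisible by r")
    # blocks[i, a, j, b] == field[i * r + a, j * r + b]
    blocks = field.reshape(h // r, r, w // r, r)
    if mode == "average":
        return blocks.mean(axis=(1, 3))
    if mode == "max":
        return blocks.max(axis=(1, 3))
    raise ValueError("mode must be 'average' or 'max'")


toy_field = np.arange(16.0).reshape(4, 4)
avg = pool2d(toy_field, 2, "average")  # [[2.5, 4.5], [10.5, 12.5]]
mx = pool2d(toy_field, 2, "max")       # [[5., 7.], [13., 15.]]
```

Average pooling smooths each block, whereas maximum pooling keeps the largest value in the block, so the two low-resolution inputs can differ noticeably near sharp features.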
Figure 6.Comparison of the high-resolution reconstructed density data (weak nonlinear stage with ablation, disturbance wavelength=12 μm)
BiCubic in Fig.6 denotes the results of the BiCubic interpolation method, which is widely used in image processing. Its predictions are excessively smooth, so many detailed features of the flow field are lost. Fig.6 clearly shows that both the ordinary CNN and the multi-time-path CNN far outperform the BiCubic interpolation method in high-resolution reconstruction of flow field data. The multi-time-path CNN consistently outperforms the ordinary CNN, recovering more flow field details with a lower error.
Fig.7 shows the flow field density data of the classical linear stage (disturbance wavelength=12 μm), and Fig.8 shows the density data of the nonlinear stage with ablation (disturbance wavelength=12 μm). In addition, we also discuss the flow field with a disturbance wavelength of 30 μm, as shown in Fig.9.
Figure 7.Comparison of the high-resolution reconstructed density data (classical linear stage, disturbance wavelength=12 μm)
Figure 8.Comparison of the high-resolution reconstructed density data (nonlinear stage with ablation, disturbance wavelength=12 μm)
Figure 9.Comparison of the high-resolution reconstructed density data (weak nonlinear stage with ablation, disturbance wavelength=30 μm)
In these cases, the prediction error of the multi-time-path CNN is generally of the order of $10^{-5}$, while the error of the ordinary CNN is of the order of $10^{-4}$. Both errors are significantly lower than that of the BiCubic interpolation method. These cases show that the multi-time-path CNN still performs well for different flow parameters, different stages, and different flow field data.
For both the velocity input and the density input, the multi-time-path CNN reconstructs the flow field with low error. This demonstrates the feasibility of combining machine learning with fluid mechanics. The methods used in this paper can also be applied to other flow field data: given a sufficient amount of flow field data, a corresponding high-resolution reconstruction model can be trained.
3 Conclusion
We have built an ordinary CNN and a multi-time-path CNN to achieve high-resolution reconstruction of the low-resolution ablative Rayleigh–Taylor instability flow field. Compared with the ordinary CNN, the multi-time-path CNN shows better performance and smaller error. To obtain the input data, we first pooled the existing high-precision DNS data of our research group to obtain low-resolution flow field data, and then performed dislocation splicing on the data to obtain training samples. We compared the influence of the input data obtained by two pooling methods on model training: the predictions of the average pooling model are smoother, while the accuracy of the maximum pooling model is slightly higher. The different cases discussed in this paper show that the multi-time-path CNN model maintains excellent performance in different flow fields. Once a CNN model is trained, the high-resolution reconstruction task can be completed in just a few seconds. These two models extend the application of CNNs to fluid instability. As computer technology develops, CNN training will become faster, and more complex models can be constructed to further improve the reconstruction accuracy.
References
[1] Adrian R J. Twenty years of particle image velocimetry[J]. Exp Fluids, 39, 159-169(2005).
[2] Yan Chao, Yu Jian, Xu Jinglei et al. On the achievements and prospects for the methods of computational fluid dynamics[J]. Adv Mech, 41, 562-589(2001).
[6] LeCun Y, Bengio Y, Hinton G. Deep learning[J]. Nature, 521, 436-444(2015).
[10] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[DB/OL]. arXiv preprint arXiv: 1409.1556, 2014.
[11] Szegedy C, Liu Wei, Jia Yangqing, et al. Going deeper with convolutions[C]//Proceedings of 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2015: 1-9.
[12] He Kaiming, Zhang Xiangyu, Ren Shaoqing, et al. Deep residual learning for image recognition[C]//Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2016: 770-778.
[23] Zhong Baojiang, Lu Zhifang, Ji Jiahuan. Review on image interpolation techniques[J]. J Data Acquis Process, 31, 1083-1096(2016).
[24] Shi Wenzhe, Caballero J, Huszár F, et al. Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network[C]//Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2016: 1874-1883.
[26] Kingma D P, Ba J. Adam: a method for stochastic optimization[C]//Proceedings of the 3rd International Conference on Learning Representations. 2015.
