
- Chinese Optics Letters
- Vol. 23, Issue 7, 071407 (2025)
Abstract
Keywords
1. Introduction
Ultrafast lasers have fundamentally transformed the field of photonics, enabling diverse applications in material processing, biomedicine, and scientific research[1–3]. Among various ultrafast lasers, ultrafast Raman fiber lasers (URFLs) demonstrate superior performance due to their unique capabilities. URFLs provide high pulse energy and short pulse duration, with their most distinctive advantage being wavelength agility that far exceeds the limited spectral range of conventional rare-Earth-doped fiber lasers[4]. This broad spectral flexibility enables precise wavelength selection crucial for various advanced applications, including two-photon microscopy, optical frequency comb generation, and specialized material processing[5–7].
URFLs can be generated via three primary approaches: mode locking, synchronous pumping, and nonlinear optical gain modulation (NOGM). Mode-locked and synchronous pumped URFLs are both based on oscillators to generate ultrafast Raman pulses. To achieve stable Raman solitons, gain, loss, dispersion, and nonlinearity inside the oscillator need to meet a delicate balance[8]. Therefore, the output pulse energy is often restricted to tens of nanojoules[9–11]. Besides, other disadvantages like the requirement for long Raman gain fiber or a complex synchronization unit greatly limit the practical application of URFLs.
In contrast, NOGM presents a simpler and more promising approach to generate high-energy Raman pulses by replacing the oscillator with an amplifier structure. This technique employs ultrafast pulse pumping to modulate the fiber amplifier’s gain, enabling direct conversion of continuous-wave (CW) single-frequency (SF) seed light into ultrashort pulses[12,13]. The absence of a traditional resonant cavity not only simplifies the laser structure but also facilitates the generation of higher-energy, shorter-duration pulses by minimizing the deleterious effects of dispersion and nonlinearity typically associated with a long Raman fiber cavity[4].
Sign up for Chinese Optics Letters TOC. Get the latest issue of Chinese Optics Letters delivered right to you!Sign up now
Despite the significant advantages of NOGM, the numerical simulation of such systems presents challenges. The underlying physics involves multiple nonlinear effects in optical fibers, including stimulated Raman scattering (SRS), self-phase modulation (SPM), cross-phase modulation (XPM), and self-steepening, necessitating sophisticated numerical models to accurately describe the pulse envelope evolution along the propagation path. These physical processes can be comprehensively described by the nonlinear Schrödinger equation (NLSE), which incorporates various effects such as gain, loss, dispersion, and multiple nonlinear terms. Traditional numerical methods for solving the NLSE, such as the split-step Fourier method and fourth-order Runge–Kutta method, suffer from substantial computational overhead. Although advanced numerical algorithms, such as those based on the differential quadrature method (DQM), have been developed for solving NLSE, such as implementations using improved cubic B-Spline functions as test functions[14], these primarily enhance computational accuracy rather than efficiency. The computational constraint poses a significant challenge: the prolonged calculation times preclude their integration into systems that rely on real-time computation for dynamic control and operation.
Recently, deep learning techniques have emerged as powerful tools for analyzing and optimizing complex physical systems. In terms of ultrafast fiber lasers, pioneering works have demonstrated the capability of neural networks to predict system behaviors and learn from various data with remarkable accuracy and speed. Boscolo et al. employed fully connected neural networks (FCNNs) to predict self-similar parabolic pulse propagation in optical fibers[15]. Their model considered gain, loss, group velocity dispersion (GVD), and SPM, successfully predicting output pulse intensities and spectra at various propagation distances. Jiang et al. utilized physics-informed neural networks (PINNs) to solve the NLSE, predicting ultrashort pulse propagation under the combined effects of GVD and SPM[16]. Salmela et al. demonstrated FCNN-based prediction of supercontinuum generation[17]. By offering superior computational speed and the capability to learn from experimental data, deep learning approaches provide potential solutions to the aforementioned challenge. Yet, numerical simulations of NOGM Raman fiber amplifiers based on neural networks have not been reported so far.
In this work, we introduce a deep learning approach to learn and predict the first-order Stokes light generation in NOGM systems. Our study targets a NOGM Raman fiber amplifier employing a 1121 nm SF-CW seed source and a 1064 nm pulsed pump source, where the first-order Stokes pulses at 1121 nm are generated through SRS gain modulation. We generated training datasets based on the NLSE by constructing pump pulse parameter sets with experimental relevance, incorporating parameters from different types of Raman gain fibers. An FCNN architecture was developed and trained on these datasets, achieving accurate and rapid predictions of the first-order Stokes light evolution. This approach demonstrates significant computational acceleration compared to conventional methods. Furthermore, the model exhibits effective generalization capabilities, accurately predicting outputs for untrained parameter combinations even when trained on imperfect, noise-affected data without complete knowledge of all influencing factors.
2. Numerical Model of NOGM
Numerical simulations are performed to study the pulse evolution in a fiber-based NOGM system, the schematic of which is shown in Fig. 1. Gaussian-shaped pump pulses at 1064 nm are launched into a piece of Raman gain fiber, seeded by a 1121 nm SF-CW laser[14]. The NOGM process is accomplished in the Raman fiber, which eventually transforms the CW seed into ultrafast Raman pulses. A numerical model based on the generalized nonlinear Schrödinger equation (GNLSE) is used to simulate the pump and Raman pulse evolutions along the optical fiber through the NOGM process:
Figure 1.Schematic diagram of the simulated NOGM system: high-power pump pulses at 1064 nm and the continuous-wave seed at 1121 nm are coupled through a wavelength division multiplexer (WDM) into the Raman gain fiber.
The left side of Eq. (1) builds a model of linear propagation effects, where
To accurately simulate the pulse evolution along the fiber, the pump pulses are coupled with the seed laser through a WDM at the fiber input (
3. Architecture of the Neural Network
Based on previous studies[17], the FCNN was selected as our neural architecture for predicting Raman pulse evolution in the NOGM system, owing to its simple structure and excellent convergence properties compared with other network structures like convolutional neural networks (CNNs)[18] or recurrent neural networks (RNNs)[19]. Through extensive testing of architectures with varying widths and depths, we observed that, while increasing the network’s width and depth could improve overall accuracy, the enhancement in spectral prediction was not obvious. Therefore, as shown in Fig. 2, a relatively concise four-layer FCNN structure is selected and designed with hidden layer widths of 256, 256, 512, and 512, respectively. This architecture maintains prediction accuracy while reducing network complexity. The rectified linear unit (ReLU) activation function was implemented for all layers.
Figure 2.Architecture of the implemented fully connected neural network (FCNN): a four-layer structure with hidden layer widths of 256, 256, 512, and 512, employing ReLU activation functions. The input includes initial pump pulse parameters and propagation distances, while the output represents the predicted spectrum.
4. Data Preparation and Training Process
The FCNN was trained using raw data obtained from numerical solutions of the GNLSE described in Sec. 2. The training dataset was constructed with pulse widths uniformly distributed between 10 and 30 ps. For PM 980 fiber, initial pump pulse energies ranged from 20 to 500 nJ, while for PM Raman fiber, they varied from 20 to 300 nJ. The initial pump pulse width was varied in steps of 2 ps, and the initial pump pulse energy was incremented in steps of 20 nJ. The power of the seed light is 0.01 W. According to the ratio of their SRS gain coefficient, the total propagation distances were set to 100 and 20 cm for PM 980 and PM Raman fibers, respectively, with training data sampled at 1 cm intervals along the propagation length. Through iterative generation, we obtained 27,775 training samples for PM 980 fiber and 3465 samples for PM Raman fiber, which were subsequently organized into input–output pairs for network training.
The hidden layers were initialized using the Glorot method, and training was conducted with a batch size of 64. We employed mean square error (MSE) as the loss function, incorporating L2 regularization with a weight of 0.001 to ensure more uniform weight matrices. The learning rate was initially set to 0.001 and subsequently decreased following the cosine annealing schedule. Training was performed using the Adam optimizer over thousands of epochs. To prevent overfitting, 10% of the training data was randomly reserved for validation, with validation checks performed every 50 epochs. An early stopping strategy was implemented with a patience of 100 epochs, using validation loss as the optimization criterion. Prior to network input, the output data underwent logarithmic preprocessing to emphasize high-energy spectral features.
As shown in Fig. 3, both training and validation losses decrease steadily during the training process, eventually converging to stable values, indicating successful network optimization without overfitting.
Figure 3.Evolution of training and validation losses during the network training process, showing the convergence of both losses over training epochs.
5. Results
Figures 4 and 5 illustrate the propagation evolution of a Gaussian pulse in PM 980 fiber (initial pump pulse energy
Figure 4.Comparison of simulated and FCNN predicted spectra for pulse propagation in PM 980 fiber with initial pump pulse parameters of 400 nJ and 10 ps: (a) GNLSE-simulated spectral evolution; (b) FCNN-predicted spectral evolution; (c) absolute error distribution; (d)–(i) spectral comparisons at specific propagation distances (z = 0.1, 0.2, 0.4, 0.6, 0.8, 1.0 m).
Figure 5.Comparison of simulated and FCNN predicted spectra for pulse propagation in PM Raman fiber with initial pump pulse parameters of 300 nJ and 10 ps: (a) GNLSE-simulated spectral evolution; (b) FCNN-predicted spectral evolution; (c) absolute error distribution; (d)–(i) spectral comparisons at specific propagation distances (z = 0.02, 0.04, 0.08, 0.12, 0.16, 0.20 m).
Analysis of the error distribution reveals two primary sources of discrepancy. First, the red colored areas in Figs. 4(c) and 5(c) indicate errors in low-energy regions (particularly intensity below
5.1. Accuracy
To quantitatively evaluate the discrepancy between neural network predictions and GNLSE numerical simulations, we employed the normalized root mean square error (NRMSE) as our metric,
Figure 6 presents the NRMSE (
Figure 6.Evolution of NRMSE (−10 dB) with propagation distance for different initial pump pulse parameters: (a) PM 980 fiber; (b) PM Raman fiber. The NRMSE (−10 dB) is calculated for spectral regions above −10 dB relative to the maximum intensity of the generated spectrum.
Figure 7 compares the prediction NRMSE of three neural networks with different sizes. Net 2, with its architecture illustrated in Fig. 2, serves as our primary network structure. Net 1, which contains fewer parameters than Net 2, exhibits degraded prediction performance. Although Net 3, with more parameters than Net 2, achieves slightly higher accuracy, it shows no substantial advantages over Net 2 in predicting spectral evolution characteristics for most parameter combinations. This comparison demonstrates that our selected network architecture (Net 2) achieves an optimal balance between prediction performance and computational efficiency.
Figure 7.Performance comparison of neural networks with different architectures in predicting spectral evolution: (a) Net 1 with reduced parameters; (b) Net 2 with architecture shown in Fig.
5.2. Generalization ability
To investigate the generalization ability of the neural network model, we evaluated its prediction performance across various input parameters (pulse width and pulse energy). For each parameter combination, we calculated the NRMSE (
Figure 8 presents a heat map of the mean NRMSE (
Figure 8.Heat map showing the neural network’s generalization capability for different input pulse parameters in PM Raman fiber. The color represents the mean NRMSE (−10 dB) values across all propagation distances (0–0.20 m).
Figure 9.Comparison of simulated and FCNN predicted spectra for pulse propagation in PM Raman fiber with initial pump pulse parameters of 460 nJ and 36 ps: (a) GNLSE-simulated spectral evolution; (b) FCNN-predicted spectral evolution; (c) absolute error distribution; (d)–(i) spectral comparisons at specific propagation distances (z = 0.02, 0.04, 0.08, 0.12, 0.16, 0.20 m).
5.3. Calculation speed
To evaluate the computational efficiency of our neural network model, we compared the computation time between the deep learning approach and conventional GNLSE numerical solving under various conditions. All calculations were performed on a computer equipped with a 12th Generation Intel Core i7-12650 H processor (2.30 GHz) and an NVIDIA GeForce RTX 4060 Laptop GPU. The computation time was measured by averaging 100 consecutive calculations for each condition. For instance, when simulating a 20 cm propagation with an initial pump pulse energy of 280 nJ and an initial pump pulse width of 12 ps, 100 GNLSE numerical calculations took approximately 258 s (2.58 s per calculation) for PM 980 fiber and 1295 s (12.9 s per calculation) for PM Raman fiber, while 100 neural network predictions were completed in about 15 s (0.15 s per prediction) for both fiber types. Notably, the computation time of the GNLSE increases significantly with propagation distance, whereas the neural network’s computation time, primarily determined by the network parameters, remains largely independent of the propagation distance. This computational advantage makes the neural network model particularly suitable for practical applications requiring rapid predictions.
6. Conclusion
In this work, we have successfully developed and demonstrated a deep learning approach for predicting pulse evolution in an NOGM Raman fiber amplifier. Our four-layer FCNN architecture effectively captures the complex nonlinear dynamics involved in the NOGM process, achieving accurate predictions for both PM 980 and PM Raman fibers. Within the primary operational parameters (pulse widths at 10–30 ps, energies up to 500 nJ for PM 980 fiber and 300 nJ for PM Raman fiber), the neural network model demonstrates excellent prediction accuracy with the NRMSE (
References
[2] S. Svanberg. Some medical and biological applications of ultrafast lasers. Ultrafast Optics IV: Selected Contributions to the 4th International Conference on Ultrafast Optics, 437(2004).
[8] D. S. Kharenko, A. E. Bednyakova, I. Zhdanov et al. Raman dissipative solitons. Dissipative Optical Solitons, 163(2022).
[13] W. Qi, J. Zhou, Y. Feng. Nonlinear optical gain modulation: a novel method to generate highly-coherent femtosecond pulses. Conference on Lasers and Electro-Optics (CLEO), 1(2022).

Set citation alerts for the article
Please enter your email address