An adaptive microwave photonic angle-of-arrival (AOA) estimation approach based on a convolutional neural network with a bidirectional gated recurrent unit (BiGRU-CNN) is proposed and demonstrated. Compared with the previously reported AOA estimation methods based on phase-to-power mapping, the proposed method is unnecessary to know the frequency of the signal under test (SUT) in advance. The envelope voltage correlation matrix is obtained from dual-drive Mach–Zehnder modulator (N-DDMZM, N > 2) optical interferometer arrays first, and then AOA estimations are performed on different frequency signals with the aid of BiGRU-CNN. A three-DDMZM-based experiment is carried out to assess the estimation performance of microwave signals at three different frequencies, and the mean absolute error is only 0.1545°.

- Chinese Optics Letters
- Vol. 21, Issue 9, 090001 (2023)
Abstract
1. Introduction
Angle-of-arrival (AOA) estimation is a vital parameter in passive target perception fields such as radar, wireless communication, and electronic warfare[1]. Traditionally, AOA parameters are usually obtained by utilizing electronic techniques, in which methods based on phase interferometers are most widely studied due to their high sensitivity and speed. However, for modern electronic technology, it is a critical challenge to meet the need of large instantaneous bandwidth. Photonic technology is well known for its large bandwidth and low loss. Up to now, a variety of photonics-based techniques have been proposed and developed for microwave measurements[2]. For AOA estimation based on microwave photonics (MWP), different optical modulators such as the Mach–Zehnder modulator (MZM)[3], dual-drive Mach–Zehnder modulator (DDMZM)[4,5], phase modulator (PM)[6], dual-parallel Mach–Zehnder modulator (DPMZM)[7], and dual-polarization dual-drive Mach-Zehnder modulator (DPol-DDMZM)[8] are generally used as photonic interferometers to map the arrival phase or time difference of the signal under test (SUT) into easily measurable parameters related to power. Although the above-mentioned methods are simple and effective, the frequency of SUT should be known in advance. However, the frequencies of noncooperative signals are unknown as well as AOA in practice. Hence, it is necessary to perform instantaneous frequency measurement (IFM) prior to the AOA estimation. To address this, some simultaneous AOA and frequency measurement methods based on MWP are presented. A concurrent photonic measuring system based on a DPMZM and an asymmetry Mach–Zehnder interferometer (AMZI) is proposed, AOA and the chirp rate of a linear frequency-modulated (LFM) signal can be measured simultaneously, which has high accuracy but is only applicable to LFM signals[9]. In Ref. [10], frequencies and AOA of multiple targets can be measured simultaneously by combining the optical time-division channelized I/Q downconversion and van Cittert–Zernike theorem. However, the channelized structure is complex.
Recently, deep learning (DL) has drawn growing attention in not only computer vision but also the signal processing of optoelectronic systems. In Ref. [11], based on data-driven supervised learning training, an adaptive deep-learning algorithm was proposed for different MWP receiving systems by changing the training data sets and retraining the same neural network. In Ref. [12], an autoencoder-residual network was designed to adaptively mitigate the nonlinearity and noise of the received broadband without the need for calculating the multifactorial nonlinear transfer functions. Additionally, the application of DL to direction-of-arrival estimation based on array signal processing has also achieved significant success, and not only greatly improves estimation performance and generalization[13], but also can estimate the number of signal sources[14]. In summary, DL is based on a data-driven approach, which does not require setting an a priori observed model, but learns the model directly from the training data and adapts its own structure and parameters according to the dynamic environment and target characteristics.
In this paper, we propose a convolutional neural network with a bidirectional gated recurrent unit (BiGRU-CNN) to achieve adaptive AOA estimations without additional frequency measurements. First, the optical interferometer array with N-DDMZMs is constructed to obtain the envelope voltage vectors, which are associated with the frequency and AOA of the SUT. Then the BiGRU-CNN model is used to automatically extract features and learn the frequency information from the envelope voltage correlation matrix to establish mapping at different frequencies. Compared with the previous AOA estimation methods based on phase-to-power mapping that require extra IFM, the BiGRU-CNN exhibits strong generalization and has a simpler structure and measurement process. To the best of our knowledge, this is the first attempt to achieve MWP adaptive AOA estimation without frequency guidance based on a DL algorithm. In addition, the BiGRU-CNN model can perform IFM at a specific angle, which provides a new perspective for realizing two-dimensional parameter estimation of frequency and AOA.
Sign up for Chinese Optics Letters TOC. Get the latest issue of Chinese Optics Letters delivered right to you!Sign up now
2. Principle
Figure 1 shows a schematic diagram of the proposed system, which consists of four main modules: the antenna array, the optical interferometer array, the electrical processing array, and the adaptive algorithm. The antenna array consists of a reference antenna (
Figure 1.Schematic diagram of the proposed adaptive microwave photonic AOA estimation system using BiGRU-CNN. V, envelope voltage vector; R, correlation matrix.
Figure 2.(a) RF signal incident on two array elements of the DDMZM-i; (b) simulation prediction of the relationship between the output envelope voltage and AOA under different signal frequencies: di = 1.875 cm, ΔL = 0 cm; (c) model of the BiGRU-CNN.
DL algorithms can infinitely approximate arbitrarily complex mapping relationships based on the data-driven principle. Considering that the AOA information of motion targets is continuous in reality and the correlation of samples, the AOA estimation accuracy and generalization capability will be improved using a long short-term memory (LSTM) network[15], which is suitable for processing time-sequential data. The gated recurrent unit (GRU) neural network simplifies the complex structure of the LSTM cell and has faster convergence while maintaining accuracy. The bidirectional gated recurrent unit (BiGRU) is a layer of reverse GRU added on top of unidirectional GRU. BiGRU not only improves the gradient disappearance but also increases the number of neural units, which can get more accurate prediction results[16]. Similar to array signal processing, in order to obtain the correlation between spatial signal dimensions, we map the envelope voltage vector
Take
3. Results and Discussion
A proof-of-concept experiment based on a three-DDMZM, shown in Fig. 3, was carried out. The light is generated by a laser diode with 1550 nm wavelength and divided into three branches through a
Figure 3.Experimental setup of three-DDMZM-based AOA estimation system.
To ensure the authenticity of the data, data acquisition experiments were carried out in a microwave darkroom in order to exclude as much as possible the influence of external frequency bands. The receiving antenna array consists of four 2–18 GHz cavity-backed helical antennas, and the antenna separation distances were configured randomly:
Figure 4(a) shows the normalized output envelope voltage
Figure 4.(a) Normalized amplitude response for AOA at signal frequency 13 GHz; (b) normalized amplitude response for frequency at AOA 30°.
Layer | Output Shape | Units | Filter Size | Number of Filters |
---|---|---|---|---|
Input | 3 × 3 | / | / | / |
BiGRU | 3 × 512 | 256 | / | / |
Convolution-1 | 3 × 512 | / | 1 × 1 | 512 |
Convolution-2 | 3 × 256 | / | 1 × 1 | 256 |
Convolution-3 | 3 × 128 | / | 1 × 1 | 128 |
Flattened | 384 | / | / | / |
Fully connected | 128 | 128 | / | / |
Output | 1 | 1 | / | / |
Table 1. Parameters of the Optimized BiGRU-CNN
To avoid overfitting, the learning rate will be reduced by 10%, and training will be early stopped when the validation loss is no longer reduced after 5 epochs and 10 epochs, respectively. The training time and test time of BiGRU-CNN are about 1 h 18 min and 54 s, respectively, on a personal computer with Intel i7-6700 CPU (4-core). To demonstrate the superiority of our BiGRU-CNN model, we replace the BiGRU layer of the BiGRU-CNN model by a 1D convolutional layer with 512 filters to build a CNN model. What is more, we compare the BiGRU-CNN and CNN with and without using correlation matrix
Figure 5.(a) Training loss and (b) validation loss of different neural network architectures during training.
Figure 6 shows the experimental results and the corresponding errors over a
Figure 6.Experimental results of actual AOA and estimated AOA (blue circles) and the corresponding errors at different frequencies (3, 8, and 13 GHz) over a −80° to 80° measurement range. The blue solid line represents the ideal curve for actual AOA and estimated AOA.
To further illustrate that our model can capture and learn the frequency information of the SUT, we collected samples from 3 to 5 GHz with a step of 20 MHz at AOA of 30° as training data to perform frequency estimation. The training is also based on the BiGRU-CNN model implemented in Table 1. The final two separately trained prediction models for AOA estimation and frequency measurement are obtained, respectively. We apply a 3 GHz pulsed signal (width 1 µs, period 2 µs for a 20 µs duration) that two BiGRU-CNN models have not seen before to perform frequency and AOA estimation based on two separately trained models in real time. Figure 7 shows the comparison between the actual frequency and estimated frequency, and the corresponding error is indicated by a green crossing marker. The estimated frequency within the time of pulse appears to match very well with the actual frequency. The MAE of the IFM is 10.1 MHz. On the other hand, the MAE of AOA estimation is 0.3727° during the whole pulse duration, which demonstrates that our method is capable of measuring not only continuous wave signals, but also pulsed signals.
Figure 7.Real-time IFM with 3 GHz pulse signal (width 1 µs, period 2 µs) at AOA of 30°; the frequency of 0 GHz means no signal is incident.
Just like most methods based on phase-to-power mapping, the power variation of the SUT may influence the measurement results. Theoretically, the effect of SUT power can be calibrated by normalizing the output envelope signal voltage
4. Conclusions
In conclusion, an adaptive AOA estimation algorithm using BiGRU-CNN based on an N-DDMZM array system is proposed. The BiGRU-CNN model can perform high accuracy AOA estimations of different frequency signals without additional IFM. A proof-of-concept experiment was conducted to verify the performance of the AOA estimation at three different frequencies, and the MAE is 0.1545°. Moreover, our method demonstrates the ability to perform real-time IFM at specific angles, which has great potential for further application to measure AOA and frequency simultaneously.
References
[13] W. Zhu, M. Zhang. A deep learning architecture for broadband DOA estimation. International Conference on Communication Technology(2019).
[18] N. Zhang, Z. Wen, X. Hou, W. Wen. Digital automatic gain control design with large dynamic range in wireless communication receivers. International Conference on Communication Technology(2017).

Set citation alerts for the article
Please enter your email address