Xueli SHENG, Mengfei MU, Yao BI, et al. Joint optimization of transmit parameters of active sonar driven by multiagent reinforcement learning[J]. Journal of Harbin Engineering University, 2025, 46(8): 1557-1565.
DOI:
Xueli SHENG, Mengfei MU, Yao BI, et al. Joint optimization of transmit parameters of active sonar driven by multiagent reinforcement learning[J]. Journal of Harbin Engineering University, 2025, 46(8): 1557-1565. DOI: 10.11990/jheu.202507001.
Joint optimization of transmit parameters of active sonar driven by multiagent reinforcement learning
Inadequate environmental adaptability of traditional fixed transmission strategies in active sonar systems leads to poor detection stability in underwater acoustic channels. To address this issue
this paper proposes a joint optimization method for active sonar transmission waveform and source level based on multiagent reinforcement learning. First
a multiagent collaborative learning approach was adopted to decouple waveform optimization and source level optimization into multiple agent tasks. Then
a reward-shaping method was introduced to suppress reward signal noise induced by multipeak channel spectra
enhancing the optimization capability of the agents while avoiding subpulse frequency conflicts. Furthermore
a double deep
Q
-network was employed to reduce
Q
-value estimation bias and improve decision stability. Finally
numerical validation was conducted in a typical deep-sea channel scenario reconstructed using measured sound speed gradients from the South China Sea. The results demonstrate that the proposed algorithm outperforms baseline methods in terms of both channel adaptability and echo signal-to-noise ratio control accuracy
providing a viable technical approach for constructing intelligent active sonar systems with environmental self-adaptation capabilities.
关键词
Keywords
references
GUO Qijia, XIE Kean, YE Weibin, et al. A sparse Bayesian learning method for moving target detection and reconstruction[J]. IEEE transactions on instrumentation and measurement, 2025, 74: 4505413.
LAN Chaofeng,ZHENG Zhiwei,CHEN Huan.Method for assessing the combat effectiveness of underwater unmanned clusters based on a complex acoustic propagation environment[J].Journal of Harbin Engineering University,2025,46(1):166-172.
ZHANG Yi, VENKATESAN R, DOBRE O A, et al. Efficient estimation and prediction for sparse time-varying underwater acoustic channels[J]. IEEE journal of oceanic engineering, 2020, 45(3): 1112-1125.
ZHENG Qiaoning, ZHENG Haoci, LI Maolin, et al. Shallow water acoustic channel small sample estimation using enhanced support vector machines[J]. Journal of Harbin Engineering University, 2025, 46(3): 390-400.
LI Haoxin,XIAO Changshi,YUAN Haiwei,et al.Underwater acoustic target recognition method based on feature dimension reduction and fusion[J].Journal of Harbin Engineering University,2025,46(1):102-110.
HAYKIN S. Cognitive radar: a way of the future[J]. IEEE signal processing magazine, 2006, 23(1): 30-40.
LI Xiaohua, LI Yaan, CUI Lin, et al. Research of new concept sonar-cognitive sonar[J]. Journal of marine science and application, 2011, 10(4): 502-509.
LI Xiaohua, LI Yaan, LIN Guancheng, et al. Research of the principle of cognitive sonar and beamforming simulation analysis[C]//2011 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC). Piscataway, NJ, 2011: 1-5.
CLAUSSEN T, NGUYEN V D. Real-time cognitive sonar system with target-optimized adaptive signal processing through multi-layer data fusion[C]//2015 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI). Piscataway, NJ, 2015: 357-361.
LU Shuping, CHEN Yang, CHEN Fangxiang, et al. Cognitive continuous tracking algorithm for centralized multistatic sonar systems[C]//2021 OES China Ocean Acoustics (COA). Piscataway, NJ, 2021: 1021-1026.
HAGUE D A. Adaptive transmit waveform design for active cognitive sonar using multi-tone sinusoidal frequency modulation[J]. The journal of the acoustical society of America, 2022, 151(4): A101-A101.
PAKDEL A O, AMIRI H, RAZZAZI F. Enhanced target detection using a new cognitive sonar waveform design in shallow water[J]. Applied acoustics, 2023, 205: 109270.
ARULKUMARAN K, DEISENROTH M P, BRUNDAGE M, et al. Deep reinforcement learning: a brief survey[J]. IEEE signal processing magazine, 2017, 34(6): 26-38.
WANG Qiang, ZHAN Zhongli. Reinforcement learning model, algorithms and its application[C]//2011 International Conference on Mechatronic Science, Electric Engineering and Computer (MEC). Piscataway, NJ, 2011: 1143-1146.
KOBER J, BAGNELL J A, PETERS J. Reinforcement learning in robotics: a survey[J]. The international journal of robotics research, 2013, 32(11): 1238-1274.
LEE J, NIYATO D, GUAN Yong liang, et al. Learning to schedule joint radar-communication with deep multi-agent reinforcement learning[J]. IEEE transactions on vehicular technology, 2022, 71(1): 406-422.
FU Yubin, MA Xiaochuan, FENG Chao, et al. Model-based optimal action selection for Dyna-Q reverberation suppression cognitive sonar[J]. EURASIP journal on advances in signal processing, 2023, 2023(1): 116.
WISNIEWSKA D M, JOHNSON M, BEEDHOLM K, et al. Acoustic gaze adjustments during active target selection in echolocating porpoises[J]. Journal of experimental biology, 2012, 215(pt 24): 4358-4373.
KAM C, KOMPELLA S, NGUYEN G D, et al. Frequency selection and relay placement for energy efficiency in underwater acoustic networks[J]. IEEE Journal of oceanic engineering, 2013, 39(2): 331-342.
WATKINS C J C H, DAYAN P. Q-learning[J]. Machine learning, 1992, 8: 279-299.
VAN H H, GUEZ A, SILVER D. Deep reinforcement learning with double Q-learning[C]//Proceedings of the AAAI Conference on Artificial Intelligence. Phoenix, 2016: 431-442.
XI Lei, YU Lu, XU Yanchun, et al. A novel multi-agent DDQN-AD method-based distributed strategy for automatic generation control of integrated energy systems[J]. IEEE transactions on sustainable energy, 2020, 11(4): 2417-2426.