1V1 Close-Range Air Combat Maneuvering Decision-Making Method Based on the PPO Algorithm

2025, Vol. 33, No. 10, pp. 165-173

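Affiliation: Qingdao Campus, Naval Aviation University of the Chinese People's Liberation Army
CLC number: TP393
Funding: Naval Aviation University of the Chinese People's Liberation Army Fund (H3202204022).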



Abstract:

Close-range air combat involves many combat elements, rapidly changing situations, and a tense combat atmosphere, and its decision-making methods are a hot research topic in artificial intelligence. Most current research on close-range air combat algorithms is conducted in simplified low-fidelity scenarios or on existing simulation platforms. Constrained by the complexity of the real problem and by simulation efficiency, these studies mostly simplify the air combat decision model, which reduces the reference value of their results. To address this problem, a visual air combat platform that meets the research requirements was built on Unity3D, and a set of aircraft maneuver actions was designed. Situation assessment functions and reward functions were defined according to the characteristics of the friendly-enemy situation in air-to-air combat. On this basis, a one-on-one close-range air combat decision-making framework based on the proximal policy optimization (PPO) algorithm was constructed. Experimental results show that the decision model can drive the agent to make flexible maneuver decisions according to the battlefield situation and exhibits strong autonomous decision-making capability, which verifies the effectiveness of the method.
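
For readers unfamiliar with proximal policy optimization, the clipped surrogate objective at the core of the standard PPO algorithm can be sketched as below. This is a minimal illustration of the generic technique, not the authors' implementation: the function name `ppo_clip_objective`, its parameters, and the default `clip_eps=0.2` are assumptions made for this example, and the abstract does not state which hyperparameters or reward terms the paper actually uses.

```python
import math


def ppo_clip_objective(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective of standard PPO (to be maximized).

    All names and the default clip_eps are illustrative assumptions,
    not values taken from the paper.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio pi_new(a|s) / pi_old(a|s), computed from log-probs.
        ratio = math.exp(new_lp - old_lp)
        # Keep the ratio inside [1 - clip_eps, 1 + clip_eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the pessimistic (smaller) of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)


# Hypothetical batch of two maneuver decisions.
new_lp = [math.log(0.6), math.log(0.1)]
old_lp = [math.log(0.4), math.log(0.2)]
adv = [1.0, -1.0]
objective = ppo_clip_objective(new_lp, old_lp, adv)
```

In this hypothetical batch the first ratio (1.5) is clipped to 1.2 and the second (0.5) to 0.8, so the objective comes out at roughly 0.2. The clipping limits how far a single update can move the policy away from the one that collected the data, which is the property that makes PPO a common choice for simulation-trained agents.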


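Cite this article

Zhou Qidong, Jiang Zhidong, Huo Liping, Zhao Dongmei. 1V1 close-range air combat maneuvering decision-making method based on the PPO algorithm[J]. Computer Measurement & Control, 2025, 33(10): 165-173.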


History

Received: 2024-09-21
Last revised: 2024-10-31
Accepted: 2024-11-06
Published online: 2025-10-27
