A Survey of Decision Transformer Methods for Intelligent Gaming - China Academic Journals Full-text Database

【Author】 LUO Junren;ZHANG Wanpeng;SU Jiongming;WANG Yao;CHEN Jing;College of Intelligence Science and Technology, National University of Defense Technology;

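【Corresponding author】 College of Intelligence Science and Technology, National University of Defense Technology; 【Summary】 Intelligent gaming is a challenging problem in cognitive decision-making intelligence and a key support for joint operations planning and intelligent mission planning. This survey reviews intelligent gaming models from three angles: cooperative team games, competitive zero-sum games, and mixed general-sum games. From a cognitive perspective, it defines four classes of cognitive models for intelligent gaming: operations-research games (complete or bounded rationality), uncertain games (experience/knowledge), emergent exploratory games (intuition plus inspiration), and population interactive games (co-evolution). It presents solution approaches from three viewpoints: trustworthy solutions, strategy training platforms, and problem-solving paradigms. Building on the Transformer architecture, it then organizes decision Transformer methods into two major categories with six subcategories: architecture enhancement (representation learning, network combination, model extension) and sequence modeling (offline pre-training, online adaptation, model extension). The work offers an initial reference for building decision-making pre-trained models under the "offline pre-training + online adaptation" paradigm for multi-agent, multi-task, multimodal, and sim-to-real transfer scenarios, and a feasible reference for research on decision-making foundation models in intelligent gaming.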

【Abstract】 Intelligent gaming is a challenging problem in the field of cognitive decision-making intelligence, and it is a key support for joint combat planning and intelligent mission planning. Intelligent gaming models are reviewed from three perspectives: collaborative team games, competitive zero-sum games, and mixed general-sum games. Four kinds of cognitive models of intelligent gaming are defined from the perspective of cognition: operational games (complete or bounded rationality), uncertain games (experience/knowledge), emerging exploratory games (intuition and inspiration), and population interactive games (co-evolution). Solutions for intelligent gaming are given from three perspectives: trustworthy solutions, benchmark learning methods, and strategy training platforms. Based on the Transformer framework, decision-making Transformer methods are then analyzed in two groups: architecture enhancement (representation learning, network combination, model extension) and sequence modeling (offline pre-training, online adaptation, model extension). This research provides an initial reference for constructing decision-making pre-trained models for multi-agent, multi-task, multimodal, and sim-to-real transfer scenarios under the paradigm of "offline pre-training + online adaptation". It is expected to provide a feasible reference for research on decision-making foundation models in the field of intelligent gaming.

【Key words】 intelligent gaming; intelligent planning and decision-making; cognitive modeling; offline pre-training; online adaptation; decision-making foundation model; 【Funding】 Supported by the National Natural Science Foundation of China (61806212) and the Hunan Provincial Postgraduate Innovation Project (CX20210011).
