CASIA OpenIR

浏览/检索结果: 共185条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文
, Turin, Italy, 2023.9.18-2023.9.22
作者:  Meng Linghui;  Xiong Xuantang;  Zang Yifan;  Zhang Xi;  Li Guoqi;  Xing Dengpeng;  Xu Bo
Adobe PDF(841Kb)  |  收藏  |  浏览/下载:3/1  |  提交时间:2024/06/11
Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach 会议论文
, Singapore, 2023/8/24-27
作者:  Chen,Shuo;  Yang,Ning;  Zhang,Meng;  Wang,Jun
Adobe PDF(1413Kb)  |  收藏  |  浏览/下载:25/5  |  提交时间:2024/06/05
FireFly: A High-Throughput Hardware Accelerator for Spiking Neural Networks With Efficient DSP and Memory Optimization 期刊论文
IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2023, 页码: 1178 - 1191
作者:  Li, Jindong;  Shen, Guobin;  Zhao, Dongcheng;  Zhang, Qian;  Zeng, Yi
Adobe PDF(5840Kb)  |  收藏  |  浏览/下载:12/0  |  提交时间:2024/06/05
SECAD-Net: Self-Supervised CAD Reconstruction by Learning Sketch-Extrude Operations 会议论文
, Vancouver, BC, Canada, 2023-6-17至2023-6-24
作者:  Li, Pu;  Guo, Jianwei;  Zhang, Xiaopeng;  Yan, Dong-Ming
Adobe PDF(9384Kb)  |  收藏  |  浏览/下载:7/0  |  提交时间:2024/06/03
Constrained-cost adaptive dynamic programming for optimal control of discrete-time nonlinear systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 卷号: 35, 期号: 3, 页码: 3251 - 3264
作者:  Wei, Qinglai;  Li, Tao
Adobe PDF(8471Kb)  |  收藏  |  浏览/下载:19/7  |  提交时间:2024/05/28
Adaptive dynamic programming  approximate dynamic programming  constrained cost  optimal control  reinforcement learning  
Explainable Reinforcement Learning via a Causal World Model 会议论文
Proceedings of the 32nd International Joint Conference on Artificial Intelligence, 中国澳门, 2023-08-22
作者:  Yu ZY(余忠蔚);  Ruan JQ(阮景晴);  Xing DP(邢登鹏)
Adobe PDF(850Kb)  |  收藏  |  浏览/下载:14/6  |  提交时间:2024/05/28
强化学习  可解释人工智能  因果推理  
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  收藏  |  浏览/下载:14/3  |  提交时间:2024/05/28
Parallel Learning Based Foundation Model for Networked Traffic Signal Control 会议论文
, Bilbao, Bizkaia, Spain, 2022-9-24
作者:  Zhao, Chen;  Dai, Xingyuan;  Chen, Yuanyuan;  Yilun, Lin;  Lv, Yisheng;  Wang, Fei-Yue
Adobe PDF(1112Kb)  |  收藏  |  浏览/下载:8/4  |  提交时间:2024/05/28
基于自适应噪声的最大熵进化强化学习方法 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 1, 页码: 54-66
作者:  王君逸;  王志;  李华雄;  陈春林
Adobe PDF(6435Kb)  |  收藏  |  浏览/下载:22/6  |  提交时间:2024/05/09
深度强化学习  进化策略  进化强化学习  最大熵  自适应噪声  
基于单声矢量传声器虚拟扩展的多机动声目标跟踪算法 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 2, 页码: 383-398
作者:  张君;  鲍明;  赵静;  陈志菲;  杨建华
Adobe PDF(5403Kb)  |  收藏  |  浏览/下载:17/8  |  提交时间:2024/05/09
声矢量传声器  高阶累积量  虚拟扩展  广义标签多伯努利滤波  多目标跟踪