CASIA OpenIR

浏览/检索结果: 共39条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:23/5  |  提交时间:2024/06/05
A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning 会议论文
, Padua, Italy, 2022年07月
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Wanmai Yuan
Adobe PDF(2650Kb)  |  收藏  |  浏览/下载:10/2  |  提交时间:2024/06/05
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:23/8  |  提交时间:2024/05/29
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:16/3  |  提交时间:2024/05/28
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:180/40  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Energy Based Optimal Dynamic Stealth False Data Injection Attacks on the Smart Grid 会议论文
Proceedings of 2020 7th International Conference on Information, Cybernetics, and Computational Social Systems, Guangzhou, China, 2020.11.13-15
作者:  Liu, Yifa;  Cheng, Long
Adobe PDF(1191Kb)  |  收藏  |  浏览/下载:123/41  |  提交时间:2023/06/29
smart grid, security, false data injection attack, optimal control  
Optimal defense resource allocation and geographically feasible hexagonal topology construction for power grid security 会议论文
Communications in Computer and Information Science, Hangzhou, China, 2021 22-24 October
作者:  Liu, Yifa;  Cheng, Long
Adobe PDF(756Kb)  |  收藏  |  浏览/下载:124/44  |  提交时间:2023/06/28
Joint Modeling of Document and Label with Clause Interaction Hypergraph for ICD Medical Code Assignment 会议论文
, Padua, Italy, 18-23 July 2022
作者:  Wu HR(吴浩然);  Meng LH(孟令辉);  Xu S(徐爽);  Xu B(徐波)
Adobe PDF(612Kb)  |  收藏  |  浏览/下载:93/36  |  提交时间:2023/06/26
Multi-Granularity Pruning for Model Acceleration on Mobile Devices 会议论文
, 线上, 2022-07
作者:  Zhao TL(赵天理);  Zhang X(张希);  Zhu WT(朱文涛);  Wang JX(王家兴);  Yang S(杨森);  Liu J(刘季);  Cheng J(程健)
Adobe PDF(1919Kb)  |  收藏  |  浏览/下载:112/49  |  提交时间:2023/06/21
Deep Neural Networks  Network Pruning  Structured Pruning  Non-structured Pruning  Single Instruction Multiple Data  
PKD: General Distillation Framework for Object Detectors via Pearson Correlation Coefficient 会议论文
, New Orleans, America, Monday November 28th through Friday December 9th
作者:  Weihan, Cao;  Yifan, Zhang;  Jianfei, Gao;  Anda, Cheng;  Ke, Cheng;  Jian, Cheng
Adobe PDF(2614Kb)  |  收藏  |  浏览/下载:101/27  |  提交时间:2023/06/21
Knowledge Distillation  Model Compression  Object Detection