CASIA OpenIR

Browse/Search results: 60 items in total, showing items 1-10

TaiSu: A 166M Large-scale High-Quality Dataset for Chinese Vision-Language Pre-training (Conference paper)
New Orleans Convention Center, America, 2022-11-28 to 2022-12-9
Authors: Yulong Liu; Guibo Zhu; Bin Zhu; Qi Song; Guojing Ge; Haoran Chen; Guanhui Qiao; Ru Peng; Lingxiang Wu; Jinqiao Wang
Adobe PDF(2408Kb)  |  Views/Downloads: 35/6  |  Submitted: 2024/06/06
BFRFormer: Transformer-based generator for Real-World Blind Face Restoration (Conference paper)
Seoul, Korea, 2024-04-14 to 2024-04-19
Authors: Guojing Ge; Qi Song; Guibo Zhu; Yuting Zhang; Jinglu Chen; Miao Xin; Ming Tang; Jinqiao Wang
Adobe PDF(6872Kb)  |  Views/Downloads: 37/8  |  Submitted: 2024/06/06
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (Conference paper)
New Orleans, Louisiana & Online, 2022-11-28
Authors: Chen, Zhiyang; Zhu, Yousong; Li, Zhaowen; Yang, Fan; Li, Wei; Wang, Haixin; Zhao, Chaoyang; Wu, Liwei; Zhao, Rui; Wang, Jinqiao; Tang, Ming
Adobe PDF(1289Kb)  |  Views/Downloads: 33/8  |  Submitted: 2024/05/28
Keywords: transformer; general visual framework; sequence prediction; multi-task
MENet: A Memory-Based Network with Dual-Branch for Efficient Event Stream Processing (Conference paper)
Tel Aviv, 2022-6
Authors: Linhui Sun; Yifan Zhang; Ke Cheng; Jian Cheng; Hanqing Lu
Adobe PDF(1728Kb)  |  Views/Downloads: 125/29  |  Submitted: 2024/01/22
Keywords: Event-based model; Dual-branch structure; Memory bank
WL-MSR: Watch and Listen for Multimodal Subtitle Recognition (Conference paper)
Greece, 2023-6-4
Authors: Liu, Jiawei; Wang, Hao; Wang, Weining; He, Xingjian; Liu, Jing
Adobe PDF(1673Kb)  |  Views/Downloads: 172/39  |  Submitted: 2023/07/06
ED-T2V: An Efficient Training Framework for Diffusion-based Text-to-Video Generation (Conference paper)
Queensland, Australia, 2023-6-18
Authors: Liu, Jiawei; Wang, Weining; Liu, Wei; He, Qian; Liu, Jing
Adobe PDF(4537Kb)  |  Views/Downloads: 208/44  |  Submitted: 2023/05/04
Decoupling GCN with DropGraph Module for Skeleton-Based Action Recognition (Conference paper)
Online, 2020-8
Authors: Ke Cheng; Yifan Zhang; Congqi Cao; Lei Shi; Jian Cheng; Hanqing Lu
Adobe PDF(2350Kb)  |  Views/Downloads: 209/57  |  Submitted: 2022/06/27
Keywords: skeleton-based action recognition; decoupling GCN; DropGraph
HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering (Conference paper)
Online, 2021-10
Authors: Liu, Fei; Liu, Jing; Wang, Weining; Lu, Hanqing
Adobe PDF(1174Kb)  |  Views/Downloads: 213/48  |  Submitted: 2022/06/15
Erasing-based Attention Learning for Visual Question Answering (Conference paper)
Nice, France, 2019-10
Authors: Liu, Fei; Liu, Jing; Hong, Richang; Lu, Hanqing
Adobe PDF(2319Kb)  |  Views/Downloads: 188/57  |  Submitted: 2022/06/15
Language and Visual Relations Encoding for Visual Question Answering (Conference paper)
Taiwan, China, 2019-9
Authors: Liu, Fei; Liu, Jing; Fang, Zhiwei; Lu, Hanqing
Adobe PDF(694Kb)  |  Views/Downloads: 166/62  |  Submitted: 2022/06/15