CASIA OpenIR

浏览/检索结果: 共277条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 页码: 1-15
作者:  Sun, Jianxin;  Deng, Qiyao;  Li, Qi;  Sun, Muyi;  Liu, Yunfan;  Sun, Zhenan
Adobe PDF(16839Kb)  |  收藏  |  浏览/下载:35/7  |  提交时间:2024/02/23
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:28/9  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:78/11  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
A Closer Look at Self-Supervised Lightweight Vision Transformers 会议论文
, Honolulu, Hawaii, USA, 2023-7
作者:  Wang, Shaoru;  Gao, Jin;  Li, Zeming;  Zhang, Xiaoqin;  Weiming, Hu
Adobe PDF(3478Kb)  |  收藏  |  浏览/下载:172/60  |  提交时间:2023/09/20
Vision Transformer  Self-supervised Learning  Lightweight Networks  Knowledge Distillation  
A Multi-Modal Neural Geometric Solver with Textual Clauses Parsed from Diagram 会议论文
, 中国 澳门, 2023-7-19
作者:  Zhang Ming-Liang;  Yin Fei;  Liu Cheng-Lin
Adobe PDF(1110Kb)  |  收藏  |  浏览/下载:27/9  |  提交时间:2024/04/03
RSDet++: Point-based modulated loss for more accurate rotated object detection 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 卷号: 32, 期号: 11, 页码: 7869-7879
作者:  Wen Qian;  Xue Yang;  Silong Peng;  Xiujuan Zhang;  Junchi Yan
Adobe PDF(6998Kb)  |  收藏  |  浏览/下载:156/62  |  提交时间:2023/06/07
Contrastive Adversarial Training for Multi-Modal Machine Translation 期刊论文
ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, 卷号: 22, 期号: 6, 页码: 157:1-18
作者:  Huang X(黄鑫);  Zhang JJ(张家俊);  Zong CQ(宗成庆)
Adobe PDF(2387Kb)  |  收藏  |  浏览/下载:168/55  |  提交时间:2023/06/26
contrastive learning  adversarial training  multi-modal machine translation  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering 期刊论文
IEEE Transactions on Multimedia, 2023, 页码: 1-15
作者:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  收藏  |  浏览/下载:150/41  |  提交时间:2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:111/20  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Multimodal motor imagery decoding method based on temporal spatial feature alignment and fusion 期刊论文
Journal of Neural Engineering, 2023, 卷号: 20, 期号: 2, 页码: 026009
作者:  Zhang,Yukun;  Qiu,Shuang;  He,Huiguang
Adobe PDF(16882Kb)  |  收藏  |  浏览/下载:113/20  |  提交时间:2023/05/31
brain-computer interface  motor imagery  multimodal  EEG-fNIRS  center loss