CASIA OpenIR

Browse/Search results: 22 items in total, items 1-10 shown

Modal Contrastive Learning Based End-to-End Text Image Machine Translation (Journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, Volume: 32, Issue: 32, Pages: 2153-2165
Authors:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF (6551Kb)  |  Views/Downloads: 15/7  |  Submitted: 2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Stage-Aware Hierarchical Attentive Relational Network for Diagnosis Prediction (Journal article)
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023, Volume: 36, Issue: 4, Pages: 1773-1784
Authors:  Liping Wang;  Qiang Liu;  Mengqi Zhang;  Yaxuan Hu;  Shu Wu;  Liang Wang
Adobe PDF (2088Kb)  |  Views/Downloads: 14/5  |  Submitted: 2024/06/21
Medical diagnostic imaging  Knowledge graphs  Ontologies  Codes  Data models  Predictive models  Graph neural networks  Diagnosis prediction  electronic health record  knowledge graph  relational graph neural network  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis (Journal article)
IEEE Transactions on Affective Computing, 2023, Volume: 15, Issue: 1, Pages: 1-17
Authors:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF (2371Kb)  |  Views/Downloads: 46/15  |  Submitted: 2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models (Journal article)
ACM TRANSACTIONS ON GRAPHICS, 2023, Volume: 42, Issue: 6, Pages: 14
Authors:  Zhang, Yuxin;  Dong, Weiming;  Tang, Fan;  Huang, Nisha;  Huang, Haibin;  Ma, Chongyang;  Lee, Tong-Yee;  Deussen, Oliver;  Xu, Changsheng
Views/Downloads: 52/0  |  Submitted: 2024/03/26
Image generation  Diffusion models  Attribute-aware editing  Model personalization  
Coarse Mask Guided Interactive Object Segmentation (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 5808-5822
Authors:  Li, Jing;  Fan, Junsong;  Wang, Yuxi;  Yang, Yuran;  Zhang, Zhaoxiang
Adobe PDF (4323Kb)  |  Views/Downloads: 51/1  |  Submitted: 2024/02/22
Segmentation  interactive  transformer  annotation tool  
GAN-Based Facial Attribute Manipulation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 12, Pages: 14590-14610
Authors:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF (15297Kb)  |  Views/Downloads: 88/23  |  Submitted: 2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Robust Video-Text Retrieval Via Noisy Pair Calibration (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, Volume: 25, Pages: 8632-8645
Authors:  Zhang, Huaiwen;  Yang, Yang;  Qi, Fan;  Qian, Shengsheng;  Xu, Changsheng
Views/Downloads: 50/0  |  Submitted: 2024/02/22
Noise calibration  uncertainty  video text retrieval  
A New Model for Emotion-Driven Behavior Extraction from Text (Journal article)
APPLIED SCIENCES-BASEL, 2023, Volume: 13, Issue: 15, Pages: 16
Authors:  Sun, Yawei;  He, Saike;  Han, Xu;  Zhang, Ruihua
Views/Downloads: 85/0  |  Submitted: 2023/11/17
emotion analysis  emotion-driven behavior  dataset  prompt paradigm  
Large sequence models for sequential decision-making: a survey (Journal article)
FRONTIERS OF COMPUTER SCIENCE, 2023, Volume: 17, Issue: 6, Pages: 18
Authors:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF (1351Kb)  |  Views/Downloads: 138/1  |  Submitted: 2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 3622-3633
Authors:  Liu, Chong;  Zhang, Yuqi;  Wang, Hongsong;  Chen, Weihua;  Wang, Fan;  Huang, Yan;  Shen, Yi-Dong;  Wang, Liang
Views/Downloads: 136/0  |  Submitted: 2023/11/17
Image-text retrieval  multimodal transformer  multimodal contrastive training