CASIA OpenIR

Browse/search results: 33 items in total, showing items 1-10

Modal Contrastive Learning Based End-to-End Text Image Machine Translation (Journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, Volume: 32, Issue: 32, Pages: 2153-2165
Authors:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF (6551 Kb)  |  Views/Downloads: 32/17  |  Submitted: 2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis (Journal article)
IEEE Transactions on Affective Computing, 2023, Volume: 15, Issue: 1, Pages: 1-17
Authors:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF (2371 Kb)  |  Views/Downloads: 66/18  |  Submitted: 2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
Coarse Mask Guided Interactive Object Segmentation (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 5808-5822
Authors:  Li, Jing;  Fan, Junsong;  Wang, Yuxi;  Yang, Yuran;  Zhang, Zhaoxiang
Adobe PDF (4323 Kb)  |  Views/Downloads: 73/5  |  Submitted: 2024/02/22
Segmentation  Interactive  Transformer  Annotation Tool  
GAN-Based Facial Attribute Manipulation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 12, Pages: 14590-14610
Authors:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF (15297 Kb)  |  Views/Downloads: 102/28  |  Submitted: 2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Physics-Based Modeling and Fluttering Dynamic Process Simulation for Catkins (Journal article)
FORESTS, 2023, Volume: 14, Issue: 12, Pages: 28
Authors:  Zhang, Jiaxiu;  Yang, Meng;  Xi, Benye;  Duan, Jie;  Huang, Qingqing;  Meng, Weiliang
Adobe PDF (10070 Kb)  |  Views/Downloads: 86/16  |  Submitted: 2024/02/22
clustering  catkin modeling  wind field simulation  catkin control  
Exploring Explicitly Disentangled Features for Domain Generalization (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 11, Pages: 6360-6373
Authors:  Li, Jingwei;  Li, Yuan;  Wang, Huanjie;  Liu, Chengbao;  Tan, Jie
Adobe PDF (2432 Kb)  |  Views/Downloads: 132/14  |  Submitted: 2023/12/21
Domain generalization  feature disentanglement  Fourier transform  data augmentation  
IDO: Instance dual-optimization for weakly supervised object detection (Journal article)
APPLIED INTELLIGENCE, 2023, Pages: 18
Authors:  Ren, Zhida;  Tang, Yongqiang;  Zhang, Wensheng
Adobe PDF (3668 Kb)  |  Views/Downloads: 76/7  |  Submitted: 2023/11/17
Deep learning  Weakly supervised learning  Object detection  Multiple instance learning  
CSIR: Cascaded Sliding CVAEs With Iterative Socially-Aware Rethinking for Trajectory Prediction (Journal article)
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, Pages: 13
Authors:  Zhou, Hao;  Yang, Xu;  Ren, Dongchun;  Huang, Hai;  Fan, Mingyu
Adobe PDF (4364 Kb)  |  Views/Downloads: 101/2  |  Submitted: 2023/11/17
Cascaded prediction  sliding sequence prediction  iterative social-aware rethinking  trajectory prediction  
Dual feature enhanced video super-resolution network based on low-light scenarios (Journal article)
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, Volume: 115, Pages: 8
Authors:  Zhang, Huan;  Cao, Yihao;  Cai, Jianghui;  Cai, Xingjuan;  Zhang, Wensheng
Views/Downloads: 106/0  |  Submitted: 2023/11/17
Video super-resolution (VSR)  Feature enhancement  Information re-fusion  Attention mechanism  
Reducing Vision-Answer Biases for Multiple-Choice VQA (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 4621-4634
Authors:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF (2684 Kb)  |  Views/Downloads: 94/7  |  Submitted: 2023/11/17
Multiple-choice VQA  vision-answer bias  causal intervention  counterfactual interaction learning