CASIA OpenIR

浏览/检索结果: 共39条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:29/15  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 卷号: 15, 期号: 1, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:65/18  |  提交时间:2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  收藏  |  浏览/下载:52/19  |  提交时间:2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
Improved Video Emotion Recognition with Alignment of CNN and Human Brain Representations 期刊论文
IEEE Transactions on Affective Computing, 2023, 页码: 1-15
作者:  Fu, Kaicheng;  Du, Changde;  Wang, Shengpei;  He, Huiguang
Adobe PDF(3907Kb)  |  收藏  |  浏览/下载:66/23  |  提交时间:2024/05/28
CNN-brain Alignment  Brain-guided Deep Learning  Video Emotion Recognition  Representation Similarity Analysis  
Pixel-Wise Grasp Detection via Twin Deconvolution and Multi-Dimensional Attention 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 卷号: 33, 期号: 8, 页码: 4002-4010
作者:  Ren, Guangli;  Geng, Wenjie;  Guan, Peiyu;  Cao, Zhiqiang;  Yu, Junzhi
Adobe PDF(4013Kb)  |  收藏  |  浏览/下载:43/14  |  提交时间:2024/05/28
Transformer-based Spiking Neural Networks for Multimodal Audio-Visual Classification 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2023, 页码: DOI 10.1109/TCDS.2023.3327081
作者:  Guo LY(郭凌月);  Zeyu Gao;  Jinye Qu;  Suiwu Zheng;  Runhao Jiang;  Yanfeng Lu;  Hong Qiao
Adobe PDF(3922Kb)  |  收藏  |  浏览/下载:45/14  |  提交时间:2024/05/28
Social Vision for Intelligent Vehicles: From Computer Vision to Foundation Vision 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 卷号: 8, 期号: 11, 页码: 4474-4476
作者:  Yu, Hui;  Wang, Yutong;  Tian, Yonglin;  Zhang, Hui;  Zheng, Wenbo;  Wang, Fei-Yue
Adobe PDF(135Kb)  |  收藏  |  浏览/下载:77/12  |  提交时间:2024/03/27
Social Vision  Parallel Vision  Knowledge Vision  Foundation Vision  intelligent vehicles  social interaction  sustainability  
Topographic representation of visually evoked emotional experiences in the human cerebral cortex 期刊论文
iScience, 2023, 卷号: 26, 期号: 9, 页码: 1-18
作者:  Du, Changde;  Fu, Kaicheng;  Wen, Bincheng;  He, Huiguang
Adobe PDF(7257Kb)  |  收藏  |  浏览/下载:88/19  |  提交时间:2024/03/26
Subband fusion of complex spectrogram for fake speech detection 期刊论文
SPEECH COMMUNICATION, 2023, 卷号: 155, 页码: 8
作者:  Fan, Cunhang;  Xue, Jun;  Dong, Shunbo;  Ding, Mingming;  Yi, Jiangyan;  Li, Jinpeng;  Lv, Zhao
收藏  |  浏览/下载:57/0  |  提交时间:2024/03/26
Automatic speaker verification  Complex spectrogram  Fake speech detection  Phase information  Subband  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:99/27  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation