CASIA OpenIR

Browse/Search results: 34 items in total, showing items 1-10

Modal Contrastive Learning Based End-to-End Text Image Machine Translation (Journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, Volume: 32, Issue: 32, Pages: 2153-2165
Authors:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  Views/Downloads: 29/15  |  Submitted: 2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis (Journal article)
IEEE Transactions on Affective Computing, 2023, Volume: 15, Issue: 1, Pages: 1-17
Authors:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  Views/Downloads: 65/18  |  Submitted: 2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
Triple Robustness Augmentation Local Features for multi-source image registration (Journal article)
ISPRS Journal of Photogrammetry and Remote Sensing, 2023, Volume: 199, Issue: 0, Pages: 1-14
Authors:  Changwei Wang;  Lele Xu;  Rongtao Xu;  Shibiao Xu;  Weiliang Meng;  Ruisheng Wang;  Xiaopeng Zhang
Adobe PDF(6581Kb)  |  Views/Downloads: 49/14  |  Submitted: 2024/05/29
Pixel-Wise Grasp Detection via Twin Deconvolution and Multi-Dimensional Attention (Journal article)
IEEE Transactions on Circuits and Systems for Video Technology, 2023, Volume: 33, Issue: 8, Pages: 4002-4010
Authors:  Ren, Guangli;  Geng, Wenjie;  Guan, Peiyu;  Cao, Zhiqiang;  Yu, Junzhi
Adobe PDF(4013Kb)  |  Views/Downloads: 43/14  |  Submitted: 2024/05/28
Transformer-based Spiking Neural Networks for Multimodal Audio-Visual Classification (Journal article)
IEEE Transactions on Cognitive and Developmental Systems, 2023, Pages: DOI 10.1109/TCDS.2023.3327081
Authors:  Guo LY(郭凌月);  Zeyu Gao;  Jinye Qu;  Suiwu Zheng;  Runhao Jiang;  Yanfeng Lu;  Hong Qiao
Adobe PDF(3922Kb)  |  Views/Downloads: 45/14  |  Submitted: 2024/05/28
Social Vision for Intelligent Vehicles: From Computer Vision to Foundation Vision (Journal article)
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, Volume: 8, Issue: 11, Pages: 4474-4476
Authors:  Yu, Hui;  Wang, Yutong;  Tian, Yonglin;  Zhang, Hui;  Zheng, Wenbo;  Wang, Fei-Yue
Adobe PDF(135Kb)  |  Views/Downloads: 78/12  |  Submitted: 2024/03/27
Social Vision  Parallel Vision  Knowledge Vision  Foundation Vision  intelligent vehicles  social interaction  sustainability  
Subband fusion of complex spectrogram for fake speech detection (Journal article)
SPEECH COMMUNICATION, 2023, Volume: 155, Pages: 8
Authors:  Fan, Cunhang;  Xue, Jun;  Dong, Shunbo;  Ding, Mingming;  Yi, Jiangyan;  Li, Jinpeng;  Lv, Zhao
Views/Downloads: 57/0  |  Submitted: 2024/03/26
Automatic speaker verification  Complex spectrogram  Fake speech detection  Phase information  Subband  
GAN-Based Facial Attribute Manipulation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 12, Pages: 14590-14610
Authors:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  Views/Downloads: 99/27  |  Submitted: 2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Enhancing Dimensional Emotion Recognition from Speech through Modulation-Filtered Cochleagram and Parallel Attention Recurrent Network (Journal article)
ELECTRONICS, 2023, Volume: 12, Issue: 22, Pages: 15
Authors:  Peng, Zhichao;  Zeng, Hua;  Li, Yongwei;  Du, Yegang;  Dang, Jianwu
Views/Downloads: 82/0  |  Submitted: 2024/02/22
modulation-filtered cochleagram  parallel attention recurrent neural network  dimensional emotion recognition  auditory signal processing  noise-robust  
A Graded Assessment System for Parkinson's Upper-Limb Bradykinesia Based on a Temporal Convolutional Network Model (Journal article)
IEEE SENSORS JOURNAL, 2023, Volume: 23, Issue: 23, Pages: 29283-29292
Authors:  Tong, Lina;  Liu, Dai-Song;  Peng, Liang;  Hao, Hong-Lin;  Wang, Chen
Adobe PDF(9425Kb)  |  Views/Downloads: 74/9  |  Submitted: 2024/02/21
Bradykinesia grade  inertial sensors  Parkinson's disease (PD)  temporal convolutional network (TCN)  wearable device