CASIA OpenIR

Browse/search results: 45 items in total, showing items 1-10

WavDepressionNet: Automatic Depression Level Prediction via Raw Speech Signals (Journal article)
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, Volume: 15, Issue: 1, Pages: 285-296
Authors: Niu, Mingyue; Tao, Jianhua; Li, Yongwei; Qin, Yong; Li, Ya
Views/Downloads: 15/0 | Submitted: 2024/07/03
Keywords: Assessment block; depression level prediction; representation block; speech signals; WavDepressionNet
Emotion selectable end-to-end text-based speech editing (Journal article)
ARTIFICIAL INTELLIGENCE, 2024, Volume: 329, Pages: 16
Authors: Wang, Tao; Yi, Jiangyan; Fu, Ruibo; Tao, Jianhua; Wen, Zhengqi; Zhang, Chu Yuan
Views/Downloads: 21/0 | Submitted: 2024/07/03
Keywords: Emotion selectable; Text-based speech editing; Emotion decoupling; Mask prediction; Few-shot learning; Text-to-speech
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis (Journal article)
IEEE Transactions on Affective Computing, 2023, Volume: 15, Issue: 1, Pages: 1-17
Authors: Licai Sun; Zheng Lian; Bin Liu; Jianhua Tao
Adobe PDF(2371Kb) | Views/Downloads: 81/24 | Submitted: 2024/05/31
Keywords: Transformers; Robustness; Semantics; Data models; Computational modeling; Videos; Training; Multimodal sentiment analysis; unaligned and incomplete data; efficient multimodal Transformer; dual-level feature restoration; robustness
HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition (Journal article)
Information Fusion, 2024, Volume: 108, Pages: 1-20
Authors: Licai Sun; Zheng Lian; Bin Liu; Jianhua Tao
Adobe PDF(2281Kb) | Views/Downloads: 70/17 | Submitted: 2024/05/31
Keywords: Audio-Visual Emotion Recognition; Self-supervised learning; Masked autoencoder; Contrastive learning
GPT-4V with Emotion: A Zero-shot Benchmark for Generalized Emotion Recognition (Journal article)
Information Fusion, 2024, Pages: 1-12
Authors: Zheng Lian; Licai Sun; Haiyang Sun; Kang Chen; Zhuofan Wen; Hao Gu; Bin Liu; Jianhua Tao
Adobe PDF(6888Kb) | Views/Downloads: 79/14 | Submitted: 2024/05/31
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 4, Pages: 2041-2055
Authors: Chen, Yuxin; Zhang, Ziqi; Qi, Zhongang; Yuan, Chunfeng; Wang, Jie; Shan, Ying; Li, Bing; Hu, Weiming; Qie, Xiaohu; Wu, Jianping
Adobe PDF(13765Kb) | Views/Downloads: 59/5 | Submitted: 2024/05/30
Keywords: Chinese video captioning evaluation; dual-reconstruction transformer
Transformer-based Spiking Neural Networks for Multimodal Audio-Visual Classification (Journal article)
IEEE Transactions on Cognitive and Developmental Systems, 2023, Pages: DOI 10.1109/TCDS.2023.3327081
Authors: Guo LY(郭凌月); Zeyu Gao; Jinye Qu; Suiwu Zheng; Runhao Jiang; Yanfeng Lu; Hong Qiao
Adobe PDF(3922Kb) | Views/Downloads: 59/20 | Submitted: 2024/05/28
Social Vision for Intelligent Vehicles: From Computer Vision to Foundation Vision (Journal article)
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, Volume: 8, Issue: 11, Pages: 4474-4476
Authors: Yu, Hui; Wang, Yutong; Tian, Yonglin; Zhang, Hui; Zheng, Wenbo; Wang, Fei-Yue
Adobe PDF(135Kb) | Views/Downloads: 92/17 | Submitted: 2024/03/27
Keywords: Social Vision; Parallel Vision; Knowledge Vision; Foundation Vision; intelligent vehicles; social interaction; sustainability
GAN-Based Facial Attribute Manipulation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 12, Pages: 14590-14610
Authors: Liu, Yunfan; Li, Qi; Deng, Qiyao; Sun, Zhenan; Yang, Ming-Hsuan
Adobe PDF(15297Kb) | Views/Downloads: 115/34 | Submitted: 2024/02/22
Keywords: Generative adversarial networks; image translation; facial attribute manipulation
Enhancing Dimensional Emotion Recognition from Speech through Modulation-Filtered Cochleagram and Parallel Attention Recurrent Network (Journal article)
ELECTRONICS, 2023, Volume: 12, Issue: 22, Pages: 15
Authors: Peng, Zhichao; Zeng, Hua; Li, Yongwei; Du, Yegang; Dang, Jianwu
Views/Downloads: 88/0 | Submitted: 2024/02/22
Keywords: modulation-filtered cochleagram; parallel attention recurrent neural network; dimensional emotion recognition; auditory signal processing; noise-robust