CASIA OpenIR

浏览/检索结果: 共25条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:31/9  |  提交时间:2024/05/31
Learning and Controlling Multiscale Dynamics in Spiking Neural Networks Using Recursive Least Square Modifications 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 页码: 14
作者:  Wei, Qinglai;  Han, Liyuan;  Zhang, Tielin
Adobe PDF(8060Kb)  |  收藏  |  浏览/下载:54/8  |  提交时间:2024/03/27
Direct dynamic programming (DDP)  Lorenz system  multiscale dynamics  point-to-point control  recursive least square (RLS)  spiking neural network (SNN)  
Everybody’s Talkin’: Let Me Talk as You Want 期刊论文
IEEE Transactions on Information Forensics and Security, 2022, 卷号: 17, 期号: 1, 页码: 585 - 598
作者:  宋林森;  吴文岩;  钱晨;  赫然;  Loy, Chen Change
Adobe PDF(15432Kb)  |  收藏  |  浏览/下载:88/11  |  提交时间:2023/06/29
Talking face generation  Video generation  GAN  Audio dubbing  
Topic-Oriented Dialogue Summarization 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023, 卷号: 31, 页码: 1797 - 1810
作者:  Lin, Haitao;  Zhu, Junnan;  Xiang, Lu;  Zhai, Feifei;  Zhou, Yu;  Zhang, Jiajun;  Zong, Chengqing
Adobe PDF(3037Kb)  |  收藏  |  浏览/下载:225/86  |  提交时间:2023/06/13
dialogue summarization  abstractive summarization  controllable text generation  natural language processing  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:228/0  |  提交时间:2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire 期刊论文
Signal Processing Letters, 2022, 页码: 1551-1554
作者:  Fan ZY(范志赟);  Dong LH(董林昊);  Cai M(蔡猛);  Ma ZJ(马泽君);  Xu B(徐波)
Adobe PDF(404Kb)  |  收藏  |  浏览/下载:180/42  |  提交时间:2022/09/17
Decoupled Representation Learning for Character Glyph Synthesis 期刊论文
IEEE Transactions on Multimedia, 2021, 卷号: 2021, 期号: 2021, 页码: 1-13
作者:  Xiyan Liu;  Gaofeng Meng;  Jianlong Chang;  Ruiguang Hu;  Shiming Xiang;  Chunhong Pan
Adobe PDF(4588Kb)  |  收藏  |  浏览/下载:198/48  |  提交时间:2022/01/24
Character glyph synthesis  Decoupled representation  generative adversarial networks  
A time-frequency channel attention and vectorization network for automatic depression level prediction 期刊论文
Neurocomputing, 2021, 期号: 450, 页码: 208-218
作者:  Mingyue Niu;  Bin Liu;  Jianhua Tao;  Qifei Li
Adobe PDF(2001Kb)  |  收藏  |  浏览/下载:194/54  |  提交时间:2021/06/01
Sphere embedding normalization  DenseNet  Transition layer  Time-frequency channel attention block  Time-frequency vectorization block  Depression detection  
Object Reconstruction Based on Attentive Recurrent Network from Single and Multiple Images 期刊论文
NEURAL PROCESSING LETTERS, 2021, 期号: 53, 页码: 18
作者:  Gao, Zishu;  Li, En;  Wang, Zhe;  Yang, Guodong;  Lu, Jiwu;  Ouyang, Bo;  Xu, Dawei;  Liang, Zize
Adobe PDF(1338Kb)  |  收藏  |  浏览/下载:284/58  |  提交时间:2021/03/01
Object reconstruction  Convolutional LSTM  Visual attention  Robotic application  
Monaural speech separation based on MAXVQ and CASA for robust speech recognition 期刊论文
COMPUTER SPEECH AND LANGUAGE, 2010, 卷号: 24, 期号: 1, 页码: 30-44
作者:  Li, Peng;  Guan, Yong;  Wang, Shijin;  Xu, Bo;  Liu, Wenju
收藏  |  浏览/下载:67/0  |  提交时间:2020/10/27
Monaural Speech Separation  Computational Auditory Scene Analysis (Casa)  Factorial-max Vector Quantization (Maxvq)  Automatic Speech Recognition (Asr)