CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共22条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
GCC-Speaker: Target Speaker Localization with Optimal Speaker-Dependent Weighting in Multi-Speaker Scenarios 会议论文
, 希腊罗得岛, 2023年6月
作者:  Li GJ(李冠君);  Liu WJ(刘文举);  Yi JY(易江燕);  Tao JH(陶建华)
Adobe PDF(3463Kb)  |  收藏  |  浏览/下载:38/12  |  提交时间:2024/06/06
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 卷号: 15, 期号: 1, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:66/18  |  提交时间:2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition 期刊论文
Information Fusion, 2024, 卷号: 108, 页码: 1-20
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2281Kb)  |  收藏  |  浏览/下载:53/12  |  提交时间:2024/05/31
Audio-Visual Emotion Recognition  Self-supervised learning  Masked autoencoder  Contrastive learning  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:246/0  |  提交时间:2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
Continual Learning for Fake Audio Detection 会议论文
, 线上(捷克), 2021-9
作者:  Ma Haoxin;  Yi Jiangyan;  Tao Jianhua;  Bai Ye;  Tian Zhengkun;  Wang Chenglong
Adobe PDF(2113Kb)  |  收藏  |  浏览/下载:274/69  |  提交时间:2022/06/20
fake audio detection  continual learning  detecting fake without forgetting  
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition 会议论文
, shanghai, 2020
作者:  Ye Bai;  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Zhengqi Wen;  Shuai Zhang
Adobe PDF(801Kb)  |  收藏  |  浏览/下载:142/38  |  提交时间:2021/06/25
A time-frequency channel attention and vectorization network for automatic depression level prediction 期刊论文
Neurocomputing, 2021, 期号: 450, 页码: 208-218
作者:  Mingyue Niu;  Bin Liu;  Jianhua Tao;  Qifei Li
Adobe PDF(2001Kb)  |  收藏  |  浏览/下载:210/59  |  提交时间:2021/06/01
Sphere embedding normalization  DenseNet  Transition layer  Time-frequency channel attention block  Time-frequency vectorization block  Depression detection  
Evaluation of Linear Regression for Speaker Adaptation in HMM-Based Articulatory Movements Estimation 会议论文
, Brisbane,Australia, Apr.19-24,2015
作者:  Hao Li;  Jianhua Tao;  Yang Wang
收藏  |  浏览/下载:11/0  |  提交时间:2020/10/27
Hierarchical stress modeling and generation in mandarin for expressive Text-to-Speech 期刊论文
SPEECH COMMUNICATION, 2015, 卷号: 72, 页码: 59-73
作者:  Li, Ya;  Tao, Jianhua;  Hirose, Keikichi;  Xu, Xiaoying;  Lai, Wei
收藏  |  浏览/下载:96/0  |  提交时间:2020/10/27
Prosody  Stress  Hierarchical Modeling  Fujisaki Model  Speech Synthesis  
Investigating Deep Neural Network Adaptation for Generating Exclamatory and Interrogative Speech in Mandarin 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 卷号: 90, 期号: 7, 页码: 1039-1052
作者:  Zheng, Yibin;  Li, Ya;  Wen, Zhengqi;  Liu, Bin;  Tao, Jianhua;  Jianhua Tao
收藏  |  浏览/下载:126/0  |  提交时间:2020/10/27
Speech Synthesis  Excitation Parameters  Deep Neural Network Adaptation  Exclamatory Speech  Interrogative Speech