CASIA OpenIR

Browse/Search results: 32 items in total, showing items 1-10

Topic-Oriented Dialogue Summarization (Journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023, Volume: 31, Pages: 1797-1810
Authors: Lin, Haitao; Zhu, Junnan; Xiang, Lu; Zhai, Feifei; Zhou, Yu; Zhang, Jiajun; Zong, Chengqing
Adobe PDF (3037 KB) | Views/Downloads: 176/74 | Submitted: 2023/06/13
dialogue summarization  abstractive summarization  controllable text generation  natural language processing  
Everybody’s Talkin’: Let Me Talk as You Want (Journal article)
IEEE Transactions on Information Forensics and Security, 2022, Volume: 17, Issue: 1, Pages: 585-598
Authors: 宋林森; 吴文岩; 钱晨; 赫然; Loy, Chen Change
Adobe PDF (15432 KB) | Views/Downloads: 64/11 | Submitted: 2023/06/29
Talking face generation  Video generation  GAN  Audio dubbing  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 2241-2254
Authors: Wang, Tao; Yi, Jiangyan; Fu, Ruibo; Tao, Jianhua; Wen, Zhengqi
Views/Downloads: 180/0 | Submitted: 2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
Continual Learning for Fake Audio Detection (Conference paper)
Online (Czech Republic), 2021-9
Authors: Ma Haoxin; Yi Jiangyan; Tao Jianhua; Bai Ye; Tian Zhengkun; Wang Chenglong
Adobe PDF (2113 KB) | Views/Downloads: 217/57 | Submitted: 2022/06/20
fake audio detection  continual learning  detecting fake without forgetting  
Object Reconstruction Based on Attentive Recurrent Network from Single and Multiple Images (Journal article)
NEURAL PROCESSING LETTERS, 2021, Issue: 53, Pages: 18
Authors: Gao, Zishu; Li, En; Wang, Zhe; Yang, Guodong; Lu, Jiwu; Ouyang, Bo; Xu, Dawei; Liang, Zize
Adobe PDF (1338 KB) | Views/Downloads: 236/49 | Submitted: 2021/03/01
Object reconstruction  Convolutional LSTM  Visual attention  Robotic application  
Decoupled Representation Learning for Character Glyph Synthesis (Journal article)
IEEE Transactions on Multimedia, 2021, Volume: 2021, Issue: 2021, Pages: 1-13
Authors: Xiyan Liu; Gaofeng Meng; Jianlong Chang; Ruiguang Hu; Shiming Xiang; Chunhong Pan
Adobe PDF (4588 KB) | Views/Downloads: 167/45 | Submitted: 2022/01/24
Character glyph synthesis  Decoupled representation  generative adversarial networks  
FOCUSING ON ATTENTION: PROSODY TRANSFER AND ADAPTATIVE OPTIMIZATION STRATEGY FOR MULTI-SPEAKER END-TO-END SPEECH SYNTHESIS (Conference paper)
Online virtual conference, 2020-5
Authors: Fu, Ruibo; Tao, Jianhua; Wen, Zhengqi; Yi, Jiangyan; Wang, Tao
Adobe PDF (154 KB) | Views/Downloads: 321/83 | Submitted: 2020/06/27
prosody transfer  optimization strategy  speaker adaptation  attention  speech synthesis  
Design and Tension Modeling of a Novel Cable-Driven Rigid Snake-Like Manipulator (Journal article)
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, Volume: 99, Issue: 2, Pages: 211-228
Authors: Xu, Dawei; Li, En; Liang, Zize; Gao, Zishu
Adobe PDF (5913 KB) | Views/Downloads: 322/58 | Submitted: 2020/03/30
Snake-like manipulator  Cable-driven  Kinematics  Tension model  Neural network  Reinforcement learning  
Opportunities and challenges for biometrics (Monograph)
Switzerland: Springer, 2020
Authors: Sun, Zhenan; Li, Qi; Liu, Yunfan; Zhu, Yuhao
Adobe PDF (590 KB) | Views/Downloads: 54/22 | Submitted: 2024/02/23
Phoneme dependent speaker embedding and model factorization for multi-speaker speech synthesis and adaptation (Conference paper)
Brighton, UK, May 12-17, 2019
Authors: Fu, Ruibo; Tao, Jianhua; Wen, Zhengqi; Zheng, Yibin
Adobe PDF (429 KB) | Views/Downloads: 211/73 | Submitted: 2020/06/24
speech synthesis  speaker adaptation  speaker embedding  phoneme representation