CASIA OpenIR

Browse/search results: 10 records in total, showing records 1-10

Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, Volume: 31, Pages: 2963-2973
Authors: Yi, Jiangyan; Tao, Jianhua; Fu, Ruibo; Wang, Tao; Zhang, Chu Yuan; Wang, Chenglong
Views/Downloads: 44/0  |  Submitted: 2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 2241-2254
Authors: Wang, Tao; Yi, Jiangyan; Fu, Ruibo; Tao, Jianhua; Wen, Zhengqi
Views/Downloads: 199/0  |  Submitted: 2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
Fact-Driven Abstractive Summarization by Utilizing Multi-Granular Multi-Relational Knowledge (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 1665-1678
Authors: Mao, Qianren; Li, Jianxin; Peng, Hao; He, Shizhu; Wang, Lihong; Yu, Philip S.; Wang, Zheng
Views/Downloads: 127/0  |  Submitted: 2022/07/25
Fact consistency  graph neural network  language model  pointer network  text summarization  
A Graph-to-Sequence Learning Framework for Summarizing Opinionated Texts (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Volume: 29, Issue: 1, Pages: 1650-1660
Authors: Wei, Penghui; Zhao, Jiahao; Mao, Wenji
Adobe PDF (1818Kb)  |  Views/Downloads: 175/39  |  Submitted: 2021/06/15
Opinionated text summarization  
CTNet: Conversational Transformer Network for Emotion Recognition (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Volume: 29, Pages: 985-1000
Authors: Lian, Zheng; Liu, Bin; Tao, Jianhua
Adobe PDF (2230Kb)  |  Views/Downloads: 333/58  |  Submitted: 2021/05/06
Emotion recognition  Context modeling  Feature extraction  Fuses  Speech processing  Data models  Bidirectional control  Context-sensitive modeling  conversational transformer network (CTNet)  conversational emotion recognition  multimodal fusion  speaker-sensitive modeling  
Integrating Knowledge Into End-to-End Speech Recognition From External Text-Only Data (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Volume: 29, Pages: 1340-1351
Authors: Bai, Ye; Yi, Jiangyan; Tao, Jianhua; Wen, Zhengqi; Tian, Zhengkun; Zhang, Shuai
Views/Downloads: 161/0  |  Submitted: 2021/06/07
End-to-End  language modeling  speech recognition  teacher-student learning  transfer learning  
Medical Term and Status Generation From Chinese Clinical Dialogue With Multi-Granularity Transformer (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Volume: 29, Pages: 3362-3374
Authors: Li, Mei; Xiang, Lu; Kang, Xiaomian; Zhao, Yang; Zhou, Yu; Zong, Chengqing
Adobe PDF (3036Kb)  |  Views/Downloads: 251/56  |  Submitted: 2021/12/28
Medical diagnostic imaging  Transformers  Task analysis  Medical services  Computational modeling  Semantics  Data mining  Medical dialogue  multi-granularity  attention mechanism  natural language understanding  sequence to sequence learning  
Forward-Backward Decoding Sequence for Regularizing End-to-End TTS (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, Volume: 27, Issue: 12, Pages: 2067-2079
Authors: Zheng, Yibin; Tao, Jianhua; Wen, Zhengqi; Yi, Jiangyan
Views/Downloads: 323/0  |  Submitted: 2020/03/30
Decoding  Training  Speech processing  Linguistics  Acoustics  Speech recognition  Forward-backward  regularization  encoder-decoder with attention  end-to-end  joint-training  TTS  
Language-Adversarial Transfer Learning for Low-Resource Speech Recognition (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, Volume: 27, Issue: 3, Pages: 621-630
Authors: Yi, Jiangyan; Tao, Jianhua; Wen, Zhengqi; Bai, Ye
Adobe PDF (907Kb)  |  Views/Downloads: 395/84  |  Submitted: 2019/07/12
Adversarial training  transfer learning  cross-lingual  low-resource  speech recognition  
Learning the Multilingual Translation Representations for Question Retrieval in Community Question Answering via Non-Negative Matrix Factorization (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, Volume: 24, Issue: 7, Pages: 1305-1314
Authors: Zhou, Guangyou; Xie, Zhiwen; He, Tingting; Zhao, Jun; Hu, Xiaohua Tony
Views/Downloads: 60/0  |  Submitted: 2020/10/27
Natural Language Processing  Information Retrieval  Community Question Answering  Question Retrieval  Text Mining