CASIA OpenIR

浏览/检索结果: 共7条,第1-7条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Two-stage deep spectrum fusion for noise-robust end-to-end speech recognition 期刊论文
APPLIED ACOUSTICS, 2023, 卷号: 212, 页码: 10
作者:  Fan, Cunhang;  Ding, Mingming;  Yi, Jiangyan;  Li, Jinpeng;  Lv, Zhao
收藏  |  浏览/下载:36/0  |  提交时间:2023/11/16
Robust end-to-end ASR  Speech enhancement  Masking and mapping  Speech distortion  Deep spectrum fusion  
Table Structure Recognition and Form Parsing by End-to-End Object Detection and Relation Parsing 期刊论文
PATTERN RECOGNITION, 2022, 卷号: 132, 页码: 14
作者:  Li, Xiao-Hui;  Yin, Fei;  Dai, He-Sen;  Liu, Cheng-Lin
收藏  |  浏览/下载:246/0  |  提交时间:2022/11/14
Table detection  Table structure recognition  Template -free form parsing  Graph neural network  End -to -end training  
Unconstrained end-to-end text reading with feature rectification 期刊论文
PATTERN RECOGNITION LETTERS, 2021, 卷号: 149, 页码: 1-8
作者:  Du, Chen;  Wang, Yanna;  Wang, Chunheng;  Xiao, Baihua;  Shi, Cunzhao
Adobe PDF(1133Kb)  |  收藏  |  浏览/下载:301/63  |  提交时间:2021/11/02
Text recognition  Text detection  Position-sensitive network  Features incompatibility  End-to-end  
Integrating Knowledge Into End-to-End Speech Recognition From External Text-Only Data 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 卷号: 29, 页码: 1340-1351
作者:  Bai, Ye;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Tian, Zhengkun;  Zhang, Shuai
收藏  |  浏览/下载:172/0  |  提交时间:2021/06/07
End-to-End  language modeling  speech recognition  teacher-student learning  transfer learning  
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 198-209
作者:  Fan, Cunhang;  Yi, Jiangyan;  Tao, Jianhua;  Tian, Zhengkun;  Liu, Bin;  Wen, Zhengqi
Adobe PDF(2534Kb)  |  收藏  |  浏览/下载:400/50  |  提交时间:2021/03/08
Speech enhancement  Speech recognition  Training  Noise measurement  Logic gates  Acoustic distortion  Task analysis  Gated recurrent fusion  robust end-to-end speech recognition  speech distortion  speech enhancement  speech transformer  
End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 卷号: 28, 期号: 28, 页码: 1303-1314
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi;  Liu, Xuefei
Adobe PDF(1344Kb)  |  收藏  |  浏览/下载:305/65  |  提交时间:2020/06/22
Feature extraction  Training  Interference  Speech enhancement  Clustering algorithms  Spectrogram  Speech separation  end-to-end post-filter  deep attention fusion features  deep clustering  permutation invariant training  
Forward-Backward Decoding Sequence for Regularizing End-to-End TTS 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 卷号: 27, 期号: 12, 页码: 2067-2079
作者:  Zheng, Yibin;  Tao, Jianhua;  Wen, Zhengqi;  Yi, Jiangyan
收藏  |  浏览/下载:351/0  |  提交时间:2020/03/30
Decoding  Training  Speech processing  Linguistics  Acoustics  Speech recognition  Forward-backward  regularization  encoder-decoder with attention  end-to-end  joint-training  TTS