CASIA OpenIR

浏览/检索结果: 共7条,第1-7条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Boost 3-D Object Detection via Point Clouds Segmentation and Fused 3-D GIoU-L-1 Loss 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 2, 页码: 762-773
作者:  Chen, Yaran;  Li, Haoran;  Gao, Ruiyuan;  Zhao, Dongbin
Adobe PDF(2082Kb)  |  收藏  |  浏览/下载:226/45  |  提交时间:2022/03/17
3-D object detection  generalized Intersection of Union (GIoU) loss  segmentation  
Unconstrained end-to-end text reading with feature rectification 期刊论文
PATTERN RECOGNITION LETTERS, 2021, 卷号: 149, 页码: 1-8
作者:  Du, Chen;  Wang, Yanna;  Wang, Chunheng;  Xiao, Baihua;  Shi, Cunzhao
Adobe PDF(1133Kb)  |  收藏  |  浏览/下载:284/56  |  提交时间:2021/11/02
Text recognition  Text detection  Position-sensitive network  Features incompatibility  End-to-end  
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 198-209
作者:  Fan, Cunhang;  Yi, Jiangyan;  Tao, Jianhua;  Tian, Zhengkun;  Liu, Bin;  Wen, Zhengqi
Adobe PDF(2534Kb)  |  收藏  |  浏览/下载:371/48  |  提交时间:2021/03/08
Speech enhancement  Speech recognition  Training  Noise measurement  Logic gates  Acoustic distortion  Task analysis  Gated recurrent fusion  robust end-to-end speech recognition  speech distortion  speech enhancement  speech transformer  
Adversarial learning based attentional scene text recognizer 期刊论文
PATTERN RECOGNITION LETTERS, 2020, 卷号: 138, 期号: 1, 页码: 217-222
作者:  Zhao, Jinyuan;  Wang, Yanna;  Xiao, Baihua;  Shi, Cunzhao;  Jiang, Jingzhong;  Wang, Chunheng
Adobe PDF(1152Kb)  |  收藏  |  浏览/下载:339/82  |  提交时间:2021/01/07
Scene text recognition  Generative adversarial network  Image rectification  
DetectGAN: GAN-based text detector for camera-captured document images 期刊论文
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2020, 卷号: 23, 期号: 4, 页码: 267-277
作者:  Zhao, Jinyuan;  Wang, Yanna;  Xiao, Baihua;  Shi, Cunzhao;  Jia, Fuxi;  Wang, Chunheng
Adobe PDF(3817Kb)  |  收藏  |  浏览/下载:295/53  |  提交时间:2020/09/21
Text detection  Camera-captured document images  Multi-scale context features  Generative adversarial networks  
Multi-branch guided attention network for irregular text recognition 期刊论文
Neurocomputing, 2020, 卷号: 425, 期号: 0, 页码: 0
作者:  Wang, Cong;  Liu, Cheng-Lin
浏览  |  Adobe PDF(2370Kb)  |  收藏  |  浏览/下载:216/56  |  提交时间:2020/07/16
Irregular text recognition, Mutual guidance, Multi-branch guided attention network (MBAN)  
Forward-Backward Decoding Sequence for Regularizing End-to-End TTS 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 卷号: 27, 期号: 12, 页码: 2067-2079
作者:  Zheng, Yibin;  Tao, Jianhua;  Wen, Zhengqi;  Yi, Jiangyan
收藏  |  浏览/下载:323/0  |  提交时间:2020/03/30
Decoding  Training  Speech processing  Linguistics  Acoustics  Speech recognition  Forward-backward  regularization  encoder-decoder with attention  end-to-end  joint-training  TTS