CASIA OpenIR

Browse/search results: 45 records in total, showing records 1-10

Modal Contrastive Learning Based End-to-End Text Image Machine Translation (Journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, Volume: 32, Issue: 32, Pages: 2153-2165
Authors: Ma, Cong; Han, Xu; Wu, Linghui; Zhang, Yaping; Zhao, Yang; Zhou, Yu; Zong, Chengqing
Adobe PDF(6551Kb) | Views/Downloads: 41/20 | Submitted: 2024/06/26
Keywords: Transformers; Machine translation; Decoding; Semantics; Pipelines; Text recognition; Task analysis; Text image machine translation; contrastive learning; text image recognition; machine translation
GAN-Based Facial Attribute Manipulation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 12, Pages: 14590-14610
Authors: Liu, Yunfan; Li, Qi; Deng, Qiyao; Sun, Zhenan; Yang, Ming-Hsuan
Adobe PDF(15297Kb) | Views/Downloads: 109/31 | Submitted: 2024/02/22
Keywords: Generative adversarial networks; image translation; facial attribute manipulation
Patch Loss: A generic multi-scale perceptual loss for single image super-resolution (Journal article)
Pattern Recognition, 2023, Volume: 139, Pages: 109510
Authors: An T(安泰); Mao BJ(毛彬杰); Xue B(薛斌); Huo CL(霍春雷); Xiang SM(向世明); Pan CH(潘春洪)
Adobe PDF(5876Kb) | Views/Downloads: 148/28 | Submitted: 2024/01/17
Keywords: Single-image super-resolution; Multi-scale loss functions; Image visual perception; Perceptual metrics
Two-stage deep spectrum fusion for noise-robust end-to-end speech recognition (Journal article)
APPLIED ACOUSTICS, 2023, Volume: 212, Pages: 10
Authors: Fan, Cunhang; Ding, Mingming; Yi, Jiangyan; Li, Jinpeng; Lv, Zhao
Views/Downloads: 56/0 | Submitted: 2023/11/16
Keywords: Robust end-to-end ASR; Speech enhancement; Masking and mapping; Speech distortion; Deep spectrum fusion
Towards Better Word Importance Ranking in Textual Adversarial Attacks (Conference paper)
Gold Coast, Australia, June 18-23, 2023
Authors: Shi, Jiahui; Li, Linjing; Zeng, Daniel Dajun
Adobe PDF(932Kb) | Views/Downloads: 275/114 | Submitted: 2023/09/27
SpecMNet: Spectrum mend network for monaural speech enhancement (Journal article)
APPLIED ACOUSTICS, 2022, Volume: 194, Pages: 9
Authors: Fan, Cunhang; Zhang, Hongmei; Yi, Jiangyan; Lv, Zhao; Tao, Jianhua; Li, Taihao; Pei, Guanxiong; Wu, Xiaopei; Li, Sheng
Views/Downloads: 268/0 | Submitted: 2022/07/25
Keywords: Monaural speech enhancement; Speech distortion; Spectrum mend network; SI-SNR; BLSTM
POPO: Pessimistic Offline Policy Optimization (Conference paper)
Singapore, Singapore, 23-27 May 2022
Authors: He Q(何强); Hou XW(侯新文); Liu Y(刘禹)
Adobe PDF(1200Kb) | Views/Downloads: 218/46 | Submitted: 2022/06/27
Keywords: reinforcement learning; offline optimization; out-of-distribution
Key point localization and recurrent neural network based water meter reading recognition (Journal article)
Displays, 2022, Volume: 74, Issue: 2022, Pages: 0-0
Authors: Jiguang Zhang; Wenrui Liu; Shibiao Xu; Xiaopeng Zhang
Adobe PDF(4271Kb) | Views/Downloads: 242/60 | Submitted: 2022/05/06
Keywords: Mechanical water meters reading; Reading region detection; Digit wheels recognition; Key point location; Recurrent convolutional network
Decoupled Representation Learning for Character Glyph Synthesis (Journal article)
IEEE Transactions on Multimedia, 2021, Volume: 2021, Issue: 2021, Pages: 1-13
Authors: Xiyan Liu; Gaofeng Meng; Jianlong Chang; Ruiguang Hu; Shiming Xiang; Chunhong Pan
Adobe PDF(4588Kb) | Views/Downloads: 223/55 | Submitted: 2022/01/24
Keywords: Character glyph synthesis; Decoupled representation; generative adversarial networks
Deep Time Delay Neural Network for Speech Enhancement with Full Data Learning (Conference paper)
Hong Kong, 24-27 Jan. 2021
Authors: Fan, Cunhang; Liu, Bin; Tao, Jianhua; Yi, Jiangyan; Wen, Zhengqi; Song, Leichao
Adobe PDF(934Kb) | Views/Downloads: 266/66 | Submitted: 2021/06/01