CASIA OpenIR

Browse/Search results: 31 items in total, showing items 1-10

Modal Contrastive Learning Based End-to-End Text Image Machine Translation Journal article
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, Volume: 32, Issue: 32, Pages: 2153-2165
Authors: Ma, Cong; Han, Xu; Wu, Linghui; Zhang, Yaping; Zhao, Yang; Zhou, Yu; Zong, Chengqing
Adobe PDF(6551Kb) | Views/Downloads: 20/9 | Submitted: 2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
GAN-Based Facial Attribute Manipulation Journal article
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 12, Pages: 14590-14610
Authors: Liu, Yunfan; Li, Qi; Deng, Qiyao; Sun, Zhenan; Yang, Ming-Hsuan
Adobe PDF(15297Kb) | Views/Downloads: 91/23 | Submitted: 2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Patch Loss: A generic multi-scale perceptual loss for single image super-resolution Journal article
Pattern Recognition, 2023, Volume: 139, Pages: 109510
Authors: An T(安泰); Mao BJ(毛彬杰); Xue B(薛斌); Huo CL(霍春雷); Xiang SM(向世明); Pan CH(潘春洪)
Adobe PDF(5876Kb) | Views/Downloads: 132/22 | Submitted: 2024/01/17
Single-image super-resolution  Multi-scale loss functions  Image visual perception  Perceptual metrics  
Two-stage deep spectrum fusion for noise-robust end-to-end speech recognition Journal article
APPLIED ACOUSTICS, 2023, Volume: 212, Pages: 10
Authors: Fan, Cunhang; Ding, Mingming; Yi, Jiangyan; Li, Jinpeng; Lv, Zhao
Views/Downloads: 51/0 | Submitted: 2023/11/16
Robust end-to-end ASR  Speech enhancement  Masking and mapping  Speech distortion  Deep spectrum fusion  
SpecMNet: Spectrum mend network for monaural speech enhancement Journal article
APPLIED ACOUSTICS, 2022, Volume: 194, Pages: 9
Authors: Fan, Cunhang; Zhang, Hongmei; Yi, Jiangyan; Lv, Zhao; Tao, Jianhua; Li, Taihao; Pei, Guanxiong; Wu, Xiaopei; Li, Sheng
Views/Downloads: 260/0 | Submitted: 2022/07/25
Monaural speech enhancement  Speech distortion  Spectrum mend network  SI-SNR  BLSTM  
Key point localization and recurrent neural network based water meter reading recognition Journal article
Displays, 2022, Volume: 74, Issue: 2022, Pages: 0-0
Authors: Jiguang Zhang; Wenrui Liu; Shibiao Xu; Xiaopeng Zhang
Adobe PDF(4271Kb) | Views/Downloads: 230/57 | Submitted: 2022/05/06
Mechanical water meters reading  Reading region detection  Digit wheels recognition  Key point location  Recurrent convolutional network  
Decision-based adversarial attack with frequency mixup Journal article
IEEE Trans. Information Forensics and Security (TIFS), 2022, Issue: 17, Pages: 1038-1052
Authors: Xiu-Chuan Li; Xu-Yao Zhang; Fei Yin; Cheng-Lin Liu
Adobe PDF(5863Kb) | Views/Downloads: 288/54 | Submitted: 2022/04/07
Decision-based attack, detection, frequency domain
Decoupled Representation Learning for Character Glyph Synthesis Journal article
IEEE Transactions on Multimedia, 2021, Volume: 2021, Issue: 2021, Pages: 1-13
Authors: Xiyan Liu; Gaofeng Meng; Jianlong Chang; Ruiguang Hu; Shiming Xiang; Chunhong Pan
Adobe PDF(4588Kb) | Views/Downloads: 207/50 | Submitted: 2022/01/24
Character glyph synthesis  Decoupled representation  generative adversarial networks  
A time-frequency channel attention and vectorization network for automatic depression level prediction Journal article
Neurocomputing, 2021, Issue: 450, Pages: 208-218
Authors: Mingyue Niu; Bin Liu; Jianhua Tao; Qifei Li
Adobe PDF(2001Kb) | Views/Downloads: 208/57 | Submitted: 2021/06/01
Sphere embedding normalization  DenseNet  Transition layer  Time-frequency channel attention block  Time-frequency vectorization block  Depression detection  
Exploiting the directional coherence function for multichannel source extraction Journal article
SPEECH COMMUNICATION, 2021, Volume: 128, Pages: 1-14
Authors: Liang, Shan; Li, Guanjun; Nie, Shuai; Yang, ZhanLei; Liu, WenJu; Tao, Jianhua
Views/Downloads: 233/0 | Submitted: 2021/05/06
Directional coherence function  Coherent-to-Diffuse Ratio  General sidelobe canceller  Desired Speech Presence Probability