CASIA OpenIR

Browse/Search results: 59 records in total, showing 1-10

GAN-Based Facial Attribute Manipulation (journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 12, Pages: 14590-14610
Authors: Liu, Yunfan; Li, Qi; Deng, Qiyao; Sun, Zhenan; Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  Views/Downloads: 39/13  |  Submitted: 2024/02/22
Keywords: Generative adversarial networks; image translation; facial attribute manipulation
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, Volume: 31, Pages: 2963-2973
Authors: Yi, Jiangyan; Tao, Jianhua; Fu, Ruibo; Wang, Tao; Zhang, Chu Yuan; Wang, Chenglong
Views/Downloads: 48/0  |  Submitted: 2023/11/17
Keywords: Adversarial training; multi-task learning; prosodic boundaries; speech synthesis; multi-modal embeddings
Dual-stream Representation Fusion Learning for accurate medical image segmentation (journal article)
Engineering Applications of Artificial Intelligence, 2023, Volume: 123, Pages: 106402
Authors: Xu RT(许镕涛); Wang CW(王常维); Xu SB(徐士彪); Meng WL(孟维亮); Zhang XP(张晓鹏)
Adobe PDF(1893Kb)  |  Views/Downloads: 217/57  |  Submitted: 2023/05/18
A Framework and Operational Procedures for Metaverses-Based Industrial Foundation Models (journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, Pages: 10
Authors: Wang, Jiangong; Tian, Yonglin; Wang, Yutong; Yang, Jing; Wang, Xingxia; Wang, Sanjin; Kwan, Oliver
Adobe PDF(3322Kb)  |  Views/Downloads: 131/38  |  Submitted: 2023/02/22
Keywords: Cyber-physical-social intelligence (CPSI); cyber-physical-social systems (CPSSs); industrial foundation models (IFMs); intelligent enterprises; metaverses; operational processes; parallel intelligence
ASCL: Adversarial supervised contrastive learning for defense against word substitution attacks (journal article)
NEUROCOMPUTING, 2022, Volume: 510, Pages: 59-68
Authors: Shi, Jiahui; Li, Linjing; Zeng, Daniel
Adobe PDF(1054Kb)  |  Views/Downloads: 221/25  |  Submitted: 2022/11/14
Keywords: Adversarial example; Adversarial training; Model robustness; Contrastive learning; Natural language processing
Robust Texture-Aware Computer-Generated Image Forensic: Benchmark and Algorithm (journal article)
IEEE Transactions on Image Processing, 2021, Volume: 30, Pages: 8439-8453
Authors: Bai, Weiming; Zhang, Zhipeng; Li, Bing; Wang, Pei; Li, Yangxi; Zhang, Congxuan; Hu, Weiming
Adobe PDF(4552Kb)  |  Views/Downloads: 179/55  |  Submitted: 2022/06/14
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 865-878
Authors: Wang, Tao; Fu, Ruibo; Yi, Jiangyan; Tao, Jianhua; Wen, Zhengqi
Views/Downloads: 256/0  |  Submitted: 2022/06/06
Keywords: Vocoders; Stochastic processes; Neural networks; Speech processing; Signal to noise ratio; Acoustics; Speech enhancement; Vocoder; speech synthesis; deterministic plus stochastic; multiband excitation; noise control
Decoupled Representation Learning for Character Glyph Synthesis (journal article)
IEEE Transactions on Multimedia, 2021, Volume: 2021, Issue: 2021, Pages: 1-13
Authors: Xiyan Liu; Gaofeng Meng; Jianlong Chang; Ruiguang Hu; Shiming Xiang; Chunhong Pan
Adobe PDF(4588Kb)  |  Views/Downloads: 180/47  |  Submitted: 2022/01/24
Keywords: Character glyph synthesis; Decoupled representation; generative adversarial networks
A Unified Shared-Private Network with Denoising for Dialogue State Tracking (journal article)
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2021, Volume: 36, Issue: 6, Pages: 1407-1419
Authors: Liu QB(刘庆斌); He SZ(何世柱); Liu K(刘康); Liu SP(刘升平); Zhao J(赵军)
Adobe PDF(997Kb)  |  Views/Downloads: 215/68  |  Submitted: 2022/01/19
Keywords: dialogue state tracking; unified strategy; shared-private network; reinforcement learning
F0-Noise-Robust Glottal Source and Vocal Tract Analysis Based on ARX-LF Model (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Volume: 29, Pages: 3375-3383
Authors: Li, Yongwei; Tao, Jianhua; Erickson, Donna; Liu, Bin; Akagi, Masato
Views/Downloads: 120/0  |  Submitted: 2021/12/28
Keywords: Speech recognition; Iterative methods; Production; Estimation; Brain modeling; Shape; Low-frequency noise; Glottal source; vocal tract; source-filter model; ARX-LF model