CASIA OpenIR

Browse/search results: 15 items in total, showing items 1-10

Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, Volume: 31, Pages: 2963-2973
Authors: Yi, Jiangyan; Tao, Jianhua; Fu, Ruibo; Wang, Tao; Zhang, Chu Yuan; Wang, Chenglong
Views/Downloads: 41/0 | Submitted: 2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
HMDRL: Hierarchical Mixed Deep Reinforcement Learning to Balance Vehicle Supply and Demand (Journal article)
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, Pages: 12
Authors: Xi, Jinhao; Zhu, Fenghua; Ye, Peijun; Lv, Yisheng; Tang, Haina; Wang, Fei-Yue
Adobe PDF (3316 KB) | Views/Downloads: 244/30 | Submitted: 2022/09/19
deep reinforcement learning  online ride-hailing system  hierarchical repositioning framework  parallel coordination mechanism  mixed state  
Adaptable Global Network for Whole-Brain Segmentation with Symmetry Consistency Loss (Journal article)
COGNITIVE COMPUTATION, 2022, Pages: 14
Authors: Zhao, Yuan-Xing; Zhang, Yan-Ming; Song, Ming; Liu, Cheng-Lin
Adobe PDF (2496 KB) | Views/Downloads: 268/73 | Submitted: 2022/07/25
Whole-brain segmentation  Adaptable global network  Semi-supervised learning  Symmetry consistency loss  
Boost 3-D Object Detection via Point Clouds Segmentation and Fused 3-D GIoU-L-1 Loss (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 2, Pages: 762-773
Authors: Chen, Yaran; Li, Haoran; Gao, Ruiyuan; Zhao, Dongbin
Adobe PDF (2082 KB) | Views/Downloads: 222/43 | Submitted: 2022/03/17
3-D object detection  generalized Intersection of Union (GIoU) loss  segmentation  
Deep Neural Network Self-Distillation Exploiting Data Representation Invariance (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 1, Pages: 257-269
Authors: Xu, Ting-Bing; Liu, Cheng-Lin
Views/Downloads: 175/0 | Submitted: 2022/02/16
Training  Nonlinear distortion  Data models  Neural networks  Knowledge engineering  Network architecture  Generalization error  network compression  representation invariance  self-distillation (SD)  
Semantic-diversity transfer network for generalized zero-shot learning via inner disagreement based OOD detector (Journal article)
KNOWLEDGE-BASED SYSTEMS, 2021, Volume: 229, Pages: 11
Authors: Liu, Bo; Dong, Qiulei; Hu, Zhanyi
Adobe PDF (1224 KB) | Views/Downloads: 318/66 | Submitted: 2021/11/04
Zero-shot learning  Visual-semantic embedding  Out-of-distribution detection  
EAT-NAS: elastic architecture transfer for accelerating large-scale neural architecture search (Journal article)
SCIENCE CHINA-INFORMATION SCIENCES, 2021, Volume: 64, Issue: 9, Pages: 13
Authors: Fang, Jiemin; Chen, Yukang; Zhang, Xinbang; Zhang, Qian; Huang, Chang; Meng, Gaofeng; Liu, Wenyu; Wang, Xinggang
Adobe PDF (377 KB) | Views/Downloads: 289/49 | Submitted: 2021/11/02
architecture transfer  neural architecture search  evolutionary algorithm  large-scale dataset  
An Iterative Co-Training Transductive Framework for Zero Shot Learning (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, Volume: 30, Pages: 6943-6956
Authors: Liu, Bo; Hu, Lihua; Dong, Qiulei; Hu, Zhanyi
Adobe PDF (2452 KB) | Views/Downloads: 234/52 | Submitted: 2021/11/02
Visualization  Semantics  Training  Feature extraction  Testing  Detectors  Predictive models  Zero-shot learning  transductive learning  co-training  
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Volume: 29, Pages: 198-209
Authors: Fan, Cunhang; Yi, Jiangyan; Tao, Jianhua; Tian, Zhengkun; Liu, Bin; Wen, Zhengqi
Adobe PDF (2534 KB) | Views/Downloads: 364/47 | Submitted: 2021/03/08
Speech enhancement  Speech recognition  Training  Noise measurement  Logic gates  Acoustic distortion  Task analysis  Gated recurrent fusion  robust end-to-end speech recognition  speech distortion  speech enhancement  speech transformer  
CTNet: Conversational Transformer Network for Emotion Recognition (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Volume: 29, Pages: 985-1000
Authors: Lian, Zheng; Liu, Bin; Tao, Jianhua
Adobe PDF (2230 KB) | Views/Downloads: 328/58 | Submitted: 2021/05/06
Emotion recognition  Context modeling  Feature extraction  Fuses  Speech processing  Data models  Bidirectional control  Context-sensitive modeling  conversational transformer network (CTNet)  conversational emotion recognition  multimodal fusion  speaker-sensitive modeling