CASIA OpenIR

浏览/检索结果: 共8条,第1-8条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model 期刊论文
IEEE Transactions on Intelligent Transportation Systems, 2024, 页码: 10.1109/TITS.2024.3400227
作者:  Zeyu Gao;  Yao Mu;  Chen Chen;  Jingliang Duan;  Ping Luo;  Yanfeng Lu;  Shengbo Eben Li
Adobe PDF(3954Kb)  |  收藏  |  浏览/下载:38/13  |  提交时间:2024/06/06
End-to-end autonomous driving  deep reinforcement learning  world model  
RTDOD: A large-scale RGB-thermal domain-incremental object detection dataset for UAVs 期刊论文
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 9
作者:  Feng, Hangtao;  Zhang, Lu;  Zhang, Siqi;  Wang, Dong;  Yang, Xu;  Liu, Zhiyong
Adobe PDF(3013Kb)  |  收藏  |  浏览/下载:121/9  |  提交时间:2024/02/22
Domain -incremental object detection  Dataset  RGB-T dataset  Object detection dataset  UAVs dataset  Object detection  
Reducing Vision-Answer Biases for Multiple-Choice VQA 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4621-4634
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(2684Kb)  |  收藏  |  浏览/下载:90/5  |  提交时间:2023/11/17
Multiple-choice VQA  vision-answer bias  causal intervention  counterfactual interaction learning  
SMIN: Semi-Supervised Multi-Modal Interaction Network for Conversational Emotion Recognition 期刊论文
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 卷号: 14, 期号: 3, 页码: 2415-2429
作者:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2103Kb)  |  收藏  |  浏览/下载:158/11  |  提交时间:2023/11/15
Emotion recognition  Feature extraction  Training  Acoustics  Semisupervised learning  Benchmark testing  Hidden Markov models  Semi-supervised multi-modal interaction network (SMIN)  conversational emotion recognition  semi-supervised learning  intra-modal interaction  cross-modal interaction  
Cross stage partial connections based weighted Bi-directional feature pyramid and enhanced spatial transformation network for robust object detection 期刊论文
NEUROCOMPUTING, 2022, 卷号: 513, 页码: 70-82
作者:  Lu, Yan-Feng;  Yu, Qian;  Gao, Jing-Wen;  Li, Yi;  Zou, Jun-Cheng;  Qiao, Hong
Adobe PDF(3025Kb)  |  收藏  |  浏览/下载:263/11  |  提交时间:2022/11/14
Robust object detection  Structural deformation  Image detection  Spatial transformation  
Weakly Aligned Feature Fusion for Multimodal Object Detection 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 15
作者:  Zhang, Lu;  Liu, Zhiyong;  Zhu, Xiangyu;  Song, Zhan;  Yang, Xu;  Lei, Zhen;  Qiao, Hong
Adobe PDF(19222Kb)  |  收藏  |  浏览/下载:246/7  |  提交时间:2022/01/27
Object detection  Feature extraction  Detectors  Robustness  Cameras  Automation  Training  Deep learning  feature fusion  multimodal object detection  pedestrian detection  
CTNet: Conversational Transformer Network for Emotion Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 985-1000
作者:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2230Kb)  |  收藏  |  浏览/下载:398/63  |  提交时间:2021/05/06
Emotion recognition  Context modeling  Feature extraction  Fuses  Speech processing  Data models  Bidirectional control  Context-sensitive modeling  conversational transformer network (CTNet)  conversational emotion recognition  multimodal fusion  speaker-sensitive modeling  
End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 卷号: 28, 期号: 28, 页码: 1303-1314
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi;  Liu, Xuefei
Adobe PDF(1344Kb)  |  收藏  |  浏览/下载:341/73  |  提交时间:2020/06/22
Feature extraction  Training  Interference  Speech enhancement  Clustering algorithms  Spectrogram  Speech separation  end-to-end post-filter  deep attention fusion features  deep clustering  permutation invariant training