CASIA OpenIR

浏览/检索结果: 共26条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 卷号: 15, 期号: 1, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:70/19  |  提交时间:2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:104/28  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Advancements in Humanoid Robots: A Comprehensive Review and Future Prospects 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 2, 页码: 301-328
作者:  Yuchuang Tong;  Haotian Liu;  Zhengtao Zhang
Adobe PDF(7587Kb)  |  收藏  |  浏览/下载:176/53  |  提交时间:2024/01/23
Future trends and challenges  humanoid robots  human-robot interaction  key technologies  potential applications  
Reducing Vision-Answer Biases for Multiple-Choice VQA 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4621-4634
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(2684Kb)  |  收藏  |  浏览/下载:98/9  |  提交时间:2023/11/17
Multiple-choice VQA  vision-answer bias  causal intervention  counterfactual interaction learning  
ScoreMix: A Scalable Augmentation Strategy for Training GANs With Limited Data 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 7, 页码: 8920-8935
作者:  Cao, Jie;  Luo, Mandi;  Yu, Junchi;  Yang, Ming-Hsuan;  He, Ran
Adobe PDF(1823Kb)  |  收藏  |  浏览/下载:132/9  |  提交时间:2023/11/17
Generative adversarial networks  image synthesis  data augmentation  few-shot image-to-image translation  
A Framework and Operational Procedures for Metaverses-Based Industrial Foundation Models 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 页码: 10
作者:  Wang, Jiangong;  Tian, Yonglin;  Wang, Yutong;  Yang, Jing;  Wang, Xingxia;  Wang, Sanjin;  Kwan, Oliver
Adobe PDF(3322Kb)  |  收藏  |  浏览/下载:181/59  |  提交时间:2023/02/22
Cyber-physical-social intelligence (CPSI)  cyber-physical-social systems (CPSSs)  industrial foundation models (IFMs)  intelligent enterprises  metaverses  operational processes  parallel intelligence  
A Parallel Teacher for Synthetic-to-Real Domain Adaptation of Traffic Object Detection 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2022, 卷号: 7, 期号: 3, 页码: 441-455
作者:  Wang, Jiangong;  Shen, Tianyu;  Tian, Yonglin;  Wang, Yutong;  Gou, Chao;  Wang, Xiao;  Yao, Fei;  Sun, Changyin
Adobe PDF(2602Kb)  |  收藏  |  浏览/下载:327/73  |  提交时间:2022/11/28
Object detection  Feature extraction  Data models  Training  Knowledge engineering  Detectors  Computational modeling  Computer vision  Unsupervised Domain Adaptation  Teacher-student learning  Traffic object detection  
Memory-Modulated Transformer Network for Heterogeneous Face Recognition 期刊论文
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2022, 卷号: 17, 页码: 2095-2109
作者:  Luo, Mandi;  Wu, Haoxue;  Huang, Huaibo;  He, Weizan;  He, Ran
Adobe PDF(4712Kb)  |  收藏  |  浏览/下载:395/135  |  提交时间:2022/07/25
Face recognition  Task analysis  Transformers  Encoding  Feature extraction  Image recognition  Memory modules  Heterogeneous face recognition  style transformer  memory network  
Weakly-Supervised Facial Expression Recognition in the Wild With Noisy Data 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 1800-1814
作者:  Zhang, Feifei;  Xu, Mingliang;  Xu, Changsheng
收藏  |  浏览/下载:272/0  |  提交时间:2022/06/10
Noise measurement  Face recognition  Data models  Task analysis  Training data  Training  Annotations  Facial expression recognition  noisy labeled data  clean labels  end-to-end  pose modeling  noise modeling  
VAG: A Uniform Model for Cross-Modal Visual-Audio Mutual Generation 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 13
作者:  Hao, Wangli;  Guan, He;  Zhang, Zhaoxiang
Adobe PDF(37909Kb)  |  收藏  |  浏览/下载:245/1  |  提交时间:2022/06/10
Task analysis  Instruments  Visualization  Image reconstruction  Generators  Decoding  Generative adversarial networks  Cross modality  cross-modal generation  mutual generation  visual and audio