CASIA OpenIR

浏览/检索结果: 共10条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:33/17  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Molecular Contrastive Pretraining with Collaborative Featurizations 期刊论文
Journal of Chemical Information and Modeling (JCIM), 2024, 卷号: 64, 期号: 4, 页码: 1112–1122
作者:  Yanqiao Zhu;  Dingshuo Chen;  Yuanqi Du;  Yingze Wang;  Qiang Liu;  Shu Wu
Adobe PDF(1868Kb)  |  收藏  |  浏览/下载:28/10  |  提交时间:2024/06/21
Stage-Aware Hierarchical Attentive Relational Network for Diagnosis Prediction 期刊论文
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023, 卷号: 36, 期号: 4, 页码: 1773-1784
作者:  Liping Wang;  Qiang Liu;  Mengqi Zhang;  Yaxuan Hu;  Shu Wu;  Liang Wang
Adobe PDF(2088Kb)  |  收藏  |  浏览/下载:39/14  |  提交时间:2024/06/21
Medical diagnostic imaging  Knowledge graphs  Ontologies  Codes  Data models  Predictive models  Graph neural networks  Diagnosis prediction  electronic health record  knowledge graph  relational graph neural network  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 卷号: 15, 期号: 1, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:67/18  |  提交时间:2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 4334-4347
作者:  Xiao, Linhui;  Yang, Xiaoshan;  Peng, Fang;  Yan, Ming;  Wang, Yaowei;  Xu, Changsheng
收藏  |  浏览/下载:33/0  |  提交时间:2024/05/30
Grounding  Reliability  Adaptation models  Task analysis  Visualization  Data models  Annotations  Visual grounding  curriculum learning  pseudo-language label  and vision-language models  
The Journey/DAO/TAO of Embodied Intelligence: From Large Models to Foundation Intelligence and Parallel Intelligence 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 6, 页码: 1313-1316
作者:  Tianyu Shen;  Jinlin Sun;  Shihan Kong;  Yutong Wang;  Juanjuan Li;  Xuan Li;  Fei-Yue Wang
Adobe PDF(682Kb)  |  收藏  |  浏览/下载:62/24  |  提交时间:2024/05/22
Artificial intelligence  Chatbots  Autonomous systems  Intelligent systems  Robots  Digital humans  Robot kinematics  Learning (artificial intelligence)  Biological system modeling  Computational modeling  Human-robot interaction  Complex systems  Deep learning  Reinforcement learning  Large language models  
SpatioTemporal Inference Network for Precipitation Nowcasting With Multimodal Fusion 期刊论文
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 卷号: 17, 页码: 1299-1314
作者:  Jin, Qizhao;  Zhang, Xinbang;  Xiao, Xinyu;  Wang, Ying;  Meng, Gaofeng;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(8766Kb)  |  收藏  |  浏览/下载:90/9  |  提交时间:2024/02/21
Data mining  multimodal knowledge discovery  precipitation nowcasting  
Depth-Guided Vision Transformer With Normalizing Flows for Monocular 3D Object Detection 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 3, 页码: 673-689
作者:  Cong Pan;  Junran Peng;  Zhaoxiang Zhang
Adobe PDF(37784Kb)  |  收藏  |  浏览/下载:86/24  |  提交时间:2024/02/19
Monocular 3D object detection  normalizing flows  Swin Transformer  
Description-Enhanced Label Embedding Contrastive Learning for Text Classification 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 14
作者:  Zhang, Kun;  Wu, Le;  Lv, Guangyi;  Chen, Enhong;  Ruan, Shulan;  Liu, Jing;  Zhang, Zhiqiang;  Zhou, Jun;  Wang, Meng
收藏  |  浏览/下载:152/0  |  提交时间:2023/11/17
Contrastive learning (CL)  label embedding  representation learning  text classification  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1-15
作者:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  收藏  |  浏览/下载:202/50  |  提交时间:2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation