CASIA OpenIR

浏览/检索结果: 共13条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
ChatGPT-Based Scenario Engineer: A New Framework on Scenario Generation for Trajectory Prediction 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 卷号: 9, 期号: 3, 页码: 4422-4431
作者:  Li, Xuan;  Liu, Enlu;  Shen, Tianyu;  Huang, Jun;  Wang, Fei-Yue
收藏  |  浏览/下载:6/0  |  提交时间:2024/07/03
Parallel driving  scenarios engineering  foundation model  vehicle operating system  generative pre-trained transformer  trajectory prediction  
SASOD: Saliency-Aware Ship Object Detection in High-Resolution Optical Images 期刊论文
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 卷号: 62, 页码: 15
作者:  Ren, Zhida;  Tang, Yongqiang;  Yang, Yang;  Zhang, Wensheng
Adobe PDF(5807Kb)  |  收藏  |  浏览/下载:8/1  |  提交时间:2024/07/03
ship detection  Saliency detection  high-resolution optical images  remote sensing  Deep learning  Feature extraction  Marine vehicles  Object detection  Remote sensing  Optical sensors  Optical imaging  saliency detection  
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:20/9  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 卷号: 15, 期号: 1, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:52/17  |  提交时间:2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
Soccer player tracking and data correction based on attention with full-field videos 期刊论文
VISUAL COMPUTER, 2024, 页码: 13
作者:  Yang, Chao;  Yang, Meng;  Li, Hongyu;  Jiang, Linlu;  Suo, Xiang;  Li, Zhen;  Meng, Weiliang;  Mao, Lijuan
Adobe PDF(8335Kb)  |  收藏  |  浏览/下载:45/13  |  提交时间:2024/05/30
Soccer player tracking  Data correction  Field mapping  
Generalized Feature Learning for Detection of Novel Objects 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 卷号: 16, 期号: 1, 页码: 388-395
作者:  Jierui Liu;  Xilong Liu;  Zhiqiang Cao;  Junzhi Yu;  Min Tan
Adobe PDF(8844Kb)  |  收藏  |  浏览/下载:25/6  |  提交时间:2024/05/28
Object detection  few-shot  generalized features  source aggregation  
Bird's-Eye-View Semantic Segmentation With Two-Stream Compact Depth Transformation and Feature Rectification 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 卷号: 8, 期号: 11, 页码: 4546-4558
作者:  Liu, Jierui;  Cao, Zhiqiang;  Yang, Jing;  Liu, Xilong;  Yang, Yuequan;  Qu, Zhiyou
Adobe PDF(21890Kb)  |  收藏  |  浏览/下载:76/11  |  提交时间:2024/03/27
Bird's-eye-view  semantic segmentation  two-stream compact depth transformation  feature rectification  
CGFormer: ViT-Based Network for Identifying Computer-Generated Images With Token Labeling 期刊论文
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 卷号: 19, 页码: 235-250
作者:  Quan, Weize;  Deng, Pengfei;  Wang, Kai;  Yan, Dong-Ming
Adobe PDF(2517Kb)  |  收藏  |  浏览/下载:74/3  |  提交时间:2024/02/22
CG image forensics  transformer  token labeling  generalization  robustness  
SpatioTemporal Inference Network for Precipitation Nowcasting With Multimodal Fusion 期刊论文
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 卷号: 17, 页码: 1299-1314
作者:  Jin, Qizhao;  Zhang, Xinbang;  Xiao, Xinyu;  Wang, Ying;  Meng, Gaofeng;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(8766Kb)  |  收藏  |  浏览/下载:77/6  |  提交时间:2024/02/21
Data mining  multimodal knowledge discovery  precipitation nowcasting  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1-15
作者:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  收藏  |  浏览/下载:196/49  |  提交时间:2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation