CASIA OpenIR

浏览/检索结果: 共152条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Vision Transformers with Hierarchical Attention 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 670-683
作者:  Yun Liu;   Yu-Huan Wu;   Guolei Sun;    Le Zhang;  Ajad Chhatkuli;   Luc Van Gool
Adobe PDF(1358Kb)  |  收藏  |  浏览/下载:16/6  |  提交时间:2024/07/18
Vision transformer  hierarchical attention  global attention  local attention  scene understanding  
NExT-OOD: Overcoming Dual Multiple-Choice VQA Biases 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 页码: 1913-1931
作者:  Zhang Xi(张熙);  Feifei Zhang;  Changsheng Xu
Adobe PDF(4719Kb)  |  收藏  |  浏览/下载:25/6  |  提交时间:2024/07/08
Image captioning: Semantic selection unit with stacked residual attention 期刊论文
IMAGE AND VISION COMPUTING, 2024, 卷号: 144, 页码: 12
作者:  Song, Lifei;  Li, Fei;  Wang, Ying;  Liu, Yu;  Wang, Yuanhua;  Xiang, Shiming
收藏  |  浏览/下载:8/0  |  提交时间:2024/07/03
Image captioning  Semantic attributes  Semantic selection unit  Transformer  Stacked residual attention  
Memory-Adaptive Vision-and-Language Navigation 期刊论文
Pattern Recognition, 2024, 卷号: 153, 页码: 110511
作者:  Keji He;  Ya Jing;  Yan Huang;  Zhihe Lu;  Dong An;  Liang Wang
Adobe PDF(3831Kb)  |  收藏  |  浏览/下载:43/17  |  提交时间:2024/06/26
Vision-and-Language Navigation  Memory bank  History noises  Memory-Adaptive Model  
A Simple and Strong Baseline for Universal Targeted Attacks on Siamese Visual Tracking 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2022, 卷号: 32, 期号: 6, 页码: 3880-3894
作者:  Li, Zhenbang;  Shi, Yaya;  Gao, Jin;  Wang, Shaoru;  Li, Bing;  Liang, Pengpeng;  Hu, Weiming
Adobe PDF(4397Kb)  |  收藏  |  浏览/下载:40/22  |  提交时间:2024/06/21
Improving diversity of speech‐driven gesture generation with memory networks as dynamic dictionaries. 期刊论文
CAAI Transactions on Intelligence Technology., 2024, 页码: 1–15
作者:  Zeyu Zhao;  Nan Gao;  Zhi Zeng;  Guixuan Zhang;  Jie Liu;  Shuwu Zhang
Adobe PDF(2067Kb)  |  收藏  |  浏览/下载:53/19  |  提交时间:2024/06/20
The survey on multi-source data fusion in cyber-physical-social systems: Foundational infrastructure for industrial metaverses and industries 5.0 期刊论文
Information Fusion, 2024, 卷号: 107, 页码: 1-16
作者:  Xiao Wang;  Yutong Wang;  Jing Yang;  Xiaofeng Jia;  Lijun Li;  Weiping Ding;  Fei-Yue Wang
Adobe PDF(4446Kb)  |  收藏  |  浏览/下载:46/7  |  提交时间:2024/06/06
Multi-source data fusion  CPSS  Industrial metaverses  Parallel manufacturing  Social manufacturing  
SlowFastFormer for 3D human pose estimation 期刊论文
Computer Vision and Image Understanding, 2024, 卷号: 243, 期号: 243, 页码: 103992
作者:  Zhou Lu;  Chen Yingying;  Wang Jinqiao
Adobe PDF(989Kb)  |  收藏  |  浏览/下载:51/19  |  提交时间:2024/06/03
SlowFastFormer  Transformer  Blending  3D human pose estimation  Hierarchical supervision  
Dual-Path Transformer for 3D Human Pose Estimation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 5, 页码: 3260-3270
作者:  Zhou Lu;  Chen Yingying;  Wang Jinqiao
Adobe PDF(2410Kb)  |  收藏  |  浏览/下载:46/20  |  提交时间:2024/06/03
GPT-4V with Emotion: A Zero-shot Benchmark for Generalized Emotion Recognition 期刊论文
Information Fusion, 2024, 页码: 1-12
作者:  Zheng Lian;  Licai Sun;  Haiyang Sun;  Kang Chen;  Zhuofan Wen;  Hao Gu;  Bin Liu;  Jianhua Tao
Adobe PDF(6888Kb)  |  收藏  |  浏览/下载:63/9  |  提交时间:2024/05/31