CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 25 items in total, showing items 1-10

BEVBert: Multimodal Map Pre-training for Language-guided Navigation (Conference paper)
Proceedings of the IEEE International Conference on Computer Vision, Paris, France, 2023-10-2
Authors:  Dong An;  Yuankai Qi;  Yangguang Li;  Yan Huang;  Liang Wang;  Tieniu Tan;  Jing Shao
Adobe PDF (1722 KB)  |  Views/Downloads: 24/6  |  Submitted: 2024/05/28
Neighbor-view Enhanced Model for Vision and Language Navigation (Conference paper)
Proceedings of the ACM International Conference on Multimedia, Chengdu, China, 2021-10-20
Authors:  Dong An;  Yuankai Qi;  Yan Huang;  Qi Wu;  Liang Wang;  Tieniu Tan
Adobe PDF (2412 KB)  |  Views/Downloads: 9/3  |  Submitted: 2024/05/28
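视觉语言导航研究进展 [Research Progress in Vision-and-Language Navigation] (Journal article)
自动化学报 (Acta Automatica Sinica), 2023, vol. 49, no. 1, pp. 1-14
Authors:  司马双霖;  黄岩;  何科技;  安东;  袁辉;  王亮
Adobe PDF (6272 KB)  |  Views/Downloads: 32/9  |  Submitted: 2024/05/09
Vision-and-language navigation  Vision-language understanding  Cross-modal matching  Embodied intelligence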
Latent Structure Mining With Contrastive Modality Fusion for Multimedia Recommendation (Journal article)
IEEE Transactions on Knowledge and Data Engineering, 2023, vol. 35, no. 9, pp. 9154-9167
Authors:  Zhang, Jinghao;  Zhu, Yanqiao;  Liu, Qiang;  Zhang, Mengqi;  Wu, Shu;  Wang, Liang
Views/Downloads: 122/0  |  Submitted: 2023/11/17
Multimedia recommendation  graph structure learning  contrastive learning  
Second-Order Global Attention Networks for Graph Classification and Regression (Conference paper)
Beijing, China, August 27-28, 2022
Authors:  Hu Fenyu;  Cui Zeyu;  Wu Shu;  Liu Qiang;  Wu Jinlin;  Wang Liang;  Tan Tieniu
Adobe PDF (69424 KB)  |  Views/Downloads: 208/70  |  Submitted: 2023/07/06
Identifying Sinus Invasion in Meningioma Patients before Surgery with Deep Learning (Conference paper)
Online, 2022-4
Authors:  Qi Qiu;  Kai Sun;  Jing Zhang;  Panpan Liu;  Liang Wang;  Junting Zhang;  Junlin Zhou;  Zhenyu Liu;  Jie Tian
Adobe PDF (277 KB)  |  Views/Downloads: 177/47  |  Submitted: 2023/06/28
Deep learning  Meningioma  Sinus invasion  Multimodal fusion  
VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation (Conference paper)
Vancouver, Canada, 2023-6
Authors:  Luo, Zhengxiong;  Chen, Dayou;  Zhang, Yingya;  Huang, Yan;  Wang, Liang;  Shen, Yujun;  Zhao, Deli;  Zhou, Jingren;  Tan, Tieniu
Adobe PDF (6699 KB)  |  Views/Downloads: 194/48  |  Submitted: 2023/06/09
CASIA-E: A Large Comprehensive Dataset for Gait Recognition (Journal article)
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, vol. 45, no. 3, pp. 2801-2815
Authors:  Chunfeng Song;  Yongzhen Huang;  Weining Wang;  Liang Wang
Adobe PDF (2441 KB)  |  Views/Downloads: 191/40  |  Submitted: 2023/05/04
Joint Token and Feature Alignment Framework for Text-Based Person Search (Journal article)
IEEE Signal Processing Letters, 2022, vol. 29, pp. 2238-2242
Authors:  Li, Shangze;  Lu, Andong;  Huang, Yan;  Li, Chenglong;  Wang, Liang
Views/Downloads: 214/0  |  Submitted: 2022/12/27
Feature extraction  Visualization  Representation learning  Logic gates  Image reconstruction  Transformers  Training  Cross-modal generation  feature alignment  text-based person search  token alignment  transformer  
RGBT Tracking by Trident Fusion Network (Journal article)
IEEE Transactions on Circuits and Systems for Video Technology, 2022, vol. 32, no. 2, pp. 579-592
Authors:  Zhu, Yabin;  Li, Chenglong;  Tang, Jin;  Luo, Bin;  Wang, Liang
Views/Downloads: 225/0  |  Submitted: 2022/06/06
Feature extraction  Convolution  Target tracking  Training  Aggregates  Visualization  Benchmark testing  RGBT tracking  feature aggregation  feature pruning  trident architecture