CASIA OpenIR

Browse/Search results: 50 records in total, showing records 1-10

Completed Part Transformer for Person Re-Identification (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 2303-2313
Authors:  Zhang, Zhong;  He, Di;  Liu, Shuang;  Xiao, Baihua;  Durrani, Tariq S.
Views/Downloads: 3/0  |  Submitted: 2024/07/03
Person ReID  transformer  adaptive refined tokens  
A Two-Stream Hybrid Convolution-Transformer Network Architecture for Clothing-Change Person Re-Identification (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 5326-5339
Authors:  Wu, Junyi;  Huang, Yan;  Gao, Min;  Gao, Zhipeng;  Zhao, Jianqiang;  Zhang, Huiji;  Zhang, Anguo
Views/Downloads: 1/0  |  Submitted: 2024/07/03
Clothing-change person re-identification  ID-unique feature  feature supplement module  hierarchical supervision  
Exploring Rich Semantics for Open-Set Action Recognition (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 5410-5421
Authors:  Hu, Yufan;  Gao, Junyu;  Dong, Jianfeng;  Fan, Bin;  Liu, Hongmin
Views/Downloads: 5/0  |  Submitted: 2024/07/03
Semantics  Prototypes  Knowledge graphs  Visualization  Task analysis  Uncertainty  Training  Open-set action recognition  video action recognition  semantic relation modeling  
SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 3469-3480
Authors:  Peng, Fang;  Yang, Xiaoshan;  Xiao, Linhui;  Wang, Yaowei;  Xu, Changsheng
Views/Downloads: 6/0  |  Submitted: 2024/07/03
Few-shot  image classification  vision-language models  
Contextualized Relation Predictive Model for Self-Supervised Group Activity Representation Learning (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 353-366
Authors:  Zhou, Wanting;  Kong, Longteng;  Han, Yushan;  Qin, Jie;  Sun, Zhenan
Views/Downloads: 2/0  |  Submitted: 2024/07/03
Group activity representation learning  group activity recognition  self-supervised learning  transformer  predictive coding  
CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 4334-4347
Authors:  Xiao, Linhui;  Yang, Xiaoshan;  Peng, Fang;  Yan, Ming;  Wang, Yaowei;  Xu, Changsheng
Views/Downloads: 27/0  |  Submitted: 2024/05/30
Grounding  Reliability  Adaptation models  Task analysis  Visualization  Data models  Annotations  Visual grounding  curriculum learning  pseudo-language label  vision-language models
Binary Similarity Few-Shot Object Detection With Modeling of Hard Negative Samples (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 4805-4818
Authors:  Lu, Yue;  Chen, Xingyu;  Wu, Zhengxing;  Tan, Min;  Yu, Junzhi
Views/Downloads: 33/0  |  Submitted: 2024/05/30
Few-shot learning  object detection  computer vision  deep learning  
AnANet: Association and Alignment Network for Modeling Implicit Relevance in Cross-Modal Correlation Classification (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, Volume: 25, Pages: 7867-7880
Authors:  Xu, Nan;  Wang, Junyan;  Tian, Yuan;  Zhang, Ruike;  Mao, Wenji
Views/Downloads: 46/0  |  Submitted: 2024/03/26
Association and alignment network  classification scheme  cross-modal correlation  implicit relevance  
Cross-Lingual Text Image Recognition via Multi-Hierarchy Cross-Modal Mimic (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, Volume: 25, Pages: 4830-4841
Authors:  Chen, Zhuo;  Yin, Fei;  Yang, Qing;  Liu, Cheng-Lin
Views/Downloads: 54/0  |  Submitted: 2024/02/22
Cross-lingual text image recognition  cross-modal mimic  multihierarchy mimic  
Adversarial Learning Guided Task Relatedness Refinement for Multi-Task Deep Learning (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, Volume: 25, Pages: 6946-6957
Authors:  Fang, Yuchun;  Cai, Sirui;  Cao, Yiting;  Li, Zhengchen;  Zhang, Zhaoxiang
Views/Downloads: 56/0  |  Submitted: 2024/02/22
Multi-task learning  deep learning  task relatedness