CASIA OpenIR

浏览/检索结果: 共12条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 3469-3480
作者:  Peng, Fang;  Yang, Xiaoshan;  Xiao, Linhui;  Wang, Yaowei;  Xu, Changsheng
收藏  |  浏览/下载:6/0  |  提交时间:2024/07/03
Few-shot  image classification  vision-language models  
CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 4334-4347
作者:  Xiao, Linhui;  Yang, Xiaoshan;  Peng, Fang;  Yan, Ming;  Wang, Yaowei;  Xu, Changsheng
收藏  |  浏览/下载:27/0  |  提交时间:2024/05/30
Grounding  Reliability  Adaptation models  Task analysis  Visualization  Data models  Annotations  Visual grounding  curriculum learning  pseudo-language label  and vision-language models  
Soccer player tracking and data correction based on attention with full-field videos 期刊论文
VISUAL COMPUTER, 2024, 页码: 13
作者:  Yang, Chao;  Yang, Meng;  Li, Hongyu;  Jiang, Linlu;  Suo, Xiang;  Li, Zhen;  Meng, Weiliang;  Mao, Lijuan
Adobe PDF(8335Kb)  |  收藏  |  浏览/下载:46/14  |  提交时间:2024/05/30
Soccer player tracking  Data correction  Field mapping  
Generalized Feature Learning for Detection of Novel Objects 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 卷号: 16, 期号: 1, 页码: 388-395
作者:  Jierui Liu;  Xilong Liu;  Zhiqiang Cao;  Junzhi Yu;  Min Tan
Adobe PDF(8844Kb)  |  收藏  |  浏览/下载:25/6  |  提交时间:2024/05/28
Object detection  few-shot  generalized features  source aggregation  
Bird's-Eye-View Semantic Segmentation With Two-Stream Compact Depth Transformation and Feature Rectification 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 卷号: 8, 期号: 11, 页码: 4546-4558
作者:  Liu, Jierui;  Cao, Zhiqiang;  Yang, Jing;  Liu, Xilong;  Yang, Yuequan;  Qu, Zhiyou
Adobe PDF(21890Kb)  |  收藏  |  浏览/下载:76/11  |  提交时间:2024/03/27
Bird's-eye-view  semantic segmentation  two-stream compact depth transformation  feature rectification  
DomainFeat: Learning Local Features With Domain Adaptation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 1, 页码: 46-59
作者:  Xu, Rongtao;  Wang, Changwei;  Xu, Shibiao;  Meng, Weiliang;  Zhang, Yuyang;  Fan, Bin;  Zhang, Xiaopeng
Adobe PDF(6039Kb)  |  收藏  |  浏览/下载:85/11  |  提交时间:2024/03/26
Feature extraction  Location awareness  Visualization  Robustness  Image matching  Detectors  Decoding  Local features  domain adaptation  cross-domain data  consistency loss  
CGFormer: ViT-Based Network for Identifying Computer-Generated Images With Token Labeling 期刊论文
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 卷号: 19, 页码: 235-250
作者:  Quan, Weize;  Deng, Pengfei;  Wang, Kai;  Yan, Dong-Ming
Adobe PDF(2517Kb)  |  收藏  |  浏览/下载:74/3  |  提交时间:2024/02/22
CG image forensics  transformer  token labeling  generalization  robustness  
SpatioTemporal Inference Network for Precipitation Nowcasting With Multimodal Fusion 期刊论文
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 卷号: 17, 页码: 1299-1314
作者:  Jin, Qizhao;  Zhang, Xinbang;  Xiao, Xinyu;  Wang, Ying;  Meng, Gaofeng;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(8766Kb)  |  收藏  |  浏览/下载:78/6  |  提交时间:2024/02/21
Data mining  multimodal knowledge discovery  precipitation nowcasting  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1-15
作者:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  收藏  |  浏览/下载:196/49  |  提交时间:2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
Wave-Like Class Activation Map With Representation Fusion for Weakly-Supervised Semantic Segmentation 期刊论文
IEEE Transactions on Multimedia, 2024, 卷号: 26, 页码: 581-592
作者:  Xu RT(许镕涛);  Wang CW(王常维);  Xu SB(徐士彪);  Meng WL(孟维亮);  Zhang XP(张晓鹏)
Adobe PDF(6463Kb)  |  收藏  |  浏览/下载:518/78  |  提交时间:2023/05/04
Class activation map  representation fusion  wave function  weakly supervised semantic segmentation