CASIA OpenIR

浏览/检索结果: 共20条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 4334-4347
作者:  Xiao, Linhui;  Yang, Xiaoshan;  Peng, Fang;  Yan, Ming;  Wang, Yaowei;  Xu, Changsheng
收藏  |  浏览/下载:6/0  |  提交时间:2024/05/30
Grounding  Reliability  Adaptation models  Task analysis  Visualization  Data models  Annotations  Visual grounding  curriculum learning  pseudo-language label  and vision-language models  
Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2986-2997
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
收藏  |  浏览/下载:332/0  |  提交时间:2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability  
Holographic Feature Learning of Egocentric-Exocentric Videos for Multi-Domain Action Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2273-2286
作者:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyun;  Xu, Changsheng
Adobe PDF(2409Kb)  |  收藏  |  浏览/下载:341/66  |  提交时间:2022/07/25
Videos  Feature extraction  Visualization  Task analysis  Computational modeling  Target recognition  Prototypes  Egocentric videos  exocentric videos  holographic feature  multi-domain  action recognition  
Attribute-Induced Bias Eliminating for Transductive Zero-Shot Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 1933-1942
作者:  Yao, Hantao;  Min, Shaobo;  Zhang, Yongdong;  Xu, Changsheng
收藏  |  浏览/下载:216/0  |  提交时间:2022/06/10
Semantics  Visualization  Bridges  Training  Knowledge transfer  Image recognition  Topology  Transductive Zero-Shot Learning  Graph Attribute Embedding  Attribute-Induced Bias Eliminating  Semantic-Visual Alignment  
Heterogeneous Hierarchical Feature Aggregation Network for Personalized Micro-Video Recommendation 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 805-818
作者:  Cai, Desheng;  Qian, Shengsheng;  Fang, Quan;  Xu, Changsheng
收藏  |  浏览/下载:276/0  |  提交时间:2022/06/06
Graph neural networks  Task analysis  Semantics  Aggregates  Data structures  Collaboration  Visualization  Heterogeneous graph  micro-video recommendation  multi-modal  
Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 338-351
作者:  Zheng, Aihua;  Hu, Menglan;  Jiang, Bo;  Huang, Yan;  Yan, Yan;  Luo, Bin
收藏  |  浏览/下载:240/0  |  提交时间:2022/03/17
Visualization  Task analysis  Measurement  Speech recognition  Videos  Location awareness  Image recognition  Adversarial learning  audio-visual matching  cross-modal learning  metric learning  
Multimodal Disentangled Domain Adaption for Social Media Event Rumor Detection 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 4441-4454
作者:  Zhang, Huaiwen;  Qian, Shengsheng;  Fang, Quan;  Xu, Changsheng
收藏  |  浏览/下载:225/0  |  提交时间:2022/01/27
Social networking (online)  Feature extraction  Task analysis  Adaptation models  Writing  Visualization  Training  Disentanglement representation learning  domain adaptation  event rumor detection  social media  
Emotion Knowledge Driven Video Highlight Detection 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3999-4013
作者:  Qi, Fan;  Yang, Xiaoshan;  Xu, Changsheng
收藏  |  浏览/下载:208/0  |  提交时间:2021/12/28
Visualization  Training data  Predictive models  Training  Semantics  Emotion recognition  Computational modeling  Deep ranking  knowledge graph  video highlight detection  
Domain-Oriented Semantic Embedding for Zero-Shot Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3919-3930
作者:  Min, Shaobo;  Yao, Hantao;  Xie, Hongtao;  Zha, Zheng-Jun;  Zhang, Yongdong
收藏  |  浏览/下载:248/0  |  提交时间:2021/12/28
Semantics  Visualization  Image recognition  Image reconstruction  Training  Gallium nitride  Search problems  Zero-shot learning  multi-modality embedding  recognition  
Visual Question Answering With Dense Inter- and Intra-Modality Interactions 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3518-3529
作者:  Liu, Fei;  Liu, Jing;  Fang, Zhiwei;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2891Kb)  |  收藏  |  浏览/下载:296/61  |  提交时间:2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions