CASIA OpenIR

浏览/检索结果: 共11条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 6906-6916
作者:  Wang, Wenxuan;  He, Xingjian;  Zhang, Yisi;  Guo, Longteng;  Shen, Jiachen;  Li, Jiangyun;  Liu, Jing
收藏  |  浏览/下载:2/0  |  提交时间:2024/07/03
Referring image segmentation  cross-modality guidance  masked self-distillation  vision and language  
Non-Maximum Suppression Guided Label Assignment for Object Detection in Crowd Scenes 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 2207-2218
作者:  Jiang, Hangzhi;  Zhang, Xin;  Xiang, Shiming
收藏  |  浏览/下载:1/0  |  提交时间:2024/07/03
Object detection  Crowd scenes  Label assignment  Non-maximum suppression  
Human Parsing With Part-Aware Relation Modeling 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 25, 页码: 2601-2612
作者:  Zhang, Xiaomei;  Chen, Yingying;  Tang, Ming;  Wang, Jinqiao;  Zhu, Xiangyu;  Lei, Zhen
Adobe PDF(6053Kb)  |  收藏  |  浏览/下载:165/22  |  提交时间:2023/11/17
Human parsing  modeling  part-aware relation  
Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2986-2997
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(5681Kb)  |  收藏  |  浏览/下载:401/1  |  提交时间:2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability  
Weakly-Supervised Facial Expression Recognition in the Wild With Noisy Data 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 1800-1814
作者:  Zhang, Feifei;  Xu, Mingliang;  Xu, Changsheng
收藏  |  浏览/下载:264/0  |  提交时间:2022/06/10
Noise measurement  Face recognition  Data models  Task analysis  Training data  Training  Annotations  Facial expression recognition  noisy labeled data  clean labels  end-to-end  pose modeling  noise modeling  
Attribute-Induced Bias Eliminating for Transductive Zero-Shot Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 1933-1942
作者:  Yao, Hantao;  Min, Shaobo;  Zhang, Yongdong;  Xu, Changsheng
收藏  |  浏览/下载:232/0  |  提交时间:2022/06/10
Semantics  Visualization  Bridges  Training  Knowledge transfer  Image recognition  Topology  Transductive Zero-Shot Learning  Graph Attribute Embedding  Attribute-Induced Bias Eliminating  Semantic-Visual Alignment  
Capturing Relevant Context for Visual Tracking 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 4232-4244
作者:  Zhang, Yuping;  Ma, Bo;  Wu, Jiahao;  Huang, Lianghua;  Shen, Jianbing
收藏  |  浏览/下载:144/0  |  提交时间:2021/12/28
Local neighborhood graph  long-range dependencies  long-term tracking  visual object tracking  
Unsupervised Video Summarization via Relation-Aware Assignment Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3203-3214
作者:  Gao, Junyu;  Yang, Xiaoshan;  Zhang, Yingying;  Xu, Changsheng
Adobe PDF(3649Kb)  |  收藏  |  浏览/下载:342/67  |  提交时间:2021/11/03
Feature extraction  Training  Optimization  Semantics  Recurrent neural networks  Task analysis  Graph neural network  unsupervised learning  video summarization  
Bidirectional Attention-Recognition Model for Fine-Grained Object Classification 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 7, 页码: 1785-1795
作者:  Liu, Chuanbin;  Xie, Hongtao;  Zha, Zhengjun;  Yu, Lingyun;  Chen, Zhineng;  Zhang, Yongdong
收藏  |  浏览/下载:207/0  |  提交时间:2020/08/03
Fine-grained object classification  interpretable machine learning  visual attention  pattern recognition  data augmentation  
Cross-Modality Bridging and Knowledge Transferring for Image Understanding 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 卷号: 21, 期号: 10, 页码: 2675-2685
作者:  Yan, Chenggang;  Li, Liang;  Zhang, Chunjie;  Liu, Bingtao;  Zhang, Yongdong;  Dai, Qionghai
收藏  |  浏览/下载:299/0  |  提交时间:2019/12/16
Object and scene recognition  image semantic search  cross-modality bridging  multi-task learning  knowledge transferring