CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Dual Instance-Consistent Network for Cross-Domain Object Detection 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 6, 页码: 7338-7352
作者:  Jiao, Yifan;  Yao, Hantao;  Xu, Changsheng
收藏  |  浏览/下载:52/0  |  提交时间:2023/11/17
Feature extraction  Object detection  Detectors  Visualization  Proposals  Head  Task analysis  Cross-domain object detection  domain-specific description  dual instance-consistent network  
Learning Semantic-Aware Spatial-Temporal Attention for Interpretable Action Recognition 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 8, 页码: 5213-5224
作者:  Fu, Jie;  Gao, Junyu;  Xu, Changsheng
收藏  |  浏览/下载:360/0  |  提交时间:2022/09/19
Visualization  Semantics  Task analysis  Three-dimensional displays  Feature extraction  Solid modeling  Predictive models  Semantic-aware  spatial-temporal attention  interpretable  action recognition  
Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2986-2997
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(5681Kb)  |  收藏  |  浏览/下载:404/2  |  提交时间:2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability  
Weakly-Supervised Facial Expression Recognition in the Wild With Noisy Data 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 1800-1814
作者:  Zhang, Feifei;  Xu, Mingliang;  Xu, Changsheng
收藏  |  浏览/下载:264/0  |  提交时间:2022/06/10
Noise measurement  Face recognition  Data models  Task analysis  Training data  Training  Annotations  Facial expression recognition  noisy labeled data  clean labels  end-to-end  pose modeling  noise modeling  
Attribute-Induced Bias Eliminating for Transductive Zero-Shot Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 1933-1942
作者:  Yao, Hantao;  Min, Shaobo;  Zhang, Yongdong;  Xu, Changsheng
收藏  |  浏览/下载:233/0  |  提交时间:2022/06/10
Semantics  Visualization  Bridges  Training  Knowledge transfer  Image recognition  Topology  Transductive Zero-Shot Learning  Graph Attribute Embedding  Attribute-Induced Bias Eliminating  Semantic-Visual Alignment  
Tell, Imagine, and Search: End-to-end Learning for Composing Text and Image to Image Retrieval 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 卷号: 18, 期号: 2, 页码: 23
作者:  Zhang, Feifei;  Xu, Mingliang;  Xu, Changsheng
收藏  |  浏览/下载:230/0  |  提交时间:2022/06/10
Composing text and image to image retrieval  end-to-end  image generation  generative adversarial network  global-local  
Learning Video Moment Retrieval Without a Single Annotated Video 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 3, 页码: 1646-1657
作者:  Gao, Junyu;  Xu, Changsheng
收藏  |  浏览/下载:232/0  |  提交时间:2022/06/06
Visualization  Task analysis  Generators  Training  Graph neural networks  Semantics  Detectors  Video moment retrieval  graph neural network  unpaired learning  
Joint Expression Synthesis and Representation Learning for Facial Expression Recognition 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 3, 页码: 1681-1695
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(4827Kb)  |  收藏  |  浏览/下载:262/1  |  提交时间:2022/06/06
Face recognition  Task analysis  Generative adversarial networks  Image synthesis  Image recognition  Faces  Training  Facial expression recognition  facial image synthesis  generative adversarial network  representation learning  
Geometry Sensitive Cross-Modal Reasoning for Composed Query Based Image Retrieval 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 1000-1011
作者:  Zhang, Feifei;  Xu, Mingliang;  Xu, Changsheng
收藏  |  浏览/下载:263/0  |  提交时间:2022/02/16
Visualization  Image retrieval  Semantics  Cognition  Geometry  Task analysis  Electronic mail  Composed query based image retrieval  semantic gap  spatial structure  inter-modal attention  text-guided visual reasoning