CASIA OpenIR

浏览/检索结果: 共19条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Transformers in computational visual media: A survey 期刊论文
Computational Visual Media, 2021, 卷号: 8, 期号: 1, 页码: 33-62
作者:  Xu,Yifan;  Wei,Huapeng;  Lin,Minxuan;  Deng,Yingying;  Sheng,Kekai;  Zhang,Mengdan;  Tang,Fan;  Dong,Weiming;  Huang,Feiyue;  Xu,Changsheng
Adobe PDF(5366Kb)  |  收藏  |  浏览/下载:280/35  |  提交时间:2021/12/28
visual transformer  computational visual media (CVM)  high-level vision  low-level vision  image generation  multi-modal learning  
Learning to Model Relationships for Zero-Shot Video Classification 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 卷号: 43, 期号: 10, 页码: 3476-3491
作者:  Gao, Junyu;  Zhang, Tianzhu;  Xu, Changsheng
收藏  |  浏览/下载:249/0  |  提交时间:2021/11/04
Zero-shot video classification  graph neural networks  zero-shot learning  deep attention model  
Multi-Target Multi-Camera Tracking With Optical-Based Pose Association 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 卷号: 31, 期号: 8, 页码: 3105-3117
作者:  You, Sisi;  Yao, Hantao;  Xu, Changsheng
收藏  |  浏览/下载:212/0  |  提交时间:2021/11/02
Target tracking  Trajectory  Cameras  Visualization  Feature extraction  Proposals  Object detection  Multi-target multi-camera tracking  pose estimation  optical flow  pose matching  
SiamCPN: Visual tracking with the Siamese center-prediction network 期刊论文
COMPUTATIONAL VISUAL MEDIA, 2021, 卷号: 7, 期号: 2, 页码: 253-265
作者:  Chen, Dong;  Tang, Fan;  Dong, Weiming;  Yao, Hanxing;  Xu, Changsheng
收藏  |  浏览/下载:233/0  |  提交时间:2021/06/15
Siamese network  single object tracking  anchor-free  center point detection  
PEN: Pose-Embedding Network for Pedestrian Detection 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 卷号: 31, 期号: 3, 页码: 1150-1162
作者:  Jiao, Yifan;  Yao, Hantao;  Xu, Changsheng
收藏  |  浏览/下载:164/0  |  提交时间:2021/05/10
Visualization  Proposals  Detectors  Feature extraction  Object detection  Pose estimation  Fuses  Pedestrian detection  pedestrian recognization network  pose-embedding  pose information  
Emotion Knowledge Driven Video Highlight Detection 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3999-4013
作者:  Qi, Fan;  Yang, Xiaoshan;  Xu, Changsheng
收藏  |  浏览/下载:202/0  |  提交时间:2021/12/28
Visualization  Training data  Predictive models  Training  Semantics  Emotion recognition  Computational modeling  Deep ranking  knowledge graph  video highlight detection  
Adversarial Multimodal Network for Movie Story Question Answering 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 1744-1756
作者:  Yuan, Zhaoquan;  Sun, Siyuan;  Duan, Lixin;  Li, Changsheng;  Wu, Xiao;  Xu, Changsheng
收藏  |  浏览/下载:167/0  |  提交时间:2021/08/15
Knowledge discovery  Motion pictures  Visualization  Task analysis  Generators  Gallium nitride  Natural languages  Movie question answering  adversarial network  multimodal understanding  
Multimodal Disentangled Domain Adaption for Social Media Event Rumor Detection 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 4441-4454
作者:  Zhang, Huaiwen;  Qian, Shengsheng;  Fang, Quan;  Xu, Changsheng
收藏  |  浏览/下载:220/0  |  提交时间:2022/01/27
Social networking (online)  Feature extraction  Task analysis  Adaptation models  Writing  Visualization  Training  Disentanglement representation learning  domain adaptation  event rumor detection  social media  
Unsupervised Video Summarization via Relation-Aware Assignment Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3203-3214
作者:  Gao, Junyu;  Yang, Xiaoshan;  Zhang, Yingying;  Xu, Changsheng
Adobe PDF(3649Kb)  |  收藏  |  浏览/下载:312/62  |  提交时间:2021/11/03
Feature extraction  Training  Optimization  Semantics  Recurrent neural networks  Task analysis  Graph neural network  unsupervised learning  video summarization  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 2386-2397
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(2165Kb)  |  收藏  |  浏览/下载:321/44  |  提交时间:2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy