CASIA OpenIR

浏览/检索结果: 共15条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Global Instance Tracking: Locating Target More Like Humans 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 1, 页码: 576-592
作者:  Hu, Shiyu;  Zhao, Xin;  Huang, Lianghua;  Huang, Kaiqi
Adobe PDF(15055Kb)  |  收藏  |  浏览/下载:214/54  |  提交时间:2023/02/22
Global instance tracking  single object tracking  benchmark dataset  performance evaluation  human tracking ability  
Cross-Modality Synergy Network for Referring Expression Comprehension and Segmentation 期刊论文
Neurocomputing, 2022, 卷号: 467, 期号: /, 页码: 99-114
作者:  Li, Qianzhong;  Zhang, Yujia;  Sun, Shiying;  Wu, Jinting;  Zhao, Xiaoguang;  Tan, Min
Adobe PDF(4555Kb)  |  收藏  |  浏览/下载:301/44  |  提交时间:2021/12/28
Referring expression comprehension  Referring expression segmentation  Cross-modality synergy  Attention mechanism  
Transformers in computational visual media: A survey 期刊论文
Computational Visual Media, 2021, 卷号: 8, 期号: 1, 页码: 33-62
作者:  Xu,Yifan;  Wei,Huapeng;  Lin,Minxuan;  Deng,Yingying;  Sheng,Kekai;  Zhang,Mengdan;  Tang,Fan;  Dong,Weiming;  Huang,Feiyue;  Xu,Changsheng
Adobe PDF(5366Kb)  |  收藏  |  浏览/下载:264/34  |  提交时间:2021/12/28
visual transformer  computational visual media (CVM)  high-level vision  low-level vision  image generation  multi-modal learning  
Are You Confident That You Have Successfully Generated Adversarial Examples? 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 卷号: 31, 期号: 6, 页码: 2089-2099
作者:  Wang, Bo;  Zhao, Mengnan;  Wang, Wei;  Wei, Fei;  Qin, Zhan;  Ren, Kui
Adobe PDF(2235Kb)  |  收藏  |  浏览/下载:309/33  |  提交时间:2021/08/15
Perturbation methods  Iterative methods  Computational modeling  Neural networks  Security  Training  Robustness  Deep neural networks  adversarial examples  structural black box  buffer  
FA-GAN: Face Augmentation GAN for Deformation-Invariant Face Recognition 期刊论文
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 卷号: 16, 期号: 0, 页码: 2341-2355
作者:  Luo, Mandi;  Cao, Jie;  Ma, Xin;  Zhang, Xiaoyu;  He, Ran
Adobe PDF(4742Kb)  |  收藏  |  浏览/下载:318/54  |  提交时间:2021/04/21
Face recognition  Strain  Geometry  Frequency division multiplexing  Training  Task analysis  Semantics  Face augmentation  deformation-invariant face recognition  face disentanglement  graph convolutional networks  
Unsupervised Video Summarization via Relation-Aware Assignment Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3203-3214
作者:  Gao, Junyu;  Yang, Xiaoshan;  Zhang, Yingying;  Xu, Changsheng
Adobe PDF(3649Kb)  |  收藏  |  浏览/下载:292/60  |  提交时间:2021/11/03
Feature extraction  Training  Optimization  Semantics  Recurrent neural networks  Task analysis  Graph neural network  unsupervised learning  video summarization  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 2386-2397
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(2165Kb)  |  收藏  |  浏览/下载:307/42  |  提交时间:2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
Trip Purposes Mining From Mobile Signaling Data 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 卷号: 99, 期号: 99, 页码: 13
作者:  Li, Zhishuai;  Xiong, Gang;  Wei, Zebing;  Zhang, Yu;  Zheng, Meng;  Liu, Xiaoli;  Tarkoma, Sasu;  Huang, Min;  Lv, Yisheng;  Wu, Chuheng
Adobe PDF(3962Kb)  |  收藏  |  浏览/下载:346/70  |  提交时间:2022/01/27
Cellular networks  Trajectory  Semantics  Unsupervised learning  Supervised learning  Resource management  Public transportation  Trip purpose inference  cellular network data  latent Dirichlet allocation  travel behavior  big data  
Knowledge-driven Egocentric Multimodal Activity Recognition 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 卷号: 16, 期号: 4, 页码: 21
作者:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyu;  Sang, Jitao;  Xu, Changsheng
Adobe PDF(1875Kb)  |  收藏  |  浏览/下载:336/46  |  提交时间:2021/03/08
Egocentric videos  wearable sensors  graph neural networks  
Show, Tell, and Polish: Ruminant Decoding for Image Captioning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 8, 页码: 2149-2162
作者:  Guo, Longteng;  Liu, Jing;  Lu, Shichen;  Lu, Hanqing
Adobe PDF(4378Kb)  |  收藏  |  浏览/下载:201/30  |  提交时间:2020/08/31
Image captioning  Multi-pass decoding  Rumination