CASIA OpenIR

Browse/Search results: 7 items in total, showing items 1-7

Structure Preserving Convolutional Attention for Image Captioning (Journal article)
APPLIED SCIENCES-BASEL, 2019, Volume: 9, Issue: 14, Pages: 10
Authors:  Lu, Shichen;  Hu, Ruimin;  Liu, Jing;  Guo, Longteng;  Zheng, Fei
Adobe PDF(2351Kb)  |  Views/Downloads: 300/41  |  Submitted: 2019/12/16
image captioning  attention  spatial structure  deep learning  computer vision  
Pyrboxes: An efficient multi-scale scene text detector with feature pyramids (Journal article)
PATTERN RECOGNITION LETTERS, 2019, Volume: 125, Issue: 2019, Pages: 228-234
Authors:  Sheng, Fenfen;  Chen, Zhineng;  Zhang, Wei;  Xu, Bo
Adobe PDF(1558Kb)  |  Views/Downloads: 353/57  |  Submitted: 2019/12/16
Scene text detection  Multi-scale text detection  Grouped pyramid module  Efficient and effective  
Boosted Transformer for Image Captioning (Journal article)
APPLIED SCIENCES-BASEL, 2019, Volume: 9, Issue: 16, Pages: 15
Authors:  Li, Jiangyun;  Yao, Peng;  Guo, Longteng;  Zhang, Weicun
Adobe PDF(2184Kb)  |  Views/Downloads: 334/47  |  Submitted: 2019/12/16
image captioning  self-attention  deep learning  transformer  
Inductive Zero-Shot Image Annotation via Embedding Graph (Journal article)
IEEE ACCESS, 2019, Volume: 7, Pages: 107816-107830
Authors:  Wang, Fangxin;  Liu, Jie;  Zhang, Shuwu;  Zhang, Guixuan;  Li, Yuejun;  Yuan, Fei
Adobe PDF(1472Kb)  |  Views/Downloads: 371/97  |  Submitted: 2019/10/08
Contextualized word embeddings  graph convolutional network  image annotation  Node2Vec  zero-shot  
Name-face association with web facial image supervision (Journal article)
MULTIMEDIA SYSTEMS, 2019, Volume: 25, Issue: 1, Pages: 1-20
Authors:  Chen, Zhineng;  Zhang, Wei;  Deng, Bin;  Xie, Hongtao;  Gu, Xiaoyan
Adobe PDF(3705Kb)  |  Views/Downloads: 324/32  |  Submitted: 2019/07/12
Name-face association  Image matching  Multimedia fusion  Web facial images  Weakly supervised  
Read, Watch, Listen, and Summarize: Multi-Modal Summarization for Asynchronous Text, Image, Audio and Video (Journal article)
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, Volume: 31, Issue: 5, Pages: 996-1009
Authors:  Li, Haoran;  Zhu, Junnan;  Ma, Cong;  Zhang, Jiajun;  Zong, Chengqing
Adobe PDF(2826Kb)  |  Views/Downloads: 430/107  |  Submitted: 2019/07/12
Summarization  multimedia  multi-modal  cross-modal  natural language processing  computer vision  
Scene text detection and recognition with advances in deep learning: a survey (Journal article)
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2019, Volume: 22, Issue: 2, Pages: 143-162
Authors:  Liu, Xiyan;  Meng, Gaofeng;  Pan, Chunhong
Adobe PDF(2418Kb)  |  Views/Downloads: 315/37  |  Submitted: 2019/07/11
Natural image  Text detection  Text recognition  Survey