CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting 会议论文
, graz, 2019
作者:  Ye Bai;  Jiangyan Yi;  Zhengqi Wen;  Zhengkun Tian;  Chenghao Zhao;  Cunhang Fan
Adobe PDF(290Kb)  |  收藏  |  浏览/下载:188/68  |  提交时间:2021/06/25
Attention-Based Pedestrian Attribute Analysis 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 卷号: 28, 期号: 12, 页码: 6126-6140
作者:  Zichang Tan;  Yang Yang;  Jun Wan;  Hanyuan Hang;  Guodong Guo;  Stan Z. Li
Adobe PDF(3457Kb)  |  收藏  |  浏览/下载:258/50  |  提交时间:2020/10/27
Pedestrian attribute analysis  attention mechanism  pedestrian parsing  
Sequence Generation: From Both Sides to the Middle 会议论文
, Macao, China, August 10-16, 2019
作者:  Zhou, Long;  Zhang, Jiajun;  Zong, Chengqing;  Yu, Heng
浏览  |  Adobe PDF(738Kb)  |  收藏  |  浏览/下载:168/46  |  提交时间:2020/06/23
Image Captioning with Bidirectional Semantic Attention-Based Guiding of Long Short-Term Memory 期刊论文
NEURAL PROCESSING LETTERS, 2019, 卷号: 50, 期号: 1, 页码: 103-119
作者:  Pengfei Cao;  Yang, Zhongyi;  Sun, Liang;  Liang, Yanchun;  Yang, Mary Qu;  Guan, Renchu
Adobe PDF(1965Kb)  |  收藏  |  浏览/下载:358/43  |  提交时间:2019/12/16
Image captioning  Semantic attention mechanism  Convolution neural network  Bidirectional guiding LSTM  
Deep Multi-Modality Adversarial Networks for Unsupervised Domain Adaptation 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 卷号: 21, 期号: 9, 页码: 2419-2431
作者:  Ma, Xinhong;  Zhang, Tianzhu;  Xu, Changsheng
Adobe PDF(2142Kb)  |  收藏  |  浏览/下载:384/49  |  提交时间:2019/12/16
Unsupervised domain adaptation  triplet loss  stacked attention  multi-modality  social event recognition  
Sequence Generation: From Both Sides to the Middle 会议论文
, Macau, China, 2019
作者:  Long Zhou;  Jiajun Zhang;  Chengqing Zong;  Heng Yu
浏览  |  Adobe PDF(738Kb)  |  收藏  |  浏览/下载:200/46  |  提交时间:2019/10/09
Read, Watch, Listen, and Summarize: Multi-Modal Summarization for Asynchronous Text, Image, Audio and Video 期刊论文
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 卷号: 31, 期号: 5, 页码: 996-1009
作者:  Li, Haoran;  Zhu, Junnan;  Ma, Cong;  Zhang, Jiajun;  Zong, Chengqing
浏览  |  Adobe PDF(2826Kb)  |  收藏  |  浏览/下载:436/107  |  提交时间:2019/07/12
Summarization  multimedia  multi-modal  cross-modal  natural language processing  computer vision  
Scene text detection and recognition with advances in deep learning: a survey 期刊论文
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2019, 卷号: 22, 期号: 2, 页码: 143-162
作者:  Liu, Xiyan;  Meng, Gaofeng;  Pan, Chunhong
Adobe PDF(2418Kb)  |  收藏  |  浏览/下载:320/39  |  提交时间:2019/07/11
Natural image  Text detection  Text recognition  Survey  
复杂场景视频表示方法及其应用研究 学位论文
, 中国科学院自动化研究所: 中国科学院大学, 2019
作者:  于廷照
Adobe PDF(25438Kb)  |  收藏  |  浏览/下载:301/15  |  提交时间:2019/06/04
视频表示  时空卷积  注意力机制  低秩分解  无监督学习