CASIA OpenIR

Browse/Search Results:  1-10 of 726 Help

Filters    
Selected(0)Clear Items/Page:    Sort:
Adaptive Attention Annotation Model: Optimizing the Prediction Path Through Dependency Fusion 期刊论文
KSII Transactions on Internet and Information Systems, 2019, 期号: 9, 页码: 4665-4683
Authors:  Wang Fangxin(王方心);  Liu Jie;  Zhang Shuwu;  Zhang Guixuan;  Zheng Yang;  Li Xiaoqian;  Liang Wei;  Li Yuejun
View  |  Adobe PDF(1061Kb)  |  Favorite  |  View/Download:13/2  |  Submit date:2019/10/08
Image Annotation  Multiple Dependencies  Self-attention  Prediction Path  Triplet Margin Loss  
Blind image quality assessment via learnable attention-based pooling 期刊论文
PATTERN RECOGNITION, 2019, 卷号: 91, 页码: 332-344
Authors:  Gu, Jie;  Meng, Gaofeng;  Xiang, Shiming;  Pan, Chunhong
View  |  Adobe PDF(3081Kb)  |  Favorite  |  View/Download:108/40  |  Submit date:2019/05/15
Image quality assessment  Perceptual image quality  Visual attention  Convolutional neural network  Learnable pooling  
Improving visual question answering using dropout and enhanced question encoder 期刊论文
PATTERN RECOGNITION, 2019, 卷号: 90, 期号: 1, 页码: 404-414
Authors:  Fang, Zhiwei;  Liu, Jing;  Li, Yong;  Qiao, Yanyuan;  Lu, Hanqing
View  |  Adobe PDF(1624Kb)  |  Favorite  |  View/Download:58/12  |  Submit date:2019/04/23
Visual question answering  Coherent dropout  Siamese dropout  Enhanced question encoder  
Local Semantic-Aware Deep Hashing With Hamming-Isometric Quantization 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 卷号: 28, 期号: 6, 页码: 2665-2679
Authors:  Wang, Yunbo;  Liang, Jian;  Cao, Dong;  Sun, Zhenan
View  |  Adobe PDF(1672Kb)  |  Favorite  |  View/Download:44/9  |  Submit date:2019/04/23
Image retrieval  deep hashing  similarity-preserving  local structures  Hamming-isometric  
The ParallelEye Dataset: A Large Collection of Virtual Images for Traffic Vision Research 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 卷号: 20, 期号: 6, 页码: 2072-2084
Authors:  Li, Xuan;  Wang, Kunfeng;  Tian, Yonglin;  Yan, Lan;  Deng, Fang;  Wang, Fei-Yue
Favorite  |  View/Download:14/0  |  Submit date:2019/07/11
Traffic vision  complex environments  parallel vision  artificial scenes  ParallelEye  virtual dataset  
Inductive Zero-Shot Image Annotation via Embedding Graph 期刊论文
IEEE Access, 2019, 卷号: 7, 期号: 0, 页码: 107816-107830
Authors:  Wang Fangxin(王方心);  Liu Jie;  Zhang Shuwu;  Zhang Guixuan;  Li Yuejun;  Yuan Fei
View  |  Adobe PDF(1472Kb)  |  Favorite  |  View/Download:18/6  |  Submit date:2019/10/08
Contextualized Word Embeddings  Graph Convolutional Network  Image Annotation  Node2vec  Zero-shot  
Real-Time Multi-Scale Face Detector on Embedded Devices 期刊论文
Sensors, 2019, 卷号: 2019, 期号: 9, 页码: 2158
Authors:  Zhao X(赵旭);  Liang XQ(梁孝庆);  Zhao CY(赵朝阳);  Tang M(唐明);  Wang JQ(王金桥)
View  |  Adobe PDF(3135Kb)  |  Favorite  |  View/Download:47/10  |  Submit date:2019/05/16
人脸检测,目标检测,轻量级网络  
A Hierarchical Contextual Attention-based Network for Sequential Recommendation 期刊论文
Neurocomputing, 2019, 期号: no, 页码: no
Authors:  Qiang Cui;  Shu Wu;  Yan Huang;  Liang Wang
View  |  Adobe PDF(545Kb)  |  Favorite  |  View/Download:30/8  |  Submit date:2019/05/09
Sequential Recommendation  Short-term Interest  Context  Attention Mechanism  Recurrent Neural Network  
Joint face alignment and segmentation via deep multi-task learning 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 卷号: 78, 期号: 10, 页码: 13131-13148
Authors:  Zhao, Yucheng;  Tang, Fan;  Dong, Weiming;  Huang, Feiyue;  Zhang, Xiaopeng
View  |  Adobe PDF(3380Kb)  |  Favorite  |  View/Download:187/71  |  Submit date:2018/05/04
Face alignment  Face segmentation  Multi-task learning  Virtual makeup  Face swap  
Read, Watch, Listen, and Summarize: Multi-Modal Summarization for Asynchronous Text, Image, Audio and Video 期刊论文
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 卷号: 31, 期号: 5, 页码: 996-1009
Authors:  Li, Haoran;  Zhu, Junnan;  Ma, Cong;  Zhang, Jiajun;  Zong, Chengqing
View  |  Adobe PDF(2826Kb)  |  Favorite  |  View/Download:11/0  |  Submit date:2019/07/12
Summarization  multimedia  multi-modal  cross-modal  natural language processing  computer vision