CASIA OpenIR

Browse/Search Results: 27 items in total, showing items 1-10

PIDray: A Large-Scale X-ray Benchmark for Real-World Prohibited Item Detection (Aug, 10.1007/s11263-023-01855-1, 2023) (Journal article)
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, Volume: 131, Pages: 3170–3192
Authors: Zhang, Libo; Jiang, Lutao; Ji, Ruyi; Fan, Heng
Adobe PDF (3227Kb) | Views/Downloads: 40/0 | Submitted: 2023/11/17
Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 9, Pages: 5009-5021
Authors: Ji, Ruyi; Li, Jiaying; Zhang, Libo; Liu, Jing; Wu, Yanjun
Adobe PDF (4636Kb) | Views/Downloads: 130/3 | Submitted: 2023/11/16
Transformer  multi-grained assembly  fine-grained visual classification  
Show, Tell, and Polish: Ruminant Decoding for Image Captioning (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, Volume: 22, Issue: 8, Pages: 2149-2162
Authors: Guo, Longteng; Liu, Jing; Lu, Shichen; Lu, Hanqing
Adobe PDF (4378Kb) | Views/Downloads: 216/33 | Submitted: 2020/08/31
Image captioning  Multi-pass decoding  Rumination  
Food det: Detecting foods in refrigerator with supervised transformer network (Journal article)
NEUROCOMPUTING, 2020, Volume: 379, Issue: 28, Pages: 162-171
Authors: Zhu, Yousong; Zhao, Xu; Zhao, Chaoyang; Wang, Jinqiao; Lu, Hanqing
Adobe PDF (2790Kb) | Views/Downloads: 489/76 | Submitted: 2020/03/30
Food detection  Spatial transformer  Object detection  
Improving visual question answering using dropout and enhanced question encoder (Journal article)
PATTERN RECOGNITION, 2019, Volume: 90, Issue: 1, Pages: 404-414
Authors: Fang, Zhiwei; Liu, Jing; Li, Yong; Qiao, Yanyuan; Lu, Hanqing
Adobe PDF (1624Kb) | Views/Downloads: 482/128 | Submitted: 2019/04/23
Visual question answering  Coherent dropout  Siamese dropout  Enhanced question encoder  
EgoGesture: A New Dataset and Benchmark for Egocentric Hand Gesture Recognition (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, Volume: 20, Issue: 5, Pages: 1038-1050
Authors: Zhang, Yifan; Cao, Congqi; Cheng, Jian; Lu, Hanqing
Adobe PDF (1260Kb) | Views/Downloads: 945/362 | Submitted: 2018/05/05
Benchmark  Dataset  Egocentric Vision  Gesture Recognition  First-person View  
Automatic group activity annotation for mobile videos (Journal article)
MULTIMEDIA SYSTEMS, 2017, Volume: 23, Issue: 6, Pages: 667-677
Authors: Zhao, Chaoyang; Wang, Jinqiao; Li, Jianqiang; Lu, Hanqing
Adobe PDF (1533Kb) | Views/Downloads: 430/91 | Submitted: 2018/01/07
Activity Annotation  Group Activity  Context Learning  
Fine-Grained Image Classification via Low-Rank Sparse Coding With General and Class-Specific Codebooks (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, Volume: 28, Issue: 7, Pages: 1550-1559
Authors: Zhang, Chunjie; Liang, Chao; Li, Liang; Liu, Jing; Huang, Qingming; Tian, Qi
Adobe PDF (1901Kb) | Views/Downloads: 558/225 | Submitted: 2017/09/12
Fine Grained  Image Representation  Semantic Space  Visual Recognition  
A Coupled Hidden Conditional Random Field Model for Simultaneous Face Clustering and Naming in Videos (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, Volume: 25, Issue: 12, Pages: 5780-5792
Authors: Zhang, Yifan; Tang, Zhiqiang; Wu, Baoyuan; Ji, Qiang; Lu, Hanqing
Adobe PDF (3012Kb) | Views/Downloads: 428/134 | Submitted: 2017/02/14
Face Clustering  Face Naming  Conditional Random Field  
ActiveAd: A novel framework of linking ad videos to online products (Journal article)
NEUROCOMPUTING, 2016, Issue: 185, Pages: 82-92
Authors: Wang, Jinqiao; Xu, Min; Lu, Hanqing; Burnett, Ian
Adobe PDF (2037Kb) | Views/Downloads: 414/143 | Submitted: 2016/10/20
Ad Video Analysis  Visual Search  Tag Aggregation  Textual Search