CASIA OpenIR
(This search is based on users' claimed works)

Browse/search results: 6 items in total, showing items 1-6

Visual Question Answering With Dense Inter- and Intra-Modality Interactions (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 3518-3529
Authors:  Liu, Fei;  Liu, Jing;  Fang, Zhiwei;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2891Kb)  |  Views/Downloads: 263/58  |  Submitted: 2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions  
Show, Tell, and Polish: Ruminant Decoding for Image Captioning (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, Volume: 22, Issue: 8, Pages: 2149-2162
Authors:  Guo, Longteng;  Liu, Jing;  Lu, Shichen;  Lu, Hanqing
Adobe PDF(4378Kb)  |  Views/Downloads: 201/30  |  Submitted: 2020/08/31
Image captioning  Multi-pass decoding  Rumination  
EgoGesture: A New Dataset and Benchmark for Egocentric Hand Gesture Recognition (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, Volume: 20, Issue: 5, Pages: 1038-1050
Authors:  Zhang, Yifan;  Cao, Congqi;  Cheng, Jian;  Lu, Hanqing
Adobe PDF(1260Kb)  |  Views/Downloads: 920/355  |  Submitted: 2018/05/05
Benchmark  Dataset  Egocentric Vision  Gesture Recognition  First-person View  
Exploiting Visual-Audio-Textual Characteristics for Automatic TV Commercial Block Detection and Segmentation (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2011, Volume: 13, Issue: 5, Pages: 961-973
Authors:  Liu, Nan;  Zhao, Yao;  Zhu, Zhenfeng;  Lu, Hanqing
Views/Downloads: 182/0  |  Submitted: 2015/08/12
Commercial Detection  Commercial Segmentation  Multi-modal Fusion  Text Detection  Video Analysis  
A multimodal scheme for program segmentation and representation in broadcast video streams (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2008, Volume: 10, Issue: 3, Pages: 393-408
Authors:  Wang, Jinqiao;  Duan, Lingyu;  Liu, Qingshan;  Lu, Hanqing;  Jin, Jesse S.
Views/Downloads: 150/0  |  Submitted: 2015/11/08
Broadcast Video  Latent Semantic Analysis  Multimodal Fusion  TV Program Segmentation  
A multimodal scheme for program segmentation and representation in broadcast video streams (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2008, Volume: 10, Issue: 3, Pages: 393-408
Authors:  Wang, Jinqiao;  Duan, Lingyu;  Liu, Qingshan;  Lu, Hanqing;  Jin, Jesse S.
Adobe PDF(2319Kb)  |  Views/Downloads: 300/95  |  Submitted: 2015/11/08
Broadcast Video  Latent Semantic Analysis  Multimodal Fusion  TV Program Segmentation