CASIA OpenIR

浏览/检索结果: 共13条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Weakly-Supervised Video Object Grounding via Stable Context Learning 会议论文
, New York, USA, 2021-10-20
作者:  Wang, Wei;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(2062Kb)  |  收藏  |  浏览/下载:48/21  |  提交时间:2023/04/25
The Model May Fit You: User-Generalized Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 24, 页码: 2998-3012
作者:  Ma, Xinhong;  Yang, Xiaoshan;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(6549Kb)  |  收藏  |  浏览/下载:259/48  |  提交时间:2022/06/17
cross-modal retrieval  domain generalization  meta-learning  
Object Relational Graph with Teacher-Recommended Learning for Video Captioning 会议论文
2020, 线上, 2020.6.14-19
作者:  Zhang,Ziqi;  Shi,Yaya;  Yuan,Chunfeng;  Li,Bing;  Wang,Peijin;  Hu,Weiming;  Zha,Zhengjun
Adobe PDF(1547Kb)  |  收藏  |  浏览/下载:191/69  |  提交时间:2022/06/16
A Multi-Task MRC Framework for Chinese Emotion Cause and Experiencer Extraction 会议论文
, Bratislava, Slovakia, 2021-09
作者:  Haoda Qian;  Qiudan Li;  Zaichuan Tang
Adobe PDF(79001Kb)  |  收藏  |  浏览/下载:332/124  |  提交时间:2022/06/14
Open-Vocabulary One-Stage Detection with Hierarchical Visual-Language Knowledge Distillation 会议论文
, New Orleans, Louisiana, 2022-06
作者:  Zongyang Ma;  Guan Luo;  Jin Gao;  Liang L;  Yuxin Chen;  Shaoru Wang;  Congxuan Zhang;  Weiming Hu
Adobe PDF(1668Kb)  |  收藏  |  浏览/下载:257/64  |  提交时间:2022/04/06
Graph-based Multimodal Ranking Models for Multimodal Summarization 期刊论文
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 卷号: 20, 期号: 4, 页码: 21
作者:  Zhu, Junnan;  Xiang, Lu;  Zhou, Yu;  Zhang, Jiajun;  Zong, Chengqing
Adobe PDF(4193Kb)  |  收藏  |  浏览/下载:284/52  |  提交时间:2021/12/28
Multimodal summarization  single-modal  multimodal ranking  unsupervised  
图像生成对抗模型与应用研究 学位论文
工学博士, 中国科学院自动化研究所: 中国科学院自动化研究所, 2021
作者:  张晨阳
Adobe PDF(8500Kb)  |  收藏  |  浏览/下载:187/7  |  提交时间:2021/06/23
生成对抗网络  深度学习  图像序列生成  领域自适应  行人重识别  
A Co-Memory Network for Multimodal Sentiment Analysis 会议论文
, Ann Arbor, MI, USA, July 8-12, 2018
作者:  Xu, Nan;  Mao, Wenji;  Chen, Guandan
浏览  |  Adobe PDF(1334Kb)  |  收藏  |  浏览/下载:270/116  |  提交时间:2020/06/10
Read, Watch, Listen, and Summarize: Multi-Modal Summarization for Asynchronous Text, Image, Audio and Video 期刊论文
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 卷号: 31, 期号: 5, 页码: 996-1009
作者:  Li, Haoran;  Zhu, Junnan;  Ma, Cong;  Zhang, Jiajun;  Zong, Chengqing
浏览  |  Adobe PDF(2826Kb)  |  收藏  |  浏览/下载:397/99  |  提交时间:2019/07/12
Summarization  multimedia  multi-modal  cross-modal  natural language processing  computer vision  
Scene text detection and recognition with advances in deep learning: a survey 期刊论文
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2019, 卷号: 22, 期号: 2, 页码: 143-162
作者:  Liu, Xiyan;  Meng, Gaofeng;  Pan, Chunhong
Adobe PDF(2418Kb)  |  收藏  |  浏览/下载:291/32  |  提交时间:2019/07/11
Natural image  Text detection  Text recognition  Survey