CASIA OpenIR

Browse/Search results: 80 items in total, items 1-10

Emotion-Aware Music Driven Movie Montage (journal article)
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, Volume: 38, Issue: 3, Pages: 540-553
Authors: Liu, Wu-Qin; Lin, Min-Xuan; Huang, Hai-Bin; Ma, Chong-Yang; Song, Yu; Dong, Wei-Ming; Xu, Chang-Sheng
Views/Downloads: 92/0  |  Submitted: 2023/12/21
movie montage  emotion analysis  audio-visual modality  contrastive learning  
ArtCap: A Dataset for Image Captioning of Fine Art Paintings (journal article)
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2022, Pages: 12
Authors: Lu, Yue; Guo, Chao; Dai, Xingyuan; Wang, Fei-Yue
Adobe PDF (5137Kb)  |  Views/Downloads: 218/42  |  Submitted: 2023/02/22
Dataset construction  image captioning  painting captioning  
Anchor-free temporal action localization via Progressive Boundary-aware Boosting (journal article)
Information Processing & Management, 2022, Volume: 60, Issue: 1, Pages: 103141
Authors: Tang, Yepeng; Wang, Weining; Yang, Yanwu; Zhang, Chunjie; Liu, Jing
Adobe PDF (1559Kb)  |  Views/Downloads: 106/37  |  Submitted: 2023/05/03
Temporal action localization  Anchor-free  Video understanding  
Learning Hierarchical Video Graph Networks for One-Stop Video Delivery (journal article)
ACM Transactions on Multimedia Computing, Communications, and Applications, 2022, Volume: 18, Issue: 1, Pages: 1-23
Authors: Song, Yaguang; Gao, Junyu; Yang, Xiaoshan; Xu, Changsheng
Adobe PDF (7608Kb)  |  Views/Downloads: 130/40  |  Submitted: 2023/04/25
Cross modal  video retrieval  deep learning  graph neural networks  
Paradigm Shift in Natural Language Processing (journal article)
Machine Intelligence Research, 2022, Volume: 19, Issue: 3, Pages: 169-183
Authors: Tian-Xiang Sun; Xiang-Yang Liu; Xi-Peng Qiu; Xuan-Jing Huang
Adobe PDF (1449Kb)  |  Views/Downloads: 3/1  |  Submitted: 2024/04/23
Face detection  global context  attention mechanism  computer vision  deep learning  
Text Based Video Retrieval among Video Clips (conference paper)
Beijing, China, November 18-21, 2021
Authors: Chi Zhang; Guixuan Zhang; Shuwu Zhang
Adobe PDF (316Kb)  |  Views/Downloads: 175/63  |  Submitted: 2022/04/08
3D-SceneCaptioner: Visual Scene Captioning Network for Three-Dimensional Point Clouds (conference paper)
Zhuhai, Guangdong Province, 2021-12
Authors: Yu, Qiang; Pan, Xianbing; Xiang, Shiming; Pan, Chunhong
Adobe PDF (3412Kb)  |  Views/Downloads: 146/25  |  Submitted: 2022/01/14
Scene Captioning  Three-Dimensional Vision  Point Cloud  
Multimodal Global Relation Knowledge Distillation for Egocentric Action Anticipation (conference paper)
MM '21: Proceedings of the 29th ACM International Conference on Multimedia, Chengdu, China, 2021.10.20—2021.10.24
Authors: Huang Yi; Yang Xiaoshan; Xu Changsheng
Adobe PDF (1162Kb)  |  Views/Downloads: 139/56  |  Submitted: 2023/06/21
Graph-based Multimodal Ranking Models for Multimodal Summarization (journal article)
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, Volume: 20, Issue: 4, Pages: 21
Authors: Zhu, Junnan; Xiang, Lu; Zhou, Yu; Zhang, Jiajun; Zong, Chengqing
Adobe PDF (4193Kb)  |  Views/Downloads: 258/48  |  Submitted: 2021/12/28
Multimodal summarization  single-modal  multimodal ranking  unsupervised  
GraphAIR: Graph representation learning with neighborhood aggregation and interaction (journal article)
PATTERN RECOGNITION, 2021, Volume: 112, Pages: 11
Authors: Hu, Fenyu; Zhu, Yanqiao; Wu, Shu; Huang, Weiran; Wang, Liang; Tan, Tieniu
Adobe PDF (839Kb)  |  Views/Downloads: 444/190  |  Submitted: 2021/03/29
Graph representation learning  Neighborhood aggregation  Graph neural networks  Neighborhood interaction  Node classification  Link prediction