CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 12
作者:  Zheng, Wenbo;  Yan, Lan;  Wang, Fei-Yue
收藏  |  浏览/下载:92/0  |  提交时间:2023/12/21
Graph attention  graph reasoning  multimodal graph  self-attention  text-based visual question answering  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:139/26  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Decentralized Autonomous Operations and Organizations in TransVerse: Federated Intelligence for Smart Mobility 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 卷号: 53, 期号: 4, 页码: 2062-2072
作者:  Zhao, Chen;  Dai, Xingyuan;  Lv, Yisheng;  Niu, Jinglong;  Lin, Yilun
Adobe PDF(1921Kb)  |  收藏  |  浏览/下载:218/2  |  提交时间:2023/02/22
Intelligent Transportation Systems (ITS)  Artificial Systems, Computational Experiments, Parallel Execution (ACP)  Cyber–Physical–Social Systems (CPSS)  
Multi-View Multi-Label Fine-Grained Emotion Decoding From Human Brain Activity 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-15
作者:  Fu, Kaicheng;  Du, Changde;  Wang, Shengpei;  He, Huiguang
Adobe PDF(4570Kb)  |  收藏  |  浏览/下载:288/68  |  提交时间:2022/12/27
Fine-grained Emotion Decoding  Multi-view Learning  Multi-label Learning  Variational Autoencoder  Product of Experts  
Towards open-set text recognition via label-to-prototype learning 期刊论文
PATTERN RECOGNITION, 2023, 卷号: 134, 页码: 13
作者:  Liu, Chang;  Yang, Chun;  Qin, Hai-Bo;  Zhu, Xiaobin;  Liu, Cheng-Lin;  Yin, Xu-Cheng
收藏  |  浏览/下载:238/0  |  提交时间:2022/12/27
Open-set recognition  Scene text recognition  Low-shot recognition  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 2386-2397
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(2165Kb)  |  收藏  |  浏览/下载:332/45  |  提交时间:2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
MEAD: a Mask-guidEd Anchor-free Detector for oriented aerial object detection 期刊论文
Applied Intelligence, 2021, 期号: 0, 页码: 0
作者:  Zewen He;  Zhida Ren;  Xuebing Yang;  Yang Yang;  Wensheng Zhang
Adobe PDF(4447Kb)  |  收藏  |  浏览/下载:236/51  |  提交时间:2021/06/28
Oriented aerial object detection  Anchor-free detector  Mask-guided mechanism  Cascade structure  
End -to -end video text detection with online tracking 期刊论文
PATTERN RECOGNITION, 2021, 卷号: 113, 页码: 12
作者:  Yu, Hongyuan;  Huang, Yan;  Pi, Lihong;  Zhang, Chengquan;  Li, Xuan;  Wang, Liang
Adobe PDF(4997Kb)  |  收藏  |  浏览/下载:328/61  |  提交时间:2021/05/06
End-to-end  Video text detection  Online tracking  
Robot learning through observation via coarse-to-fine grained video summarization 期刊论文
APPLIED SOFT COMPUTING, 2021, 卷号: 99, 期号: /, 页码: 106913
作者:  Zhang, Yujia;  Li, Qianzhong;  Zhao, Xiaoguang;  Tan, Min
Adobe PDF(5989Kb)  |  收藏  |  浏览/下载:358/74  |  提交时间:2021/03/08
Robotic vision  Learning through observation  Coarse-to-fine video summarization  
Part-based Structured Representation Learning for Person Re-identification 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 卷号: 16, 期号: 4, 页码: 22
作者:  Li, Yaoyu;  Yao, Hantao;  Zhang, Tianzhu;  Xu, Changsheng
Adobe PDF(19052Kb)  |  收藏  |  浏览/下载:293/42  |  提交时间:2021/03/08
Person re-identification  representation learning  graph convolutional network