CASIA OpenIR

浏览/检索结果: 共19条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
A Framework and Operational Procedures for Metaverses-Based Industrial Foundation Models 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 页码: 10
作者:  Wang, Jiangong;  Tian, Yonglin;  Wang, Yutong;  Yang, Jing;  Wang, Xingxia;  Wang, Sanjin;  Kwan, Oliver
Adobe PDF(3322Kb)  |  收藏  |  浏览/下载:126/34  |  提交时间:2023/02/22
Cyber-physical-social intelligence (CPSI)  cyber-physical-social systems (CPSSs)  industrial foundation models (IFMs)  intelligent enterprises  metaverses  operational processes  parallel intelligence  
SurgiNet: Pyramid Attention Aggregation and Class-wise Self-Distillation for Surgical Instrument Segmentation 期刊论文
MEDICAL IMAGE ANALYSIS, 2022, 卷号: 76, 页码: 102310
作者:  Ni, Zhen-Liang;  Zhou, Xiao-Hu;  Wang, Guan-An;  Yue, Wen-Qian;  Li, Zhen;  Bian, Gui-Bin;  Hou, Zeng-Guang
Adobe PDF(1944Kb)  |  收藏  |  浏览/下载:517/207  |  提交时间:2022/02/16
Surgical Insturment Segmentation  Class-wise Self-Distillation  Pyramid Attention  
Holographic Feature Learning of Egocentric-Exocentric Videos for Multi-Domain Action Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2273-2286
作者:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyun;  Xu, Changsheng
Adobe PDF(2409Kb)  |  收藏  |  浏览/下载:309/62  |  提交时间:2022/07/25
Videos  Feature extraction  Visualization  Task analysis  Computational modeling  Target recognition  Prototypes  Egocentric videos  exocentric videos  holographic feature  multi-domain  action recognition  
Weakly-Supervised Video Object Grounding Via Learning Uni-Modal Associations 期刊论文
IEEE Transactions on Multimedia, 2022, 卷号: 25, 页码: 1-12
作者:  Wang, Wei;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(5406Kb)  |  收藏  |  浏览/下载:84/23  |  提交时间:2023/04/25
Visualization  Grounding  Task analysis  Prototypes  Annotations  Uncertainty  Proposals  Cross-modal retrieval  weakly-supervised learning  video object grounding  uni-modal association  
Transformers in computational visual media: A survey 期刊论文
Computational Visual Media, 2021, 卷号: 8, 期号: 1, 页码: 33-62
作者:  Xu,Yifan;  Wei,Huapeng;  Lin,Minxuan;  Deng,Yingying;  Sheng,Kekai;  Zhang,Mengdan;  Tang,Fan;  Dong,Weiming;  Huang,Feiyue;  Xu,Changsheng
Adobe PDF(5366Kb)  |  收藏  |  浏览/下载:267/34  |  提交时间:2021/12/28
visual transformer  computational visual media (CVM)  high-level vision  low-level vision  image generation  multi-modal learning  
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 0
作者:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF(3550Kb)  |  收藏  |  浏览/下载:307/75  |  提交时间:2022/01/27
video question answering  attention mechanism  metric learning  
Adversarial Multimodal Network for Movie Story Question Answering 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 1744-1756
作者:  Yuan, Zhaoquan;  Sun, Siyuan;  Duan, Lixin;  Li, Changsheng;  Wu, Xiao;  Xu, Changsheng
收藏  |  浏览/下载:162/0  |  提交时间:2021/08/15
Knowledge discovery  Motion pictures  Visualization  Task analysis  Generators  Gallium nitride  Natural languages  Movie question answering  adversarial network  multimodal understanding  
Learning Aligned Image-Text Representations Using Graph Attentive Relational Network 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 期号: 30, 页码: 1840-1852
作者:  Jing, Ya;  Wang, Wei;  Wang, Liang;  Tan, Tieniu
Adobe PDF(4532Kb)  |  收藏  |  浏览/下载:312/50  |  提交时间:2021/03/08
Graph neural networks  Visualization  Semantics  Task analysis  Feature extraction  Annotations  Recurrent neural networks  Image-text matching  cross-modal retrieval  person search  graph neural network  
Extracting Effective Image Attributes with Refined Universal Detection 期刊论文
SENSORS, 2021, 卷号: 21, 期号: 1, 页码: 16
作者:  Yu, Qiang;  Xiao, Xinyu;  Zhang, Chunxia;  Song, Lifei;  Pan, Chunhong
Adobe PDF(2391Kb)  |  收藏  |  浏览/下载:296/53  |  提交时间:2021/03/01
attribute extraction  Refined Universal Detection  word tree  image captioning  
Visual Question Answering With Dense Inter- and Intra-Modality Interactions 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3518-3529
作者:  Liu, Fei;  Liu, Jing;  Fang, Zhiwei;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2891Kb)  |  收藏  |  浏览/下载:262/58  |  提交时间:2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions