CASIA OpenIR

Browse/Search results: 57 records in total, showing 1-10

Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering (Journal article)
IEEE Transactions on Multimedia, 2023, Pages: 1-15
Authors:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  Views/Downloads: 151/41  |  Submitted: 2023/06/12
Keywords: Multi-modal Foundation Model; Out-of-Distribution Generalization; Visual Question Answering; Knowledge Distillation
BViT: Broad Attention-Based Vision Transformer (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2023, Pages: 1-12
Authors:  Nannan Li;  Yaran Chen;  Weifan Li;  Zixiang Ding;  Dongbin Zhao;  Shuai Nie
Adobe PDF(2171Kb)  |  Views/Downloads: 159/46  |  Submitted: 2023/06/27
Keywords: Broad attention; broad connection; image classification; parameter-free attention; vision transformer
Self-supervised Calorie-aware Heterogeneous Graph Networks for Food Recommendation (Journal article)
ACM Transactions on Multimedia Computing, Communications, and Applications, 2023, Volume: 19, Issue: 1s, Pages: 1-23
Authors:  Song, Yaguang;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(1381Kb)  |  Views/Downloads: 147/52  |  Submitted: 2023/06/12
Keywords: Food recommendation; recipe calories; heterogeneous graph; self-supervised learning
A Framework and Operational Procedures for Metaverses-Based Industrial Foundation Models (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, Pages: 10
Authors:  Wang, Jiangong;  Tian, Yonglin;  Wang, Yutong;  Yang, Jing;  Wang, Xingxia;  Wang, Sanjin;  Kwan, Oliver
Adobe PDF(3322Kb)  |  Views/Downloads: 121/33  |  Submitted: 2023/02/22
Keywords: Cyber-physical-social intelligence (CPSI); cyber-physical-social systems (CPSSs); industrial foundation models (IFMs); intelligent enterprises; metaverses; operational processes; parallel intelligence
Region Probability Map-Guided Fast Wide-Area Multiobject Detection (Journal article)
IEEE Transactions on Instrumentation and Measurement, 2022, Pages: 1-12
Authors:  Long XL(龙宪磊);  Chen, Mengjuan;  Li, Zhikai;  Gu, Qingyi
Adobe PDF(9818Kb)  |  Views/Downloads: 218/43  |  Submitted: 2022/12/19
Keywords: High-speed vision; object detection; particle filter; region probability map; wide-area surveillance
ArtCap: A Dataset for Image Captioning of Fine Art Paintings (Journal article)
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2022, Pages: 12
Authors:  Lu, Yue;  Guo, Chao;  Dai, Xingyuan;  Wang, Fei-Yue
Adobe PDF(5137Kb)  |  Views/Downloads: 209/42  |  Submitted: 2023/02/22
Keywords: Dataset construction; image captioning; painting captioning
MSMFN: An ultrasound based multi-step modality fusion network for identifying the histologic subtypes of metastatic cervical lymphadenopathy (Journal article)
IEEE Transactions on Medical Imaging, 2022, Pages: 1-13
Authors:  Zheling, Meng;  Yangyang, Zhu;  Wenjing, Pang;  Jie, Tian;  Fang, Nie;  Kun, Wang
Adobe PDF(3049Kb)  |  Views/Downloads: 242/48  |  Submitted: 2023/03/27
Anchor-free temporal action localization via Progressive Boundary-aware Boosting (Journal article)
Information Processing & Management, 2022, Volume: 60, Issue: 1, Pages: 103141
Authors:  Tang, Yepeng;  Wang, Weining;  Yang, Yanwu;  Zhang, Chunjie;  Liu, Jing
Adobe PDF(1559Kb)  |  Views/Downloads: 104/36  |  Submitted: 2023/05/03
Keywords: Temporal action localization; Anchor-free; Video understanding
SurgiNet: Pyramid Attention Aggregation and Class-wise Self-Distillation for Surgical Instrument Segmentation (Journal article)
MEDICAL IMAGE ANALYSIS, 2022, Volume: 76, Pages: 102310
Authors:  Ni, Zhen-Liang;  Zhou, Xiao-Hu;  Wang, Guan-An;  Yue, Wen-Qian;  Li, Zhen;  Bian, Gui-Bin;  Hou, Zeng-Guang
Adobe PDF(1944Kb)  |  Views/Downloads: 490/202  |  Submitted: 2022/02/16
Keywords: Surgical Instrument Segmentation; Class-wise Self-Distillation; Pyramid Attention
Learning Hierarchical Video Graph Networks for One-Stop Video Delivery (Journal article)
ACM Transactions on Multimedia Computing, Communications, and Applications, 2022, Volume: 18, Issue: 1, Pages: 1-23
Authors:  Song, Yaguang;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(7608Kb)  |  Views/Downloads: 120/38  |  Submitted: 2023/04/25
Keywords: Cross modal; video retrieval; deep learning; graph neural networks