CASIA OpenIR

Browse/Search Results: 88 items in total, items 1-10 shown

Segment Anything Is Not Always Perfect: An Investigation of SAM on Different Real-world Applications (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 4, Pages: 617-630
Authors: Wei Ji; Jingjing Li; Qi Bi; Tingwei Liu; Wenbo Li; Li Cheng
Adobe PDF (11623 Kb) | Views/Downloads: 5/2 | Submitted: 2024/07/18
Segment anything model (SAM)  visual perception  segmentation  foundational model  computer vision  
NExT-OOD: Overcoming Dual Multiple-Choice VQA Biases (Journal article)
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, Pages: 1913-1931
Authors: Zhang Xi (张熙); Feifei Zhang; Changsheng Xu
Adobe PDF (4719 Kb) | Views/Downloads: 27/7 | Submitted: 2024/07/08
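Multimodal Named Entity Recognition Method with Multi-scale Visual Semantic Enhancement (多尺度视觉语义增强的多模态命名实体识别方法) (Journal article)
Acta Automatica Sinica (自动化学报), 2024, Volume: 50, Issue: 6, Pages: 1234-1245
Authors: 王海荣; 徐玺; 王彤; 陈芳萍
Adobe PDF (2077 Kb) | Views/Downloads: 37/16 | Submitted: 2024/07/02
multimodal named entity recognition  multi-task learning  multimodal fusion  Transformer  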
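Research on Emotion Recognition Based on Multimodal Representation Learning and Fusion (基于多模态表征学习与融合的情感识别研究) (Thesis)
2024
Author: 孙立才
Adobe PDF (5844 Kb) | Views/Downloads: 42/4 | Submitted: 2024/06/27
emotion recognition  representation learning  self-supervised learning  multimodal fusion  attention mechanism  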
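Multimodal Audio-Visual Classification Based on Spiking Neural Networks (基于脉冲神经网络的多模态视听分类) (Thesis)
2024
Author: 郭凌月
Adobe PDF (3051 Kb) | Views/Downloads: 30/0 | Submitted: 2024/06/27
spiking neural networks  multimodal alignment  multimodal fusion  audio-visual classification  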
Pro-tuning: Unified Prompt Tuning for Vision Tasks (Journal article)
IEEE Transactions on Circuits and Systems for Video Technology, 2023, Volume: 34, Issue: 6, Pages: 4653-4667
Authors: Xing Nie; Bolin Ni; Jianlong Chang; Gaofeng Meng; Chunlei Huo; Shiming Xiang; Qi Tian
Adobe PDF (2224 Kb) | Views/Downloads: 28/8 | Submitted: 2024/06/21
SocialVis: Dynamic Social Visualization in Dense Scenes via Real-time Multi-Object Tracking and Proximity Graph Construction (Journal article)
Computer Animation and Virtual Worlds, 2024, Volume: 35, Issue: 3, Pages: 1-15
Authors: Li BW (李博文); Li W (李巍); Wang JQ (王镜淇); Meng WL (孟维亮); Zhang JG (张吉光); Zhang XP (张晓鹏)
Adobe PDF (2914 Kb) | Views/Downloads: 45/7 | Submitted: 2024/06/04
dense pedestrian  detection  multi-object tracking  proximity graph  visualization  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis (Journal article)
IEEE Transactions on Affective Computing, 2023, Volume: 15, Issue: 1, Pages: 1-17
Authors: Licai Sun; Zheng Lian; Bin Liu; Jianhua Tao
Adobe PDF (2371 Kb) | Views/Downloads: 67/18 | Submitted: 2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
GPT-4V with Emotion: A Zero-shot Benchmark for Generalized Emotion Recognition (Journal article)
Information Fusion, 2024, Pages: 1-12
Authors: Zheng Lian; Licai Sun; Haiyang Sun; Kang Chen; Zhuofan Wen; Hao Gu; Bin Liu; Jianhua Tao
Adobe PDF (6888 Kb) | Views/Downloads: 66/10 | Submitted: 2024/05/31
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation (Journal article)
IEEE Transactions on Circuits and Systems for Video Technology, 2024, Volume: 34, Issue: 4, Pages: 2041-2055
Authors: Chen, Yuxin; Zhang, Ziqi; Qi, Zhongang; Yuan, Chunfeng; Wang, Jie; Shan, Ying; Li, Bing; Hu, Weiming; Qie, Xiaohu; Wu, Jianping
Adobe PDF (13765 Kb) | Views/Downloads: 48/1 | Submitted: 2024/05/30
Chinese video captioning evaluation  dual-reconstruction transformer