CASIA OpenIR

Browse/Search results: 655 items in total, showing items 1-10

CLIP-Driven hierarchical fusion for referring image segmentation (Conference paper)
, Kunming, China, 2024/03/08
Authors:  Yichen Yan;  Xingjian He;  Jing Liu
Adobe PDF(5233Kb)  |  Views/Downloads: 29/8  |  Submitted: 2024/07/08
Referring Image Segmentation, CLIP, Hierarchical Fusion, Computer Vision  
CGNN: A Compatibility-aware Graph Neural Network for Social Media Bot Detection (Journal article)
IEEE Transactions on Computational Social Systems, 2024, Pages: Early Access
Authors:  Huang, Haitao;  Tian, Hu;  Zheng, Xiaolong;  Zhang, Xingwei;  Zeng, Dajun;  Wang, Feiyue
Adobe PDF(2267Kb)  |  Views/Downloads: 18/8  |  Submitted: 2024/07/08
graph neural network  heterogeneous compatibility  social media bot detection  
A Semantic and Structural Transformer for Code Summarization Generation (Conference paper)
, Australia, 2023.6.8
Authors:  Ruyi Ji;  Zhenyu Tong;  Tiejian Luo;  Jing Liu;  Libo Zhang
Adobe PDF(912Kb)  |  Views/Downloads: 17/7  |  Submitted: 2024/07/08
Calibration & Reconstruction: Deep Integrated Language for Referring Image Segmentation (Conference paper)
Proceedings of the 2024 International Conference on Multimedia Retrieval, Phuket, Thailand, 2024/03/08
Authors:  Yichen Yan;  Xingjian He;  Sihan Chen;  Jing Liu
Adobe PDF(2868Kb)  |  Views/Downloads: 16/7  |  Submitted: 2024/07/08
Referring Image Segmentation, CLIP, Hierarchical Fusion, Computer Vision  
Attribute-Guided Cross-Modal Interaction and Enhancement for Audio-Visual Matching (Journal article)
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, Volume: 19, Pages: 4986-4998
Authors:  Wang, Jiaxiang;  Zheng, Aihua;  Yan, Yan;  He, Ran;  Tang, Jin
Views/Downloads: 3/0  |  Submitted: 2024/07/03
Audio-visual cross-modal matching  attribute-guided cross-modal interaction  attribute-guided cross-modal enhancement  
Multi-Stage Image-Language Cross-Generative Fusion Network for Video-Based Referring Expression Comprehension (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 3256-3270
Authors:  Zhang, Yujia;  Li, Qianzhong;  Pan, Yi;  Zhao, Xiaoguang;  Tan, Min
Views/Downloads: 7/0  |  Submitted: 2024/07/03
Feature extraction  Visualization  Task analysis  Representation learning  Location awareness  Linguistics  Grounding  Video-based referring expression comprehension  multi-stage learning  image-language cross-generative fusion  consistency loss  
Comment-Context Dual Collaborative Masked Transformer Network for Fake News Detection (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 5170-5180
Authors:  Wang, Jinguang;  Qian, Shengsheng;  Hu, Jun;  Hong, Richang
Views/Downloads: 14/0  |  Submitted: 2024/07/03
Fake news detection  multi-modal learning  social media  
MapGuide: A Simple yet Effective Method to Reconstruct Continuous Language from Brain Activities (Conference paper)
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), Mexico City, Mexico, 2024-6
Authors:  Xinpei, Zhao;  Jingyuan, Sun;  Shaonan, Wang;  Jing, Ye;  Xiaohan, Zhang;  Chengqing, Zong
Adobe PDF(843Kb)  |  Views/Downloads: 25/7  |  Submitted: 2024/06/27
neural decoding  
A Multimodal Neural Network for Contact State Recognition during Probe Implantation into Skull Holes (Conference paper)
, New Zealand, 2023-8
Authors:  Song YJ(宋雨佳);  Wang XF(王啸峰);  Zhang DP(张大朋)
Adobe PDF(1573Kb)  |  Views/Downloads: 23/10  |  Submitted: 2024/06/26
Modal Contrastive Learning Based End-to-End Text Image Machine Translation (Journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, Volume: 32, Issue: 32, Pages: 2153-2165
Authors:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  Views/Downloads: 24/12  |  Submitted: 2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation