CASIA OpenIR

浏览/检索结果: 共1147条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
CLIP-Driven hierarchical fusion for referring image segmentation 会议论文
, Kunming, China, 2024/03/08
作者:  Yichen Yan;  Xingjian He;  Jing Liu
Adobe PDF(5233Kb)  |  收藏  |  浏览/下载:29/8  |  提交时间:2024/07/08
Referring Image Segmentation, CLIP, Hierarchical Fusion, Computer Vision  
CGNN: A Compatibility-aware Graph Neural Network for Social Media Bot Detection 期刊论文
IEEE Transactions on Computational Social System, 2024, 页码: Early Access
作者:  Huang, Haitao;  Tian, Hu;  Zheng, Xiaolong;  Zhang, Xingwei;  Zeng, Dajun;  Wang, Feiyue
Adobe PDF(2267Kb)  |  收藏  |  浏览/下载:18/8  |  提交时间:2024/07/08
graph neural network  heterogeneous compatibility  social media bot detection  
A Semantic and Structural Transformer for Code Summarization Generation 会议论文
, 澳大利亚, 2023.6.8
作者:  Ruyi Ji;  Zhenyu Tong;  Tiejian Luo;  Jing Liu;  Libo Zhang
Adobe PDF(912Kb)  |  收藏  |  浏览/下载:17/7  |  提交时间:2024/07/08
Calibration & Reconstruction: Deep Integrated Language for Referring Image Segmentation 会议论文
Proceedings of the 2024 International Conference on Multimedia Retrieval, Phuket, Thailand, 2024/03/08
作者:  Yichen Yan;  Xingjian He;  Sihan Chen;  Jing Liu
Adobe PDF(2868Kb)  |  收藏  |  浏览/下载:16/7  |  提交时间:2024/07/08
Referring Image Segmentation, CLIP, Hierarchical Fusion, Computer Vision  
CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 6906-6916
作者:  Wang, Wenxuan;  He, Xingjian;  Zhang, Yisi;  Guo, Longteng;  Shen, Jiachen;  Li, Jiangyun;  Liu, Jing
收藏  |  浏览/下载:3/0  |  提交时间:2024/07/03
Referring image segmentation  cross-modality guidance  masked self-distillation  vision and language  
Multi-Stage Image-Language Cross-Generative Fusion Network for Video-Based Referring Expression Comprehension 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 3256-3270
作者:  Zhang, Yujia;  Li, Qianzhong;  Pan, Yi;  Zhao, Xiaoguang;  Tan, Min
收藏  |  浏览/下载:7/0  |  提交时间:2024/07/03
Feature extraction  Visualization  Task analysis  Representation learning  Location awareness  Linguistics  Grounding  Video-based referring expression comprehension  multi-stage learning  image-language cross-generative fusion  consistency loss  
Comprehensive Attribute Prediction Learning for Person Search by Language 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1990-2003
作者:  Niu, Kai;  Huang, Linjiang;  Long, Yuzhou;  Huang, Yan;  Wang, Liang;  Zhang, Yanning
收藏  |  浏览/下载:3/0  |  提交时间:2024/07/03
Person search by language  cross-modal retrieval  smart video surveillance  attribute prediction  
Disentangled Text Representation Learning With Information-Theoretic Perspective for Adversarial Robustness 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 卷号: 32, 页码: 1237-1247
作者:  Zhao, Jiahao;  Mao, Wenji;  Zeng, Daniel Dajun
收藏  |  浏览/下载:5/0  |  提交时间:2024/07/03
Adversarial robustness  variation of information  disentangled text representation learning  
An end-to-end model for multi-view scene text recognition 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 149, 页码: 17
作者:  Banerjee, Ayan;  Shivakumara, Palaiahnakote;  Bhattacharya, Saumik;  Pal, Umapada;  Liu, Cheng-Lin
收藏  |  浏览/下载:7/0  |  提交时间:2024/07/03
Text detection  Scene text recognition  Siamese network  Natural language model  Genetic algorithm  Multi-view text detection  
Optimizing Reward Function Weights and Enhancing Control Mechanisms for Bipedal Robots Using LSTM and Attention Mechanisms 会议论文
, 河北保定, 2023-8-16
作者:  Cui LZ(崔凌志);  Tianqi Deng;  Lihua Ma;  Wenhao He
Adobe PDF(541Kb)  |  收藏  |  浏览/下载:24/9  |  提交时间:2024/07/01