CASIA OpenIR

浏览/检索结果: 共45条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Multi-Stage Image-Language Cross-Generative Fusion Network for Video-Based Referring Expression Comprehension 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 3256-3270
作者:  Zhang, Yujia;  Li, Qianzhong;  Pan, Yi;  Zhao, Xiaoguang;  Tan, Min
收藏  |  浏览/下载:7/0  |  提交时间:2024/07/03
Feature extraction  Visualization  Task analysis  Representation learning  Location awareness  Linguistics  Grounding  Video-based referring expression comprehension  multi-stage learning  image-language cross-generative fusion  consistency loss  
SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 3469-3480
作者:  Peng, Fang;  Yang, Xiaoshan;  Xiao, Linhui;  Wang, Yaowei;  Xu, Changsheng
收藏  |  浏览/下载:6/0  |  提交时间:2024/07/03
Few-shot  image classification  vision-language models  
An end-to-end model for multi-view scene text recognition 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 149, 页码: 17
作者:  Banerjee, Ayan;  Shivakumara, Palaiahnakote;  Bhattacharya, Saumik;  Pal, Umapada;  Liu, Cheng-Lin
收藏  |  浏览/下载:7/0  |  提交时间:2024/07/03
Text detection  Scene text recognition  Siamese network  Natural language model  Genetic algorithm  Multi-view text detection  
Prompting Large Language Models for Automatic Question Tagging 期刊论文
Machine Intelligence Research, 2024, 页码: 0
作者:  Nuojia Xu;  Dizhan Xue;  Shengsheng Qian;  Quan Fang;  Jun Hu
Adobe PDF(1493Kb)  |  收藏  |  浏览/下载:38/17  |  提交时间:2024/06/04
Community Question Answering  Machine Learning  Large Language Model  Prompt Learning  Question Tagging  
CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 4334-4347
作者:  Xiao, Linhui;  Yang, Xiaoshan;  Peng, Fang;  Yan, Ming;  Wang, Yaowei;  Xu, Changsheng
收藏  |  浏览/下载:28/0  |  提交时间:2024/05/30
Grounding  Reliability  Adaptation models  Task analysis  Visualization  Data models  Annotations  Visual grounding  curriculum learning  pseudo-language label  and vision-language models  
Towards a unified framework for imperceptible textual attacks 期刊论文
APPLIED INTELLIGENCE, 2024, 页码: 14
作者:  Shi, Jiahui;  Li, Linjing;  Zeng, Daniel
收藏  |  浏览/下载:74/0  |  提交时间:2024/03/26
Adversarial attack  Backdoor attack  Natural language processing  Adversarial machine learning  
VQAPT: A New visual question answering model for personality traits in social media images 期刊论文
PATTERN RECOGNITION LETTERS, 2023, 卷号: 175, 页码: 66-73
作者:  Biswas, Kunal;  Shivakumara, Palaiahnakote;  Pal, Umapada;  Liu, Cheng-Lin;  Lu, Yue
收藏  |  浏览/下载:66/0  |  提交时间:2024/02/22
Personality trait images  Multimodal concept  Text recognition  Social media images  Natural language processing  Visual question answering  
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 9
作者:  Yi, Guofeng;  Fan, Cunhang;  Zhu, Kang;  Lv, Zhao;  Liang, Shan;  Wen, Zhengqi;  Pei, Guanxiong;  Li, Taihao;  Tao, Jianhua
收藏  |  浏览/下载:106/0  |  提交时间:2024/02/22
Multimodal sentiment analysis  Vision-language  Multimodal fusion  
VistaGPT: Generative Parallel Transformers for Vehicles With Intelligent Systems for Transport Automation 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 卷号: 8, 期号: 9, 页码: 4198-4207
作者:  Tian, Yonglin;  Li, Xuan;  Zhang, Hui;  Zhao, Chen;  Li, Bai;  Wang, Xiao;  Wang, Xiao;  Wang, Fei-Yue
收藏  |  浏览/下载:183/0  |  提交时间:2023/12/21
Transformers  Task analysis  Autonomous vehicles  Planning  Biological system modeling  Navigation  Automation  Generative parallel transformers  end-to-end driving  transport automation  large-language models  federation of vehicular transformers  scenario engineering  
Parallel Learning for Legal Intelligence: A HANOI Approach Based on Unified Prompting 期刊论文
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 页码: 11
作者:  Song, Zhuoyang;  Huang, Min;  Miao, Qinghai;  Wang, Fei-Yue
收藏  |  浏览/下载:108/0  |  提交时间:2023/11/17
Index Ternis- Natural language processing (NLP)  parallel learning (PL)  parallel systems  pretrained language model (PLM)  prompt tuning