VQAPT: A New visual question answering model for personality traits in social media images
Authors: Biswas, Kunal [1]; Shivakumara, Palaiahnakote [2]; Pal, Umapada [1]; Liu, Cheng-Lin [3,4]; Lu, Yue [5]
Journal: PATTERN RECOGNITION LETTERS
ISSN: 0167-8655
2023-11-01
Volume: 175; Pages: 66-73
Corresponding author: Shivakumara, Palaiahnakote (shiva@um.edu.my)
Abstract: Visual Question Answering (VQA) for personality trait images on social media is challenging because of multiple emotions and actions with complex backgrounds in social media images. This work aims at developing a new VQA model for different personality traits (VQAPT) identification in a single image. This work considers the Big Five Factors (BFF) for personality traits namely, Openness, Conscientiousness, Extraversion, Agreeableness and Neuroticism. VQA is proposed based on the observation that multiple personality traits can be seen in a single image. We propose a model integrating text recognition and person/face recognition to derive the unique relationship between the text and the person's action in the image. Furthermore, a dynamic text-object graph for personality traits identification is constructed according to the query. For understanding a query, we explore the Contrastive Language-Image Pre-trained (CLIP) transformer encoder in this work. Since it is the first work of its kind, we have created a new dataset under this work for evaluation and the dataset is available publicly as mentioned in Section 4. The effectiveness of the proposed method is also evaluated on two benchmark datasets, namely TextVQA for VQA and PTI for personality traits identification.
Keywords: Personality trait images; Multimodal concept; Text recognition; Social media images; Natural language processing; Visual question answering
DOI: 10.1016/j.patrec.2023.10.016
Indexed by: SCI
Language: English
Funding project: Ministry of Higher Education Malaysia [FRGS/1/2020/ICT02/UM/02/4]; University Grants Commission (UGC), India
Funding organization: Ministry of Higher Education Malaysia; University Grants Commission (UGC), India
WOS research area: Computer Science
WOS subject category: Computer Science, Artificial Intelligence
WOS accession number: WOS:001102930500001
Publisher: ELSEVIER
Document type: Journal article
Item identifier: http://ir.ia.ac.cn/handle/173211/55235
Collection: State Key Laboratory of Multimodal Artificial Intelligence Systems
Author affiliations:
1. Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata, India
2. Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
3. Univ Chinese Acad Sci, Inst Automat, Chinese Acad Sci, Beijing, Peoples R China
4. Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
5. East China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
Recommended citation
GB/T 7714: Biswas, Kunal; Shivakumara, Palaiahnakote; Pal, Umapada; et al. VQAPT: A New visual question answering model for personality traits in social media images[J]. PATTERN RECOGNITION LETTERS, 2023, 175: 66-73.
APA: Biswas, Kunal; Shivakumara, Palaiahnakote; Pal, Umapada; Liu, Cheng-Lin; & Lu, Yue. (2023). VQAPT: A New visual question answering model for personality traits in social media images. PATTERN RECOGNITION LETTERS, 175, 66-73.
MLA: Biswas, Kunal, et al. "VQAPT: A New visual question answering model for personality traits in social media images". PATTERN RECOGNITION LETTERS 175 (2023): 66-73.