CASIA OpenIR

浏览/检索结果: 共65条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:93/8  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 9
作者:  Yi, Guofeng;  Fan, Cunhang;  Zhu, Kang;  Lv, Zhao;  Liang, Shan;  Wen, Zhengqi;  Pei, Guanxiong;  Li, Taihao;  Tao, Jianhua
收藏  |  浏览/下载:51/0  |  提交时间:2024/02/22
Multimodal sentiment analysis  Vision-language  Multimodal fusion  
Progressive Pretraining Network for 3D System Matrix Calibration in Magnetic Particle Imaging 期刊论文
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 卷号: 42, 期号: 12, 页码: 3639-3650
作者:  Shi, GenY;  Yin, Lin;  An, Yu;  Li, Guanghui;  Zhang, Liwen;  Bian, Zhongwei;  Chen, Ziwei;  Zhang, Haoran;  Hui, Hui;  Tian, Jie
收藏  |  浏览/下载:26/0  |  提交时间:2024/02/22
Magnetic particle imaging  system matrix  multimodal data  pretraining strategy  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:33/10  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
VQAPT: A New visual question answering model for personality traits in social media images 期刊论文
PATTERN RECOGNITION LETTERS, 2023, 卷号: 175, 页码: 66-73
作者:  Biswas, Kunal;  Shivakumara, Palaiahnakote;  Pal, Umapada;  Liu, Cheng-Lin;  Lu, Yue
收藏  |  浏览/下载:32/0  |  提交时间:2024/02/22
Personality trait images  Multimodal concept  Text recognition  Social media images  Natural language processing  Visual question answering  
Two Birds With One Stone: Knowledge-Embedded Temporal Convolutional Transformer for Depression Detection and Emotion Recognition 期刊论文
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 卷号: 14, 期号: 4, 页码: 2595-2613
作者:  Zheng, Wenbo;  Yan, Lan;  Wang, Fei-Yue
收藏  |  浏览/下载:17/0  |  提交时间:2024/03/27
Multimodal depression detection  multimodal emotion recognition  transformer  knowledge embedding  joint learning  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:88/11  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
An Accurate Outlier Rejection Network With Higher Generalization Ability for Point Cloud Registration 期刊论文
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 卷号: 8, 期号: 8, 页码: 4649-4656
作者:  Guo, Shiyi;  Tang, Fulin;  Liu, Bingxi;  Fu, Yujie;  Wu, Yihong
Adobe PDF(1675Kb)  |  收藏  |  浏览/下载:170/45  |  提交时间:2023/11/17
Point cloud registration  Three-dimensional displays  Feature extraction  Correlation  Learning systems  Task analysis  Robustness  3D feature  outlier rejection  
A Coarse-to-Fine Feature Match Network Using Transformers for Remote Sensing Image Registration 期刊论文
Remote Sensing, 2023, 页码: 3243
作者:  Liang Chenbin;  Dong Yunyun;  Changjun Zhao;  Zengguo Sun
Adobe PDF(44590Kb)  |  收藏  |  浏览/下载:134/36  |  提交时间:2023/06/26
Contrastive Adversarial Training for Multi-Modal Machine Translation 期刊论文
ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, 卷号: 22, 期号: 6, 页码: 157:1-18
作者:  Huang X(黄鑫);  Zhang JJ(张家俊);  Zong CQ(宗成庆)
Adobe PDF(2387Kb)  |  收藏  |  浏览/下载:184/56  |  提交时间:2023/06/26
contrastive learning  adversarial training  multi-modal machine translation