CASIA OpenIR

浏览/检索结果: 共139条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:96/9  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 9
作者:  Yi, Guofeng;  Fan, Cunhang;  Zhu, Kang;  Lv, Zhao;  Liang, Shan;  Wen, Zhengqi;  Pei, Guanxiong;  Li, Taihao;  Tao, Jianhua
收藏  |  浏览/下载:53/0  |  提交时间:2024/02/22
Multimodal sentiment analysis  Vision-language  Multimodal fusion  
CGFormer: ViT-Based Network for Identifying Computer-Generated Images With Token Labeling 期刊论文
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 卷号: 19, 页码: 235-250
作者:  Quan, Weize;  Deng, Pengfei;  Wang, Kai;  Yan, Dong-Ming
收藏  |  浏览/下载:28/0  |  提交时间:2024/02/22
CG image forensics  transformer  token labeling  generalization  robustness  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:34/10  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Artificial intelligence for automatic surgical phase recognition of laparoscopic gastrectomy in gastric cancer 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 页码: 9
作者:  Zhai, Yuhao;  Chen, Zhen;  Zheng, Zhi;  Wang, Xi;  Yan, Xiaosheng;  Liu, Xiaoye;  Yin, Jie;  Wang, Jinqiao;  Zhang, Jun
收藏  |  浏览/下载:73/0  |  提交时间:2023/12/21
Artificial intelligence  Gastric cancer  Surgical phase  
VQAPT: A New visual question answering model for personality traits in social media images 期刊论文
PATTERN RECOGNITION LETTERS, 2023, 卷号: 175, 页码: 66-73
作者:  Biswas, Kunal;  Shivakumara, Palaiahnakote;  Pal, Umapada;  Liu, Cheng-Lin;  Lu, Yue
收藏  |  浏览/下载:32/0  |  提交时间:2024/02/22
Personality trait images  Multimodal concept  Text recognition  Social media images  Natural language processing  Visual question answering  
SOTVerse: A User-Defined Task Space of Single Object Tracking 期刊论文
International Journal of Computer Vision, 2023, 页码: 1-59
作者:  Shiyu, Hu;  Xin, Zhao;  Kaiqi Huang
Adobe PDF(53048Kb)  |  收藏  |  浏览/下载:40/4  |  提交时间:2024/01/22
Single object tracking  Experimental environment  Evaluation system  Performance analysis  
Motion Forecasting Network (MoFCNet): IMU-Based Human Motion Forecasting for Hip Assistive Exoskeleton 期刊论文
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 卷号: 8, 期号: 9, 页码: 5783-5790
作者:  Zhang, Xingxuan;  Zhang, Haojian;  Hu, Jianhua;  Deng, Jieren;  Wang, Yunkuan
Adobe PDF(1895Kb)  |  收藏  |  浏览/下载:156/69  |  提交时间:2023/11/17
Intention recognition  prosthetics and exoskeletons  human motion forecasting  
Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 5009-5021
作者:  Ji, Ruyi;  Li, Jiaying;  Zhang, Libo;  Liu, Jing;  Wu, Yanjun
收藏  |  浏览/下载:94/0  |  提交时间:2023/11/16
Transformer  multi-grained assembly  fine-grained visual classification  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:94/12  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection