CASIA OpenIR

浏览/检索结果: 共386条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:107/10  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:36/11  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:97/12  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
GPDAN: Grasp Pose Domain Adaptation Network for Sim-to-Real 6-DoF Object Grasping 期刊论文
IEEE Robotics and Automation Letters, 2023, 页码: 1-8
作者:  Liming Zheng;  Wenxuan Ma;  Yinghao Cai;  Tao Lu;  Shuo Wang
Adobe PDF(2161Kb)  |  收藏  |  浏览/下载:231/106  |  提交时间:2023/06/14
Contrastive Adversarial Training for Multi-Modal Machine Translation 期刊论文
ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, 卷号: 22, 期号: 6, 页码: 157:1-18
作者:  Huang X(黄鑫);  Zhang JJ(张家俊);  Zong CQ(宗成庆)
Adobe PDF(2387Kb)  |  收藏  |  浏览/下载:197/56  |  提交时间:2023/06/26
contrastive learning  adversarial training  multi-modal machine translation  
3D Semantic Segmentation of Aerial Photogrammetry Models Based on Orthographic Projection 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 页码: early-access
作者:  Mengqi Rong;  Shuhan Shen
Adobe PDF(5811Kb)  |  收藏  |  浏览/下载:108/36  |  提交时间:2023/09/25
DCAT: Dual Cross-Attention-Based Transformer for Change Detection 期刊论文
Remote Sensing, 2023, 卷号: 15, 期号: 9, 页码: 2395
作者:  Yuan Zhou;  Chunlei Huo;  Jiahang Zhu;  Leigang Huo;  Chunhong Pan
Adobe PDF(47919Kb)  |  收藏  |  浏览/下载:131/18  |  提交时间:2023/06/16
change detection  transformer  dual cross-attention  remote sensing  
Autonomous vision-based navigation and stability augmentation control of a biomimetic robotic hammerhead shark 期刊论文
IEEE Transactions on Automation Science and Engineering, 2023, 页码: 1-13
作者:  Yan, Shuaizheng;  Wang, Jian;  Wu, Zhengxing;  Tan, Min;  Yu, Junzhi
Adobe PDF(3384Kb)  |  收藏  |  浏览/下载:172/37  |  提交时间:2023/06/12
Robots  Navigation  Visualization  Fish  Biomimetics  Stability analysis  
BViT: Broad Attention-Based Vision Transformer 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2023, 页码: 1 - 12
作者:  Nannan Li;  Yaran Chen;  Weifan Li;  Zixiang Ding;  Dongbin Zhao;  Shuai Nie
Adobe PDF(2171Kb)  |  收藏  |  浏览/下载:186/50  |  提交时间:2023/06/27
Broad attention  broad connection  image classification  parameter-free attention  vision transformer  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:121/22  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer