CASIA OpenIR

Browse/search results: 215 items in total, showing items 1-10

Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network With Graph Representation Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 14
Authors:  Qi, Xingqun;  Sun, Muyi;  Wang, Zijian;  Liu, Jiaming;  Li, Qi;  Zhao, Fang;  Zhang, Shanghang;  Shan, Caifeng
Adobe PDF(6718Kb)  |  Views/Downloads: 85/32  |  Submitted: 2024/02/22
Face photo-sketch synthesis  generative adversarial network  graph representation learning  intraclass and interclass  iterative cycle training (ICT)  
GAN-Based Facial Attribute Manipulation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 12, Pages: 14590-14610
Authors:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  Views/Downloads: 36/11  |  Submitted: 2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Robotic grasping and assembly of screws based on visual servoing using point features (Journal article)
INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2023, Pages: 13
Authors:  Hao, Tiantian;  Xu, De
Views/Downloads: 56/0  |  Submitted: 2023/12/21
Feature extraction  Image-based visual servoing  Position alignment  Robotic grasping  Robotic assembly  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 9, Pages: 4616-4629
Authors:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  Views/Downloads: 98/12  |  Submitted: 2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
A Coarse-to-Fine Feature Match Network Using Transformers for Remote Sensing Image Registration (Journal article)
Remote Sensing, 2023, Pages: 3243
Authors:  Liang Chenbin;  Dong Yunyun;  Changjun Zhao;  Zengguo Sun
Adobe PDF(44590Kb)  |  Views/Downloads: 141/38  |  Submitted: 2023/06/26
GPDAN: Grasp Pose Domain Adaptation Network for Sim-to-Real 6-DoF Object Grasping (Journal article)
IEEE Robotics and Automation Letters, 2023, Pages: 1-8
Authors:  Liming Zheng;  Wenxuan Ma;  Yinghao Cai;  Tao Lu;  Shuo Wang
Adobe PDF(2161Kb)  |  Views/Downloads: 243/107  |  Submitted: 2023/06/14
Contrastive Adversarial Training for Multi-Modal Machine Translation (Journal article)
ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, Volume: 22, Issue: 6, Pages: 157:1-18
Authors:  Huang X(黄鑫);  Zhang JJ(张家俊);  Zong CQ(宗成庆)
Adobe PDF(2387Kb)  |  Views/Downloads: 201/57  |  Submitted: 2023/06/26
contrastive learning  adversarial training  multi-modal machine translation  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering (Journal article)
IEEE Transactions on Multimedia, 2023, Pages: 1-15
Authors:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  Views/Downloads: 165/44  |  Submitted: 2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
BViT: Broad Attention-Based Vision Transformer (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2023, Pages: 1-12
Authors:  Nannan Li;  Yaran Chen;  Weifan Li;  Zixiang Ding;  Dongbin Zhao;  Shuai Nie
Adobe PDF(2171Kb)  |  Views/Downloads: 190/52  |  Submitted: 2023/06/27
Broad attention  broad connection  image classification  parameter-free attention  vision transformer  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation (Journal article)
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 1-13
Authors:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  Views/Downloads: 122/22  |  Submitted: 2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer