CASIA OpenIR

Browse/Search results: 728 items in total, showing items 1-10

Key-Part Attention Retrieval for Robotic Object Recognition (Journal article)
TSINGHUA SCIENCE AND TECHNOLOGY, 2024, Volume: 29, Issue: 3, Pages: 644-655
Authors:  Liu, Jierui;  Cao, Zhiqiang;  Tang, Yingbo
Views/Downloads: 38/0  |  Submitted: 2024/02/22
Training  Visualization  Image recognition  Cameras  Object recognition  Convolutional neural networks  Data mining  key-part attention  retrieval  robotic object recognition  
Reparameterizing and dynamically quantizing image features for image generation (Journal article)
PATTERN RECOGNITION, 2024, Volume: 146, Pages: 11
Authors:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  Views/Downloads: 85/7  |  Submitted: 2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Learning Proposal-Aware Re-Ranking for Weakly-Supervised Temporal Action Localization (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 1, Pages: 207-220
Authors:  Hu, Yufan;  Fu, Jie;  Chen, Mengyuan;  Gao, Junyu;  Dong, Jianfeng;  Fan, Bin;  Liu, Hongmin
Views/Downloads: 10/0  |  Submitted: 2024/03/26
Proposals  Feature extraction  Location awareness  Videos  Measurement  Task analysis  Optimization  weakly-supervised temporal action localization  Proposal-aware reranking  
GAN-Based Facial Attribute Manipulation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 12, Pages: 14590-14610
Authors:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  Views/Downloads: 31/9  |  Submitted: 2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 9, Pages: 4616-4629
Authors:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  Views/Downloads: 82/11  |  Submitted: 2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
Contrastive Multi-Modal Knowledge Graph Representation Learning (Journal article)
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, Volume: 35, Issue: 9, Pages: 8983-8996
Authors:  Fang, Quan;  Zhang, Xiaowei;  Hu, Jun;  Wu, Xian;  Xu, Changsheng
Views/Downloads: 51/0  |  Submitted: 2023/11/17
Knowledge graph  multimedia  graph neural network  contrastive learning  
AAformer: Auto-Aligned Transformer for Person Re-Identification (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 11
Authors:  Zhu, Kuan;  Guo, Haiyun;  Zhang, Shiliang;  Wang, Yaowei;  Liu, Jing;  Wang, Jinqiao;  Tang, Ming
Views/Downloads: 86/0  |  Submitted: 2023/11/16
Auto-alignment  part-level representation  person re-identification (re-ID)  transformer  
Boosting deep cross-modal retrieval hashing with adversarially robust training (Journal article)
APPLIED INTELLIGENCE, 2023, Pages: 13
Authors:  Zhang, Xingwei;  Zheng, Xiaolong;  Mao, Wenji;  Zeng, Daniel Dajun
Views/Downloads: 62/0  |  Submitted: 2023/11/17
Cross-modal retrieval  Adversarial training  Deep hashing model  Deep neural network  
A Coarse-to-Fine Feature Match Network Using Transformers for Remote Sensing Image Registration (Journal article)
Remote Sensing, 2023, Pages: 3243
Authors:  Liang Chenbin;  Dong Yunyun;  Changjun Zhao;  Zengguo Sun
Adobe PDF(44590Kb)  |  Views/Downloads: 132/36  |  Submitted: 2023/06/26
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering (Journal article)
IEEE Transactions on Multimedia, 2023, Pages: 1-15
Authors:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  Views/Downloads: 152/41  |  Submitted: 2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation