CASIA OpenIR

浏览/检索结果: 共632条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Text-to-Image Vehicle Re-Identification: Multi-Scale Multi-View Cross-Modal Alignment Network and a Unified Benchmark 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 页码: 14
作者:  Ding, Leqi;  Liu, Lei;  Huang, Yan;  Li, Chenglong;  Zhang, Cheng;  Wang, Wei;  Wang, Liang
收藏  |  浏览/下载:15/0  |  提交时间:2024/03/27
Task analysis  Feature extraction  Visualization  Training  Electronic mail  Benchmark testing  Trajectory  Text-to-image vehicle re-identification  cross-modal alignment  multi-scale multi-view analysis  benchmark dataset  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:33/10  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Key-Part Attention Retrieval for Robotic Object Recognition 期刊论文
TSINGHUA SCIENCE AND TECHNOLOGY, 2024, 卷号: 29, 期号: 3, 页码: 644-655
作者:  Liu, Jierui;  Cao, Zhiqiang;  Tang, Yingbo
收藏  |  浏览/下载:39/0  |  提交时间:2024/02/22
Training  Visualization  Image recognition  Cameras  Object recognition  Convolutional neural networks  Data mining  key-part attention  retrieval  robotic object recognition  
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:94/8  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Human Parsing With Part-Aware Relation Modeling 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 25, 页码: 2601-2612
作者:  Zhang, Xiaomei;  Chen, Yingying;  Tang, Ming;  Wang, Jinqiao;  Zhu, Xiangyu;  Lei, Zhen
Adobe PDF(6053Kb)  |  收藏  |  浏览/下载:100/5  |  提交时间:2023/11/17
Human parsing  modeling  part-aware relation  
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 3622-3633
作者:  Liu, Chong;  Zhang, Yuqi;  Wang, Hongsong;  Chen, Weihua;  Wang, Fan;  Huang, Yan;  Shen, Yi-Dong;  Wang, Liang
收藏  |  浏览/下载:89/0  |  提交时间:2023/11/17
Index Terms-Image-text retrieval  multimodal transformer  multimodal contrastive training  
Improving Inconspicuous Attributes Modeling for Person Search by Language 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 3429-3441
作者:  Niu, Kai;  Huang, Tao;  Huang, Linjiang;  Wang, Liang;  Zhang, Yanning
收藏  |  浏览/下载:72/0  |  提交时间:2023/11/17
Person search by language  cross-modal retrieval  smart video surveillance  
Self-Supervised Modality-Aware Multiple Granularity Pre-Training for RGB-Infrared Person Re-Identification 期刊论文
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 卷号: 18, 页码: 3044-3057
作者:  Wan, Lin;  Jing, Qianyan;  Sun, Zongyuan;  Zhang, Chuang;  Li, Zhihang;  Chen, Yehansen
收藏  |  浏览/下载:66/0  |  提交时间:2023/11/17
Task analysis  Training  Feature extraction  Lighting  Cameras  Visualization  Self-supervised learning  Cross-modality person re-identification  self-supervised learning  multi-modality pre-training  
AAformer: Auto-Aligned Transformer for Person Re-Identification 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 11
作者:  Zhu, Kuan;  Guo, Haiyun;  Zhang, Shiliang;  Wang, Yaowei;  Liu, Jing;  Wang, Jinqiao;  Tang, Ming
收藏  |  浏览/下载:95/0  |  提交时间:2023/11/16
Auto-alignment  part-level representation  person re-identification (re-ID)  transformer  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:91/12  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection