CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共54条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
视觉语言导航研究进展 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 1, 页码: 1-14
作者:  司马双霖;  黄岩;  何科技;  安东;  袁辉;  王亮
Adobe PDF(6272Kb)  |  收藏  |  浏览/下载:16/5  |  提交时间:2024/05/09
视觉语言导航  视觉语言理解  跨模态匹配  具身智能  
Text-to-Image Vehicle Re-Identification: Multi-Scale Multi-View Cross-Modal Alignment Network and a Unified Benchmark 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 页码: 14
作者:  Ding, Leqi;  Liu, Lei;  Huang, Yan;  Li, Chenglong;  Zhang, Cheng;  Wang, Wei;  Wang, Liang
收藏  |  浏览/下载:34/0  |  提交时间:2024/03/27
Task analysis  Feature extraction  Visualization  Training  Electronic mail  Benchmark testing  Trajectory  Text-to-image vehicle re-identification  cross-modal alignment  multi-scale multi-view analysis  benchmark dataset  
Enhancing Person Re-Identification Performance Through In Vivo Learning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 639-654
作者:  Huang, Yan;  Huang, Yan;  Zhang, Zhang;  Wu, Qiang;  Zhong, Yi;  Wang, Liang
收藏  |  浏览/下载:27/0  |  提交时间:2024/02/20
Person re-identification  in vivo learning  boosting performance  
Progressive Sub-Domain Information Mining for Single-Source Generalizable Gait Recognition 期刊论文
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 卷号: 18, 页码: 4787-4799
作者:  Wang, Yang;  Huang, Yan;  Shan, Caifeng;  Wang, Liang
收藏  |  浏览/下载:20/0  |  提交时间:2023/11/17
Gait recognition  Data models  Task analysis  Training  Computational modeling  Feature extraction  Pipelines  domain generalization  clustering  domain-invariant feature  
End-to-End Alternating Optimization for Real-World Blind Super Resolution 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 页码: 18
作者:  Luo, Zhengxiong;  Huang, Yan;  Li, Shang;  Wang, Liang;  Tan, Tieniu
收藏  |  浏览/下载:49/0  |  提交时间:2023/11/17
Blind super resolution  Degradation estimation  Alternating optimization  Restorer  Estimator  
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 3622-3633
作者:  Liu, Chong;  Zhang, Yuqi;  Wang, Hongsong;  Chen, Weihua;  Wang, Fan;  Huang, Yan;  Shen, Yi-Dong;  Wang, Liang
收藏  |  浏览/下载:101/0  |  提交时间:2023/11/17
Index Terms-Image-text retrieval  multimodal transformer  multimodal contrastive training  
Joint Token and Feature Alignment Framework for Text-Based Person Search 期刊论文
IEEE SIGNAL PROCESSING LETTERS, 2022, 卷号: 29, 页码: 2238-2242
作者:  Li, Shangze;  Lu, Andong;  Huang, Yan;  Li, Chenglong;  Wang, Liang
收藏  |  浏览/下载:181/0  |  提交时间:2022/12/27
Feature extraction  Visualization  Representation learning  Logic gates  Image reconstruction  Transformers  Training  Cross-modal generation  feature alignment  text-based person search  token alignment  transformer  
Towards Unconstrained Pointing Problem of Visual Question Answering: A Retrieval-based Method 会议论文
, 北京国际会议中心, 2018-08
作者:  Cheng, Wenlong;  Huang, Yan;  Wang, Liang
Adobe PDF(351Kb)  |  收藏  |  浏览/下载:144/28  |  提交时间:2022/06/14
A Reconstruction-based Visual-Acoustic-Semantic Embedding Method for Speech-Image Retrieval 期刊论文
IEEE Transactions on Multimedia, 2022, 页码: 14
作者:  Cheng, Wenlong;  Tang, Wei;  Huang, Yan;  Luo, Yiwen;  Wang, Liang
Adobe PDF(1628Kb)  |  收藏  |  浏览/下载:246/91  |  提交时间:2022/06/14
Deconvolutional Generative Adversarial Networks with Application to Video Generation 会议论文
, 西安, 2019年
作者:  Yu HY(俞宏远);  Huang Y(黄岩);  Pi, Lihong;  Wang L(王亮)
Adobe PDF(1048Kb)  |  收藏  |  浏览/下载:115/38  |  提交时间:2022/06/14