CASIA OpenIR

Browse/Search Results: 29 items in total, showing items 1-10

Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 4, Pages: 569-582
Authors: Haoyu Lu; Yuqi Huo; Mingyu Ding; Nanyi Fei; Zhiwu Lu
Adobe PDF(2928Kb)  |  Views/Downloads: 2/1  |  Submitted: 2024/04/23
Image-text retrieval, multimodal modeling, contrastive learning, weak correlation, computer vision  
Text-to-Image Vehicle Re-Identification: Multi-Scale Multi-View Cross-Modal Alignment Network and a Unified Benchmark (Journal article)
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, Pages: 14
Authors: Ding, Leqi; Liu, Lei; Huang, Yan; Li, Chenglong; Zhang, Cheng; Wang, Wei; Wang, Liang
Views/Downloads: 17/0  |  Submitted: 2024/03/27
Task analysis  Feature extraction  Visualization  Training  Electronic mail  Benchmark testing  Trajectory  Text-to-image vehicle re-identification  cross-modal alignment  multi-scale multi-view analysis  benchmark dataset  
Cross-Lingual Text Image Recognition via Multi-Hierarchy Cross-Modal Mimic (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, Volume: 25, Pages: 4830-4841
Authors: Chen, Zhuo; Yin, Fei; Yang, Qing; Liu, Cheng-Lin
Views/Downloads: 21/0  |  Submitted: 2024/02/22
Cross-lingual text image recognition  cross-modal mimic  multihierarchy mimic  
Reparameterizing and dynamically quantizing image features for image generation (Journal article)
PATTERN RECOGNITION, 2024, Volume: 146, Pages: 11
Authors: Sun, Mingzhen; Wang, Weining; Zhu, Xinxin; Liu, Jing
Adobe PDF(3612Kb)  |  Views/Downloads: 99/9  |  Submitted: 2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 3622-3633
Authors: Liu, Chong; Zhang, Yuqi; Wang, Hongsong; Chen, Weihua; Wang, Fan; Huang, Yan; Shen, Yi-Dong; Wang, Liang
Views/Downloads: 93/0  |  Submitted: 2023/11/17
Image-text retrieval  multimodal transformer  multimodal contrastive training  
Joint Token and Feature Alignment Framework for Text-Based Person Search (Journal article)
IEEE SIGNAL PROCESSING LETTERS, 2022, Volume: 29, Pages: 2238-2242
Authors: Li, Shangze; Lu, Andong; Huang, Yan; Li, Chenglong; Wang, Liang
Views/Downloads: 161/0  |  Submitted: 2022/12/27
Feature extraction  Visualization  Representation learning  Logic gates  Image reconstruction  Transformers  Training  Cross-modal generation  feature alignment  text-based person search  token alignment  transformer  
Instance GNN: A Learning Framework for Joint Symbol Segmentation and Recognition in Online Handwritten Diagrams (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, Volume: 24, Pages: 2580-2594
Authors: Yun, Xiao-Long; Zhang, Yan-Ming; Yin, Fei; Liu, Cheng-Lin
Views/Downloads: 218/0  |  Submitted: 2022/07/25
Handwriting recognition  Task analysis  Grammar  Semantics  Image segmentation  Trajectory  Text recognition  Online handwritten diagram recognition  symbol segmentation  symbol recognition  freehand sketch analysis  graph neural networks  
Tell, Imagine, and Search: End-to-end Learning for Composing Text and Image to Image Retrieval (Journal article)
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, Volume: 18, Issue: 2, Pages: 23
Authors: Zhang, Feifei; Xu, Mingliang; Xu, Changsheng
Views/Downloads: 209/0  |  Submitted: 2022/06/10
Composing text and image to image retrieval  end-to-end  image generation  generative adversarial network  global-local  
Geometry Sensitive Cross-Modal Reasoning for Composed Query Based Image Retrieval (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 1000-1011
Authors: Zhang, Feifei; Xu, Mingliang; Xu, Changsheng
Views/Downloads: 227/0  |  Submitted: 2022/02/16
Visualization  Image retrieval  Semantics  Cognition  Geometry  Task analysis  Electronic mail  Composed query based image retrieval  semantic gap  spatial structure  inter-modal attention  text-guided visual reasoning  
Semantic Image Synthesis via Conditional Cycle-Generative Adversarial Networks (Conference paper)
Beijing, China, August 20-24, 2018
Authors: Xiyan Liu; Gaofeng Meng; Shiming Xiang; Chunhong Pan
Adobe PDF(4929Kb)  |  Views/Downloads: 93/40  |  Submitted: 2022/01/24
Image synthesis  Text-to-image  Generative adversarial networks