CASIA OpenIR

浏览/检索结果: 共11条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
The Model May Fit You: User-Generalized Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 24, 页码: 2998-3012
作者:  Ma, Xinhong;  Yang, Xiaoshan;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(6549Kb)  |  收藏  |  浏览/下载:298/59  |  提交时间:2022/06/17
cross-modal retrieval  domain generalization  meta-learning  
Beyond Crack: Fine-Grained Pavement Defect Segmentation Using Three-Stream Neural Networks 期刊论文
IEEE Transactions on Intelligent Transportation Systems, 2021, 卷号: /, 期号: /, 页码: /
作者:  Zhang, Yujia;  Wu, Junxian;  Li, Qianzhong;  Zhao, Xiaoguang;  Tan, Min
Adobe PDF(12585Kb)  |  收藏  |  浏览/下载:380/65  |  提交时间:2022/04/02
Fine-grained defect segmentation  Crack detection  Semantic segmentation  Pavement inspection  
Graph-based Multimodal Ranking Models for Multimodal Summarization 期刊论文
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 卷号: 20, 期号: 4, 页码: 21
作者:  Zhu, Junnan;  Xiang, Lu;  Zhou, Yu;  Zhang, Jiajun;  Zong, Chengqing
Adobe PDF(4193Kb)  |  收藏  |  浏览/下载:330/71  |  提交时间:2021/12/28
Multimodal summarization  single-modal  multimodal ranking  unsupervised  
Transformers in computational visual media: A survey 期刊论文
Computational Visual Media, 2021, 卷号: 8, 期号: 1, 页码: 33-62
作者:  Xu,Yifan;  Wei,Huapeng;  Lin,Minxuan;  Deng,Yingying;  Sheng,Kekai;  Zhang,Mengdan;  Tang,Fan;  Dong,Weiming;  Huang,Feiyue;  Xu,Changsheng
Adobe PDF(5366Kb)  |  收藏  |  浏览/下载:338/49  |  提交时间:2021/12/28
visual transformer  computational visual media (CVM)  high-level vision  low-level vision  image generation  multi-modal learning  
Handwritten Text Generation via Disentangled Representations 期刊论文
IEEE SIGNAL PROCESSING LETTERS, 2021, 卷号: 28, 期号: 2021, 页码: 1838-1842
作者:  Liu, Xiyan;  Meng, Gaofeng;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(1272Kb)  |  收藏  |  浏览/下载:328/76  |  提交时间:2021/11/04
Disentangled representation  Handwritten text generation  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 2386-2397
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(2165Kb)  |  收藏  |  浏览/下载:362/49  |  提交时间:2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
Rethinking semantic-visual alignment in zero-shot object detection via a softplus margin focal loss 期刊论文
Neurocomputing, 2021, 卷号: 449, 页码: 117-135
作者:  Li, Qianzhong;  Zhang, Yujia;  Sun, Shiying;  Zhao, Xiaoguang;  Li, Kang;  Tan, Min
Adobe PDF(8753Kb)  |  收藏  |  浏览/下载:341/53  |  提交时间:2021/08/15
Zero-shot object detection  Softplus margin focal loss  Semantic-visual alignment  Auto-encoder architecture  
A time-frequency channel attention and vectorization network for automatic depression level prediction 期刊论文
Neurocomputing, 2021, 期号: 450, 页码: 208-218
作者:  Mingyue Niu;  Bin Liu;  Jianhua Tao;  Qifei Li
Adobe PDF(2001Kb)  |  收藏  |  浏览/下载:215/63  |  提交时间:2021/06/01
Sphere embedding normalization  DenseNet  Transition layer  Time-frequency channel attention block  Time-frequency vectorization block  Depression detection  
Robot learning through observation via coarse-to-fine grained video summarization 期刊论文
APPLIED SOFT COMPUTING, 2021, 卷号: 99, 期号: /, 页码: 106913
作者:  Zhang, Yujia;  Li, Qianzhong;  Zhao, Xiaoguang;  Tan, Min
Adobe PDF(5989Kb)  |  收藏  |  浏览/下载:395/83  |  提交时间:2021/03/08
Robotic vision  Learning through observation  Coarse-to-fine video summarization  
Learning Aligned Image-Text Representations Using Graph Attentive Relational Network 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 期号: 30, 页码: 1840-1852
作者:  Jing, Ya;  Wang, Wei;  Wang, Liang;  Tan, Tieniu
Adobe PDF(4532Kb)  |  收藏  |  浏览/下载:378/64  |  提交时间:2021/03/08
Graph neural networks  Visualization  Semantics  Task analysis  Feature extraction  Annotations  Recurrent neural networks  Image-text matching  cross-modal retrieval  person search  graph neural network