CASIA OpenIR

浏览/检索结果: 共17条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
The Model May Fit You: User-Generalized Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 24, 页码: 2998-3012
作者:  Ma, Xinhong;  Yang, Xiaoshan;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(6549Kb)  |  收藏  |  浏览/下载:298/59  |  提交时间:2022/06/17
cross-modal retrieval  domain generalization  meta-learning  
Beyond Crack: Fine-Grained Pavement Defect Segmentation Using Three-Stream Neural Networks 期刊论文
IEEE Transactions on Intelligent Transportation Systems, 2021, 卷号: /, 期号: /, 页码: /
作者:  Zhang, Yujia;  Wu, Junxian;  Li, Qianzhong;  Zhao, Xiaoguang;  Tan, Min
Adobe PDF(12585Kb)  |  收藏  |  浏览/下载:380/65  |  提交时间:2022/04/02
Fine-grained defect segmentation  Crack detection  Semantic segmentation  Pavement inspection  
Graph-based Multimodal Ranking Models for Multimodal Summarization 期刊论文
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 卷号: 20, 期号: 4, 页码: 21
作者:  Zhu, Junnan;  Xiang, Lu;  Zhou, Yu;  Zhang, Jiajun;  Zong, Chengqing
Adobe PDF(4193Kb)  |  收藏  |  浏览/下载:330/71  |  提交时间:2021/12/28
Multimodal summarization  single-modal  multimodal ranking  unsupervised  
Transformers in computational visual media: A survey 期刊论文
Computational Visual Media, 2021, 卷号: 8, 期号: 1, 页码: 33-62
作者:  Xu,Yifan;  Wei,Huapeng;  Lin,Minxuan;  Deng,Yingying;  Sheng,Kekai;  Zhang,Mengdan;  Tang,Fan;  Dong,Weiming;  Huang,Feiyue;  Xu,Changsheng
Adobe PDF(5366Kb)  |  收藏  |  浏览/下载:338/49  |  提交时间:2021/12/28
visual transformer  computational visual media (CVM)  high-level vision  low-level vision  image generation  multi-modal learning  
Encoding-decoding Network With Pyramid Self-attention Module for Retinal Vessel Segmentation 期刊论文
International Journal of Automation and Computing, 2021, 卷号: 18, 期号: 6, 页码: 973-980
作者:  Cong-Zhong Wu;  Jun Sun;  Jing Wang;  Liang-Feng Xu;  Shu Zhan
Adobe PDF(1416Kb)  |  收藏  |  浏览/下载:217/52  |  提交时间:2021/11/26
Retina vessel segmentation  deep learning  U-Net  attention mechanism  medical image  
Handwritten Text Generation via Disentangled Representations 期刊论文
IEEE SIGNAL PROCESSING LETTERS, 2021, 卷号: 28, 期号: 2021, 页码: 1838-1842
作者:  Liu, Xiyan;  Meng, Gaofeng;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(1272Kb)  |  收藏  |  浏览/下载:328/76  |  提交时间:2021/11/04
Disentangled representation  Handwritten text generation  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 2386-2397
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(2165Kb)  |  收藏  |  浏览/下载:362/49  |  提交时间:2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
Variational Gridded Graph Convolution Network for Node Classification 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2021, 卷号: 8, 期号: 10, 页码: 1697-1708
作者:  Xiaobin Hong;  Tong Zhang;  Zhen Cui;  Jian Yang
Adobe PDF(2419Kb)  |  收藏  |  浏览/下载:154/45  |  提交时间:2021/09/03
Graph coarsening  gridding  node classification  random walk  variational convolution  
Rethinking semantic-visual alignment in zero-shot object detection via a softplus margin focal loss 期刊论文
Neurocomputing, 2021, 卷号: 449, 页码: 117-135
作者:  Li, Qianzhong;  Zhang, Yujia;  Sun, Shiying;  Zhao, Xiaoguang;  Li, Kang;  Tan, Min
Adobe PDF(8753Kb)  |  收藏  |  浏览/下载:341/53  |  提交时间:2021/08/15
Zero-shot object detection  Softplus margin focal loss  Semantic-visual alignment  Auto-encoder architecture  
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021, 期号: 29, 页码: 1897 - 1911
作者:  Ye Bai;  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Zhengqi Wen;  Shuai Zhang
Adobe PDF(1163Kb)  |  收藏  |  浏览/下载:216/69  |  提交时间:2021/06/25
端到端语音识别、迁移学习、知识蒸馏、老师-学生学习、BERT、非自回归语音识别