CASIA OpenIR

浏览/检索结果: 共15条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:20/9  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Pavement Defect Detection with Deep Learning: A Comprehensive Survey 期刊论文
IEEE Transactions on Intelligent Vehicles, 2023, 卷号: 9, 期号: 3, 页码: 4292 - 4311
作者:  Lili Fan;  Dandan Wang;  Junhao Wang;  Yunjie Li;  Yifeng Cao;  Yi Liu;  Xiaoming Chen;  Yutong Wang
Adobe PDF(6287Kb)  |  收藏  |  浏览/下载:40/10  |  提交时间:2024/06/06
Deep learning  pavement defect detection  computer vision  image processing  3D image  
Hierarchical Attention Network for Open-Set Fine-Grained Recognition 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 页码: 1-14
作者:  Jiayin, Sun;  Hong, Wang;  Qiulei, Dong
Adobe PDF(2596Kb)  |  收藏  |  浏览/下载:46/13  |  提交时间:2024/05/28
Improved Video Emotion Recognition with Alignment of CNN and Human Brain Representations 期刊论文
IEEE Transactions on Affective Computing, 2023, 页码: 1-15
作者:  Fu, Kaicheng;  Du, Changde;  Wang, Shengpei;  He, Huiguang
Adobe PDF(3907Kb)  |  收藏  |  浏览/下载:54/17  |  提交时间:2024/05/28
CNN-brain Alignment  Brain-guided Deep Learning  Video Emotion Recognition  Representation Similarity Analysis  
Learning to Reduce Scale Differences for Large-Scale Invariant Image Matching 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 卷号: 33, 期号: 3, 页码: 1335 - 1348
作者:  Fu Yujie;  Zhang Pengju;  Liu Bingxi;  Rong Zheng;  Wu Yihong
Adobe PDF(4879Kb)  |  收藏  |  浏览/下载:44/15  |  提交时间:2024/05/28
Image Matching  Large Scale Changes  Scale Difference Reduction  Scale Ratio Estimation  Covisibility-attention-reinforced Matching Module  
Bird's-Eye-View Semantic Segmentation With Two-Stream Compact Depth Transformation and Feature Rectification 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 卷号: 8, 期号: 11, 页码: 4546-4558
作者:  Liu, Jierui;  Cao, Zhiqiang;  Yang, Jing;  Liu, Xilong;  Yang, Yuequan;  Qu, Zhiyou
Adobe PDF(21890Kb)  |  收藏  |  浏览/下载:75/11  |  提交时间:2024/03/27
Bird's-eye-view  semantic segmentation  two-stream compact depth transformation  feature rectification  
Temporal Action Proposal Generation With Action Frequency Adaptive Network 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 2340 - 2353
作者:  Yepeng Tang;  Weining Wang;  Chunjie Zhang;  Jing Liu;  Yao Zhao
Adobe PDF(10095Kb)  |  收藏  |  浏览/下载:65/18  |  提交时间:2024/03/26
Proposals  Task analysis  Data models  Time-frequency analysis  Representation learning  Predictive models  Information science  Temporal action proposal generation  expert learning  fine-gained detection  action frequency  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:91/23  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Reducing Vision-Answer Biases for Multiple-Choice VQA 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4621-4634
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(2684Kb)  |  收藏  |  浏览/下载:84/2  |  提交时间:2023/11/17
Multiple-choice VQA  vision-answer bias  causal intervention  counterfactual interaction learning  
Attention Weighted Local Descriptors 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 9, 页码: 10632-10649
作者:  Wang, Changwei;  Xu, Rongtao;  Lu, Ke;  Xu, Shibiao;  Meng, Weiliang;  Zhang, Yuyang;  Fan, Bin;  Zhang, Xiaopeng
Adobe PDF(8075Kb)  |  收藏  |  浏览/下载:165/7  |  提交时间:2023/11/17
Local features detection and description  consistent attention mechanism  context augmentation  lightweight local descriptors  knowledge distillation