CASIA OpenIR

浏览/检索结果: 共37条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:20/9  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Health and Senior Care Video Moment Localization With Procedure Knowledge Distillation 会议论文
, Istanbul, Turkiye, Dec 5-8
作者:  Chaochen Wu;  Meiyun Zuo;  Guan Luo;  Yuna Jiang
Adobe PDF(3140Kb)  |  收藏  |  浏览/下载:35/15  |  提交时间:2024/06/05
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 卷号: 15, 期号: 1, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:54/17  |  提交时间:2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
Hierarchical Attention Network for Open-Set Fine-Grained Recognition 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 页码: 1-14
作者:  Jiayin, Sun;  Hong, Wang;  Qiulei, Dong
Adobe PDF(2596Kb)  |  收藏  |  浏览/下载:47/14  |  提交时间:2024/05/28
Pixel-Wise Grasp Detection via Twin Deconvolution and Multi-Dimensional Attention 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 卷号: 33, 期号: 8, 页码: 4002-4010
作者:  Ren, Guangli;  Geng, Wenjie;  Guan, Peiyu;  Cao, Zhiqiang;  Yu, Junzhi
Adobe PDF(4013Kb)  |  收藏  |  浏览/下载:33/11  |  提交时间:2024/05/28
Domain adaptive object detection with model-agnostic knowledge transferring 期刊论文
Neural Networks, 2023, 页码: 213-227
作者:  Tian Kun;  Zhang Chenghao;  Wang Ying;  Xiang Shiming
Adobe PDF(3116Kb)  |  收藏  |  浏览/下载:13/8  |  提交时间:2024/05/28
Transformer-based Spiking Neural Networks for Multimodal Audio-Visual Classification 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2023, 页码: DOI 10.1109/TCDS.2023.3327081
作者:  Guo LY(郭凌月);  Zeyu Gao;  Jinye Qu;  Suiwu Zheng;  Runhao Jiang;  Yanfeng Lu;  Hong Qiao
Adobe PDF(3922Kb)  |  收藏  |  浏览/下载:38/11  |  提交时间:2024/05/28
Towards Prior Gap and Representation Gap for Long-tailed Recognition, Pattern Recognition 期刊论文
Pattern Recognition, 2023, 卷号: 133, 期号: 109012, 页码: 109012
作者:  Zhang Ming-Liang;  Zhang Xu-Yao;  Wang Chang;  Liu Cheng-Lin
Adobe PDF(2258Kb)  |  收藏  |  浏览/下载:113/27  |  提交时间:2024/04/03
Long-tailed learning  Prior gap  Representation gap  Image recognition  
TwinTex: Geometry-aware Texture Generation for Abstracted 3D Architectural Models 期刊论文
ACM TRANSACTIONS ON GRAPHICS, 2023, 卷号: 42, 期号: 6, 页码: 14
作者:  Xiong, Weidan;  Zhang, Hongqian;  Peng, Botao;  Hu, Ziyu;  Wu, Yongli;  Guo, Jianwei;  Huang, Hui
Adobe PDF(5835Kb)  |  收藏  |  浏览/下载:51/3  |  提交时间:2024/03/26
Texture Mapping  3D Architectural Proxy  View Selection  Image Stitching  Texture Optimization  Diffusion Model  
General vs. Long-Tailed Age Estimation: An Approach to Kill Two Birds With One Stone 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 6155-6167
作者:  Bao, Zenghao;  Tan, Zichang;  Li, Jun;  Wan, Jun;  Ma, Xibo;  Lei, Zhen
Adobe PDF(1634Kb)  |  收藏  |  浏览/下载:52/2  |  提交时间:2024/02/22
General age estimation  long-tailed age estimation  class-wise mean absolute error