CASIA OpenIR

Browse/Search Results: 59 items in total, showing items 1-10

Modal Contrastive Learning Based End-to-End Text Image Machine Translation (Journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, Volume: 32, Issue: 32, Pages: 2153-2165
Authors: Ma, Cong; Han, Xu; Wu, Linghui; Zhang, Yaping; Zhao, Yang; Zhou, Yu; Zong, Chengqing
Adobe PDF(6551Kb)  |  Views/Downloads: 31/16  |  Submitted: 2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Dual Stream Fusion Network for Multi-spectral High Resolution Remote Sensing Image Segmentation (Conference paper)
Zhuhai, 2021.12.19-2021.12.21
Authors: Cao, Yong; Shi, Yiwen; Liu, Yiwei; Huo, Chunlei; Xiang, Shiming; Pan, Chunhong
Adobe PDF(1703Kb)  |  Views/Downloads: 28/12  |  Submitted: 2024/06/25
Semantic segmentation  Remote sensing  Stream fusion  
Differentiable Convolution Search for Point Cloud Processing (Conference paper)
Montreal, Canada, October 10-17, 2021
Authors: Xing Nie; Yongcheng Liu; Shaohong Chen; Jianlong Chang; Chunlei Huo; Gaofeng Meng; Qi Tian; Weiming Hu; Chunhong Pan
Adobe PDF(1249Kb)  |  Views/Downloads: 34/12  |  Submitted: 2024/06/24
GraphMLLM: A Graph-based Multi-level Layout Language-independent Model for Document Understanding (Conference paper)
Athens, Greece, 2024-09
Authors: He-Sen Dai; Xiao-Hui Li; Fei Yin; Xudong Yan; Shuqi Mei; Cheng-Lin Liu
Adobe PDF(967Kb)  |  Views/Downloads: 52/12  |  Submitted: 2024/06/05
Visual information extraction  Self-supervised pre-training  Multi-level page layouts  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis (Journal article)
IEEE Transactions on Affective Computing, 2023, Volume: 15, Issue: 1, Pages: 1-17
Authors: Licai Sun; Zheng Lian; Bin Liu; Jianhua Tao
Adobe PDF(2371Kb)  |  Views/Downloads: 65/18  |  Submitted: 2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Issue: 0, Pages: 1045-1058
Authors: Changwei Wang; Rongtao Xu; Shibiao Xu; Weiliang Meng; Ruisheng Wang; Xiaopeng Zhang
Adobe PDF(3269Kb)  |  Views/Downloads: 67/23  |  Submitted: 2024/05/29
Weakly supervised object localization  intrinsic discrimination and consistency  deep metric learning  geometric transformation consistency  
Robotics Dexterous Grasping: The Methods Based on Point Cloud and Deep Learning (Journal article)
Frontiers in Neurorobotics, 2021, Volume: 15, Pages: 658280
Authors: Duan, Haonan; Wang, Peng; Huang, Yayu; Xu, Guangyun; Wei, Wei; Shen, Xiaofei
Adobe PDF(3145Kb)  |  Views/Downloads: 42/14  |  Submitted: 2024/05/29
Robotics  Dexterous grasping  Point Cloud  Deep learning  
DomainFeat: Learning Local Features With Domain Adaptation (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 1, Pages: 46-59
Authors: Xu, Rongtao; Wang, Changwei; Xu, Shibiao; Meng, Weiliang; Zhang, Yuyang; Fan, Bin; Zhang, Xiaopeng
Adobe PDF(6039Kb)  |  Views/Downloads: 93/13  |  Submitted: 2024/03/26
Feature extraction  Location awareness  Visualization  Robustness  Image matching  Detectors  Decoding  Local features  domain adaptation  cross-domain data  consistency loss  
Multi-Correlation Siamese Transformer Network With Dense Connection for 3D Single Object Tracking (Journal article)
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, Volume: 8, Issue: 12, Pages: 8066-8073
Authors: Feng, Shihao; Liang, Pengpeng; Gao, Jin; Cheng, Erkang
Adobe PDF(2745Kb)  |  Views/Downloads: 130/9  |  Submitted: 2023/12/21
3D object tracking  Point cloud  Transformer  
SignParser: An End-to-End Framework for Traffic Sign Understanding (Journal article)
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, Volume: 132, Issue: 2, Pages: 805-821
Authors: Guo, Yunfei; Feng, Wei; Yin, Fei; Liu, Cheng-Lin
Adobe PDF(7011Kb)  |  Views/Downloads: 133/7  |  Submitted: 2023/12/21
Traffic sign understanding  Content reasoning  Semantic description generation