CASIA OpenIR

Browse/Search results: 93 items in total, showing items 1-10

Modal Contrastive Learning Based End-to-End Text Image Machine Translation (Journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, Volume: 32, Issue: 32, Pages: 2153-2165
Authors: Ma, Cong; Han, Xu; Wu, Linghui; Zhang, Yaping; Zhao, Yang; Zhou, Yu; Zong, Chengqing
Adobe PDF (6551Kb) | Views/Downloads: 28/14 | Submitted: 2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Diff-pcg: diffusion point cloud generation conditioned on continuous normalizing flow (Journal article)
The Visual Computer, 2024, Pages: 1-15
Authors: Yu T(余挺); Meng WL(孟维亮); Wu ZQ(吴仲琦); Guo JW(郭建伟); Zhang XP(张晓鹏)
Adobe PDF (2471Kb) | Views/Downloads: 36/10 | Submitted: 2024/06/11
3D shape generation  Diffusion model  Continuous normalizing flow  Point cloud  
The survey on multi-source data fusion in cyber-physical-social systems: Foundational infrastructure for industrial metaverses and industries 5.0 (Journal article)
Information Fusion, 2024, Volume: 107, Pages: 1-16
Authors: Xiao Wang; Yutong Wang; Jing Yang; Xiaofeng Jia; Lijun Li; Weiping Ding; Fei-Yue Wang
Adobe PDF (4446Kb) | Views/Downloads: 46/7 | Submitted: 2024/06/06
Multi-source data fusion  CPSS  Industrial metaverses  Parallel manufacturing  Social manufacturing  
Tri-relational multi-faceted graph neural networks for automatic question tagging (Journal article)
Neurocomputing, 2024, Volume: 576, Pages: 127250
Authors: Nuojia Xu; Jun Hu; Quan Fang; Dizhan Xue; Yongxi Li; Shengsheng Qian
Adobe PDF (2105Kb) | Views/Downloads: 43/20 | Submitted: 2024/06/04
Graph Neural Networks  Community Question Answering  Question Tagging  
SocialVis: Dynamic Social Visualization in Dense Scenes via Real-time Multi-Object Tracking and Proximity Graph Construction (Journal article)
Computer Animation and Virtual Worlds, 2024, Volume: 35, Issue: 3, Pages: 1-15
Authors: Li BW(李博文); Li W(李巍); Wang JQ(王镜淇); Meng WL(孟维亮); Zhang JG(张吉光); Zhang XP(张晓鹏)
Adobe PDF (2914Kb) | Views/Downloads: 42/7 | Submitted: 2024/06/04
dense pedestrian  detection  multi-object tracking  proximity graph  visualization  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis (Journal article)
IEEE Transactions on Affective Computing, 2023, Volume: 15, Issue: 1, Pages: 1-17
Authors: Licai Sun; Zheng Lian; Bin Liu; Jianhua Tao
Adobe PDF (2371Kb) | Views/Downloads: 64/18 | Submitted: 2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
Soccer player tracking and data correction based on attention with full-field videos (Journal article)
The Visual Computer, 2024, Pages: 13
Authors: Yang, Chao; Yang, Meng; Li, Hongyu; Jiang, Linlu; Suo, Xiang; Li, Zhen; Meng, Weiliang; Mao, Lijuan
Adobe PDF (8335Kb) | Views/Downloads: 61/22 | Submitted: 2024/05/30
Soccer player tracking  Data correction  Field mapping  
Generalized Feature Learning for Detection of Novel Objects (Journal article)
IEEE Transactions on Cognitive and Developmental Systems, 2024, Volume: 16, Issue: 1, Pages: 388-395
Authors: Jierui Liu; Xilong Liu; Zhiqiang Cao; Junzhi Yu; Min Tan
Adobe PDF (8844Kb) | Views/Downloads: 38/11 | Submitted: 2024/05/28
Object detection  few-shot  generalized features  source aggregation  
Bird's-Eye-View Semantic Segmentation With Two-Stream Compact Depth Transformation and Feature Rectification (Journal article)
IEEE Transactions on Intelligent Vehicles, 2023, Volume: 8, Issue: 11, Pages: 4546-4558
Authors: Liu, Jierui; Cao, Zhiqiang; Yang, Jing; Liu, Xilong; Yang, Yuequan; Qu, Zhiyou
Adobe PDF (21890Kb) | Views/Downloads: 82/13 | Submitted: 2024/03/27
Bird's-eye-view  semantic segmentation  two-stream compact depth transformation  feature rectification  
DomainFeat: Learning Local Features With Domain Adaptation (Journal article)
IEEE Transactions on Circuits and Systems for Video Technology, 2024, Volume: 34, Issue: 1, Pages: 46-59
Authors: Xu, Rongtao; Wang, Changwei; Xu, Shibiao; Meng, Weiliang; Zhang, Yuyang; Fan, Bin; Zhang, Xiaopeng
Adobe PDF (6039Kb) | Views/Downloads: 92/13 | Submitted: 2024/03/26
Feature extraction  Location awareness  Visualization  Robustness  Image matching  Detectors  Decoding  Local features  domain adaptation  cross-domain data  consistency loss