CASIA OpenIR

浏览/检索结果: 共29条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:27/14  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 卷号: 15, 期号: 1, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:62/18  |  提交时间:2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
Soccer player tracking and data correction based on attention with full-field videos 期刊论文
VISUAL COMPUTER, 2024, 页码: 13
作者:  Yang, Chao;  Yang, Meng;  Li, Hongyu;  Jiang, Linlu;  Suo, Xiang;  Li, Zhen;  Meng, Weiliang;  Mao, Lijuan
Adobe PDF(8335Kb)  |  收藏  |  浏览/下载:60/21  |  提交时间:2024/05/30
Soccer player tracking  Data correction  Field mapping  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:98/26  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
SpatioTemporal Inference Network for Precipitation Nowcasting With Multimodal Fusion 期刊论文
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 卷号: 17, 页码: 1299-1314
作者:  Jin, Qizhao;  Zhang, Xinbang;  Xiao, Xinyu;  Wang, Ying;  Meng, Gaofeng;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(8766Kb)  |  收藏  |  浏览/下载:85/8  |  提交时间:2024/02/21
Data mining  multimodal knowledge discovery  precipitation nowcasting  
A Graded Assessment System for Parkinsons Upper-Limb Bradykinesia Based on a Temporal Convolutional Network Model 期刊论文
IEEE SENSORS JOURNAL, 2023, 卷号: 23, 期号: 23, 页码: 29283-29292
作者:  Tong, Lina;  Liu, Dai-Song;  Peng, Liang;  Hao, Hong-Lin;  Wang, Chen
Adobe PDF(9425Kb)  |  收藏  |  浏览/下载:74/9  |  提交时间:2024/02/21
Bradykinesia grade  inertial sensors  Parkinson's disease (PD)  temporal convolutional network (TCN)  wearable device  
Advancements in Humanoid Robots: A Comprehensive Review and Future Prospects 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 2, 页码: 301-328
作者:  Yuchuang Tong;  Haotian Liu;  Zhengtao Zhang
Adobe PDF(7587Kb)  |  收藏  |  浏览/下载:149/38  |  提交时间:2024/01/23
Future trends and challenges  humanoid robots  human-robot interaction  key technologies  potential applications  
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:176/27  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Toward Accurate and Efficient Road Extraction by Leveraging the Characteristics of Road Shapes 期刊论文
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 卷号: 61, 页码: 16
作者:  Wang, Changwei;  Xu, Rongtao;  Xu, Shibiao;  Meng, Weiliang;  Wang, Ruisheng;  Zhang, Jiguang;  Zhang, Xiaopeng
Adobe PDF(13777Kb)  |  收藏  |  浏览/下载:157/16  |  提交时间:2023/11/17
Efficient and accurate road extraction  efficient strip transformer module (ESTM)  geometric deformation estimation module (GDEM)  road edge focal loss (REF loss)  road shape-aware network (RSANet)  
sEMG-Based Gesture Recognition Method for Coal Mine Inspection Manipulator Using Multistream CNN 期刊论文
IEEE SENSORS JOURNAL, 2023, 卷号: 23, 期号: 10, 页码: 11082-11090
作者:  Tong, Lina;  Zhang, Mingjia;  Ma, Hanghang;  Wang, Chen;  Peng, Liang
Adobe PDF(2841Kb)  |  收藏  |  浏览/下载:129/21  |  提交时间:2023/11/17
Sensors  Muscles  Inspection  Coal mining  Robots  Feature extraction  Gesture recognition  Coal mine inspection manipulator  gestures recognition  multistream convolutional neural network (CNN)  surface electromyography (sEMG)  time--frequency graph feature