CASIA OpenIR

Browse/search results: 26 items in total, showing items 1-10

Attribute-Guided Cross-Modal Interaction and Enhancement for Audio-Visual Matching (Journal article)
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, Volume: 19, Pages: 4986-4998
Authors:  Wang, Jiaxiang;  Zheng, Aihua;  Yan, Yan;  He, Ran;  Tang, Jin
Views/Downloads: 5/0  |  Submitted: 2024/07/03
Audio-visual cross-modal matching  attribute-guided cross-modal interaction  attribute-guided cross-modal enhancement  
Multi-Stage Image-Language Cross-Generative Fusion Network for Video-Based Referring Expression Comprehension (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 3256-3270
Authors:  Zhang, Yujia;  Li, Qianzhong;  Pan, Yi;  Zhao, Xiaoguang;  Tan, Min
Views/Downloads: 9/0  |  Submitted: 2024/07/03
Feature extraction  Visualization  Task analysis  Representation learning  Location awareness  Linguistics  Grounding  Video-based referring expression comprehension  multi-stage learning  image-language cross-generative fusion  consistency loss  
SASOD: Saliency-Aware Ship Object Detection in High-Resolution Optical Images (Journal article)
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, Volume: 62, Pages: 15
Authors:  Ren, Zhida;  Tang, Yongqiang;  Yang, Yang;  Zhang, Wensheng
Adobe PDF (5807 KB)  |  Views/Downloads: 19/2  |  Submitted: 2024/07/03
ship detection  Saliency detection  high-resolution optical images  remote sensing  Deep learning  Feature extraction  Marine vehicles  Object detection  Remote sensing  Optical sensors  Optical imaging  saliency detection  
Modal Contrastive Learning Based End-to-End Text Image Machine Translation (Journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, Volume: 32, Issue: 32, Pages: 2153-2165
Authors:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF (6551 KB)  |  Views/Downloads: 31/16  |  Submitted: 2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Extracting Muscle Geometrical Features With a Fabric-Based Wearable Sensor for Human Motion Intent Recognition (Journal article)
IEEE/ASME Transactions on Mechatronics, 2024, Pages: 1-11
Authors:  Zheng EH(郑恩昊);  Jiacheng Wan;  Nanxing Hu;  Qining Wang
Adobe PDF (2867 KB)  |  Views/Downloads: 31/14  |  Submitted: 2024/06/25
Sensors  Robot sensing systems  Muscles  Wearable sensors  Shape  Force  Feature extraction  Fabric-based sensors  human motion recognition  muscle features  wearable sensing  
A Portable Robot-Assisted Device With Built-In Intelligence for Autonomous Ultrasound Acquisitions in Follow-Up Diagnosis (Journal article)
IEEE Transactions on Instrumentation and Measurement, 2024, Volume: 73, Pages: 1-10
Authors:  Deng ZK(邓兆锟);  Hou XL(侯西龙);  Chen C(陈晨);  Gu XL(谷晓林);  Hou ZG(侯增广);  Wang SY(王双翌)
Adobe PDF (6984 KB)  |  Views/Downloads: 24/11  |  Submitted: 2024/06/25
Robots  Ultrasonic imaging  Probes  Robot sensing systems  Robot kinematics  Force  Safety  Autonomous US acquisition  medical ultrasound (US)  reinforcement learning (RL)  US robotic device  
Molecular Contrastive Pretraining with Collaborative Featurizations (Journal article)
Journal of Chemical Information and Modeling (JCIM), 2024, Volume: 64, Issue: 4, Pages: 1112-1122
Authors:  Yanqiao Zhu;  Dingshuo Chen;  Yuanqi Du;  Yingze Wang;  Qiang Liu;  Shu Wu
Adobe PDF (1868 KB)  |  Views/Downloads: 25/9  |  Submitted: 2024/06/21
Diff-pcg: diffusion point cloud generation conditioned on continuous normalizing flow (Journal article)
The Visual Computer, 2024, Pages: 1-15
Authors:  Yu T(余挺);  Meng WL(孟维亮);  Wu ZQ(吴仲琦);  Guo JW(郭建伟);  Zhang XP(张晓鹏)
Adobe PDF (2471 KB)  |  Views/Downloads: 36/10  |  Submitted: 2024/06/11
3D shape generation  Diffusion model  Continuous normalizing flow  Point cloud  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis (Journal article)
IEEE Transactions on Affective Computing, 2023, Volume: 15, Issue: 1, Pages: 1-17
Authors:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF (2371 KB)  |  Views/Downloads: 66/18  |  Submitted: 2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
Soccer player tracking and data correction based on attention with full-field videos (Journal article)
VISUAL COMPUTER, 2024, Pages: 13
Authors:  Yang, Chao;  Yang, Meng;  Li, Hongyu;  Jiang, Linlu;  Suo, Xiang;  Li, Zhen;  Meng, Weiliang;  Mao, Lijuan
Adobe PDF (8335 KB)  |  Views/Downloads: 65/22  |  Submitted: 2024/05/30
Soccer player tracking  Data correction  Field mapping