CASIA OpenIR

浏览/检索结果: 共21条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Multi-Stage Image-Language Cross-Generative Fusion Network for Video-Based Referring Expression Comprehension 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 3256-3270
作者:  Zhang, Yujia;  Li, Qianzhong;  Pan, Yi;  Zhao, Xiaoguang;  Tan, Min
收藏  |  浏览/下载:11/0  |  提交时间:2024/07/03
Feature extraction  Visualization  Task analysis  Representation learning  Location awareness  Linguistics  Grounding  Video-based referring expression comprehension  multi-stage learning  image-language cross-generative fusion  consistency loss  
SASOD: Saliency-Aware Ship Object Detection in High-Resolution Optical Images 期刊论文
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 卷号: 62, 页码: 15
作者:  Ren, Zhida;  Tang, Yongqiang;  Yang, Yang;  Zhang, Wensheng
Adobe PDF(5807Kb)  |  收藏  |  浏览/下载:28/4  |  提交时间:2024/07/03
ship detection  Saliency detection  high-resolution optical images  remote sensing  Deep learning  Feature extraction  Marine vehicles  Object detection  Remote sensing  Optical sensors  Optical imaging  saliency detection  
Exploring Rich Semantics for Open-Set Action Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 5410-5421
作者:  Hu, Yufan;  Gao, Junyu;  Dong, Jianfeng;  Fan, Bin;  Liu, Hongmin
收藏  |  浏览/下载:11/0  |  提交时间:2024/07/03
Semantics  Prototypes  Knowledge graphs  Visualization  Task analysis  Uncertainty  Training  Open-set action recognition  video action recognition  semantic relation modeling  
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:42/20  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Extracting Muscle Geometrical Features With a Fabric-Based Wearable Sensor for Human Motion Intent Recognition 期刊论文
IEEE/ASME Transactions on Mechatronics, 2024, 页码: 1-11
作者:  Zheng EH(郑恩昊);  Jiacheng Wan;  Nanxing Hu;  Qining Wang
Adobe PDF(2867Kb)  |  收藏  |  浏览/下载:38/19  |  提交时间:2024/06/25
Sensors  Robot sensing systems  Muscles  Wearable sensors  Shape  Force  Feature extraction  Fabric-based sensors  human motion recognition  muscle features  wearable sensing  
A Portable Robot-Assisted Device With Built-In Intelligence for Autonomous Ultrasound Acquisitions in Follow-Up Diagnosis 期刊论文
IEEE Transactions on Instrumentation and Measurement, 2024, 卷号: 73, 页码: 1-10
作者:  Deng ZK(邓兆锟);  Hou XL(侯西龙);  Chen C(陈晨);  Gu XL(谷晓林);  Hou ZG(侯增广);  Wang SY(王双翌)
Adobe PDF(6984Kb)  |  收藏  |  浏览/下载:39/16  |  提交时间:2024/06/25
Robots  Ultrasonic imaging  Probes  Robot sensing systems  Robot kinematics  Force  Safety  Autonomous US acquisition  medical ultrasound (US)  reinforcement learning (RL)  US robotic device  
Diff-pcg: diffusion point cloud generation conditioned on continuous normalizing flow 期刊论文
The Visual Computer, 2024, 页码: 1-15
作者:  Yu T(余挺);  Meng WL(孟维亮);  Wu ZQ(吴仲琦);  Guo JW(郭建伟);  Zhang XP(张晓鹏)
Adobe PDF(2471Kb)  |  收藏  |  浏览/下载:44/13  |  提交时间:2024/06/11
3D shape generation  Diffusion model  Continuous normalizing flow  Point cloud  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 卷号: 15, 期号: 1, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:74/21  |  提交时间:2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 4334-4347
作者:  Xiao, Linhui;  Yang, Xiaoshan;  Peng, Fang;  Yan, Ming;  Wang, Yaowei;  Xu, Changsheng
收藏  |  浏览/下载:36/0  |  提交时间:2024/05/30
Grounding  Reliability  Adaptation models  Task analysis  Visualization  Data models  Annotations  Visual grounding  curriculum learning  pseudo-language label  and vision-language models  
Soccer player tracking and data correction based on attention with full-field videos 期刊论文
VISUAL COMPUTER, 2024, 页码: 13
作者:  Yang, Chao;  Yang, Meng;  Li, Hongyu;  Jiang, Linlu;  Suo, Xiang;  Li, Zhen;  Meng, Weiliang;  Mao, Lijuan
Adobe PDF(8335Kb)  |  收藏  |  浏览/下载:75/26  |  提交时间:2024/05/30
Soccer player tracking  Data correction  Field mapping