CASIA OpenIR

浏览/检索结果: 共418条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Conditional visibility aware view synthesis via parallel light fields 期刊论文
NEUROCOMPUTING, 2024, 卷号: 588, 页码: 13
作者:  Shen, Yu;  Li, Yuke;  Liu, Yuhang;  Wang, Yutong;  Chen, Long;  Wang, Fei-Yue
Adobe PDF(3348Kb)  |  收藏  |  浏览/下载:22/3  |  提交时间:2024/07/04
Parallel theory  Light fields  Neural rendering  View synthesis  Conditional visibility  Normalizing Flow  
Attribute-Guided Cross-Modal Interaction and Enhancement for Audio-Visual Matching 期刊论文
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 卷号: 19, 页码: 4986-4998
作者:  Wang, Jiaxiang;  Zheng, Aihua;  Yan, Yan;  He, Ran;  Tang, Jin
收藏  |  浏览/下载:8/0  |  提交时间:2024/07/03
Audio-visual cross-modal matching  attribute-guided cross-modal interaction  attribute-guided cross-modal enhancement  
Multi-Stage Image-Language Cross-Generative Fusion Network for Video-Based Referring Expression Comprehension 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 3256-3270
作者:  Zhang, Yujia;  Li, Qianzhong;  Pan, Yi;  Zhao, Xiaoguang;  Tan, Min
收藏  |  浏览/下载:14/0  |  提交时间:2024/07/03
Feature extraction  Visualization  Task analysis  Representation learning  Location awareness  Linguistics  Grounding  Video-based referring expression comprehension  multi-stage learning  image-language cross-generative fusion  consistency loss  
Comprehensive Attribute Prediction Learning for Person Search by Language 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1990-2003
作者:  Niu, Kai;  Huang, Linjiang;  Long, Yuzhou;  Huang, Yan;  Wang, Liang;  Zhang, Yanning
收藏  |  浏览/下载:14/0  |  提交时间:2024/07/03
Person search by language  cross-modal retrieval  smart video surveillance  attribute prediction  
SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 3469-3480
作者:  Peng, Fang;  Yang, Xiaoshan;  Xiao, Linhui;  Wang, Yaowei;  Xu, Changsheng
收藏  |  浏览/下载:13/0  |  提交时间:2024/07/03
Few-shot  image classification  vision-language models  
DiffStyler: Controllable Dual Diffusion for Text-Driven Image Stylization 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 14
作者:  Huang, Nisha;  Zhang, Yuxin;  Tang, Fan;  Ma, Chongyang;  Huang, Haibin;  Dong, Weiming;  Xu, Changsheng
收藏  |  浏览/下载:19/0  |  提交时间:2024/07/03
Arbitrary image stylization  diffusion  textual guidance  neural network applications  
Memory-Adaptive Vision-and-Language Navigation 期刊论文
Pattern Recognition, 2024, 卷号: 153, 页码: 110511
作者:  Keji He;  Ya Jing;  Yan Huang;  Zhihe Lu;  Dong An;  Liang Wang
Adobe PDF(3831Kb)  |  收藏  |  浏览/下载:63/24  |  提交时间:2024/06/26
Vision-and-Language Navigation  Memory bank  History noises  Memory-Adaptive Model  
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:43/20  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
A Portable Robot-Assisted Device With Built-In Intelligence for Autonomous Ultrasound Acquisitions in Follow-Up Diagnosis 期刊论文
IEEE Transactions on Instrumentation and Measurement, 2024, 卷号: 73, 页码: 1-10
作者:  Deng ZK(邓兆锟);  Hou XL(侯西龙);  Chen C(陈晨);  Gu XL(谷晓林);  Hou ZG(侯增广);  Wang SY(王双翌)
Adobe PDF(6984Kb)  |  收藏  |  浏览/下载:44/17  |  提交时间:2024/06/25
Robots  Ultrasonic imaging  Probes  Robot sensing systems  Robot kinematics  Force  Safety  Autonomous US acquisition  medical ultrasound (US)  reinforcement learning (RL)  US robotic device  
GFFNet: Global Feature Fusion Network for Semantic Segmentation of Large-Scale Remote Sensing Images 期刊论文
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2024, 卷号: 17, 期号: 2024, 页码: 4222 - 4234
作者:  Cao, Yong;  Huo, Chunlei;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(4340Kb)  |  收藏  |  浏览/下载:35/8  |  提交时间:2024/06/25
Cross feature fusion (CFF)  global context learning  group transformer  semantic segmentation