CASIA OpenIR

浏览/检索结果: 共737条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 6906-6916
作者:  Wang, Wenxuan;  He, Xingjian;  Zhang, Yisi;  Guo, Longteng;  Shen, Jiachen;  Li, Jiangyun;  Liu, Jing
收藏  |  浏览/下载:3/0  |  提交时间:2024/07/03
Referring image segmentation  cross-modality guidance  masked self-distillation  vision and language  
Sora for Social Vision With Parallel Intelligence: Social Interaction in Intelligent Vehicles 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 卷号: 9, 期号: 3, 页码: 4240-4243
作者:  Yu, Hui;  Liang, Wei;  Fan, Lili;  Wang, Yutong;  Wang, Fei-Yue
收藏  |  浏览/下载:2/0  |  提交时间:2024/07/03
Intelligent vehicles  Computational modeling  Transformers  Computer vision  Visualization  Human-vehicle systems  Human computer interaction  Sora  parallel intelligence  social vision  social interaction  intelligent Vehicles  diffusion model  human-machine interaction  
Comment-Context Dual Collaborative Masked Transformer Network for Fake News Detection 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 5170-5180
作者:  Wang, Jinguang;  Qian, Shengsheng;  Hu, Jun;  Hong, Richang
收藏  |  浏览/下载:14/0  |  提交时间:2024/07/03
Fake news detection  multi-modal learning  social media  
When Does Sora Show: The Beginning of TAO to Imaginative Intelligence and Scenarios Engineering 期刊论文
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2024, 卷号: 11, 期号: 4, 页码: 809-815
作者:  Wang, Fei-Yue;  Miao, Qinghai;  Li, Lingxi;  Ni, Qinghua;  Li, Xuan;  Li, Juanjuan;  Fan, Lili;  Tian, Yonglin;  Han, Qing-Long
收藏  |  浏览/下载:5/0  |  提交时间:2024/07/03
Chatbots  Training  Computational modeling  Adaptation models  Spatiotemporal phenomena  Image synthesis  Text-to-Image  Text-to-video  
SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 3469-3480
作者:  Peng, Fang;  Yang, Xiaoshan;  Xiao, Linhui;  Wang, Yaowei;  Xu, Changsheng
收藏  |  浏览/下载:6/0  |  提交时间:2024/07/03
Few-shot  image classification  vision-language models  
Semantic Distance Adversarial Learning for Text-to-Image Synthesis 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 1255-1266
作者:  Yuan, Bowen;  Sheng, Yefei;  Bao, Bing-Kun;  Chen, Yi-Ping Phoebe;  Xu, Changsheng
收藏  |  浏览/下载:9/0  |  提交时间:2024/07/03
Text-to-image synthesis  adversarial learning  cycle consistency  
DiffStyler: Controllable Dual Diffusion for Text-Driven Image Stylization 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 14
作者:  Huang, Nisha;  Zhang, Yuxin;  Tang, Fan;  Ma, Chongyang;  Huang, Haibin;  Dong, Weiming;  Xu, Changsheng
收藏  |  浏览/下载:10/0  |  提交时间:2024/07/03
Arbitrary image stylization  diffusion  textual guidance  neural network applications  
Memory-Adaptive Vision-and-Language Navigation 期刊论文
Pattern Recognition, 2024, 卷号: 153, 页码: 110511
作者:  Keji He;  Ya Jing;  Yan Huang;  Zhihe Lu;  Dong An;  Liang Wang
Adobe PDF(3831Kb)  |  收藏  |  浏览/下载:38/15  |  提交时间:2024/06/26
Vision-and-Language Navigation  Memory bank  History noises  Memory-Adaptive Model  
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:24/12  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
GFFNet: Global Feature Fusion Network for Semantic Segmentation of Large-Scale Remote Sensing Images 期刊论文
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2024, 卷号: 17, 期号: 2024, 页码: 4222 - 4234
作者:  Cao, Yong;  Huo, Chunlei;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(4340Kb)  |  收藏  |  浏览/下载:20/4  |  提交时间:2024/06/25
Cross feature fusion (CFF)  global context learning  group transformer  semantic segmentation