CASIA OpenIR

Browse/Search results: 20 items in total, showing items 1-10

Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene  |  Journal article
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, Volume: 20, Issue: 2, Pages: 19
Authors: You, Sisi; Zuo, Yukun; Yao, Hantao; Xu, Changsheng
Views/Downloads: 55/0  |  Submitted: 2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation  |  Journal article
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 1-13
Authors: Liu, Jiawei; Wang, Weining; Chen, Sihan; Zhu, Xinxin; Liu, Jing
Adobe PDF (7741 KB)  |  Views/Downloads: 117/20  |  Submitted: 2023/05/03
Text-guided sounding-video generation  Video-audio representation  Contrastive learning  Transformer
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training  |  Journal article
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 3622-3633
Authors: Liu, Chong; Zhang, Yuqi; Wang, Hongsong; Chen, Weihua; Wang, Fan; Huang, Yan; Shen, Yi-Dong; Wang, Liang
Views/Downloads: 89/0  |  Submitted: 2023/11/17
Image-text retrieval  multimodal transformer  multimodal contrastive training
Multi-View Multi-Label Fine-Grained Emotion Decoding From Human Brain Activity  |  Journal article
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 15
Authors: Fu, Kaicheng; Du, Changde; Wang, Shengpei; He, Huiguang
Adobe PDF (4570 KB)  |  Views/Downloads: 242/61  |  Submitted: 2022/12/27
Decoding  Brain modeling  Functional magnetic resonance imaging  Predictive models  Emotion recognition  Dimensionality reduction  Pattern recognition  Fine-grained emotion decoding  multi-label learning  multi-view learning  product of experts (PoEs)  variational autoencoder  
Patient-level grading prediction of prostate cancer from mp-MRI via GMINet  |  Journal article
COMPUTERS IN BIOLOGY AND MEDICINE, 2022, Volume: 150, Pages: 10
Authors: Shao, Lizhi; Liu, Zhenyu; Liu, Jiangang; Yan, Ye; Sun, Kai; Liu, Xiangyu; Lu, Jian; Tian, Jie
Views/Downloads: 176/0  |  Submitted: 2022/11/21
mp-MRI  Prostate cancer  Grade group  Patient-level prediction  Deep learning  
Tell, Imagine, and Search: End-to-end Learning for Composing Text and Image to Image Retrieval  |  Journal article
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, Volume: 18, Issue: 2, Pages: 23
Authors: Zhang, Feifei; Xu, Mingliang; Xu, Changsheng
Views/Downloads: 206/0  |  Submitted: 2022/06/10
Composing text and image to image retrieval  end-to-end  image generation  generative adversarial network  global-local  
Geometry Sensitive Cross-Modal Reasoning for Composed Query Based Image Retrieval  |  Journal article
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 1000-1011
Authors: Zhang, Feifei; Xu, Mingliang; Xu, Changsheng
Views/Downloads: 225/0  |  Submitted: 2022/02/16
Visualization  Image retrieval  Semantics  Cognition  Geometry  Task analysis  Electronic mail  Composed query based image retrieval  semantic gap  spatial structure  inter-modal attention  text-guided visual reasoning  
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering  |  Journal article
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 0
Authors: Liu, Fei; Liu, Jing; Hong, Richang; Lu, Hanqing
Adobe PDF (3550 KB)  |  Views/Downloads: 303/75  |  Submitted: 2022/01/27
video question answering  attention mechanism  metric learning  
Data Augmented Deep Behavioral Cloning for Urban Traffic Control Operations Under a Parallel Learning Framework  |  Journal article
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, Volume: 23, Issue: 6, Pages: 5128-5137
Authors: Li, Xiaoshuang; Ye, Peijun; Jin, Junchen; Zhu, Fenghua; Wang, Fei-Yue
Adobe PDF (2319 KB)  |  Views/Downloads: 277/53  |  Submitted: 2022/01/27
Generative adversarial networks  Data models  Gallium nitride  Task analysis  Complex systems  Intelligent traffic signal operations  deep behavioral cloning  
Learning Aligned Image-Text Representations Using Graph Attentive Relational Network  |  Journal article
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, Volume: 30, Pages: 1840-1852
Authors: Jing, Ya; Wang, Wei; Wang, Liang; Tan, Tieniu
Adobe PDF (4532 KB)  |  Views/Downloads: 310/50  |  Submitted: 2021/03/08
Graph neural networks  Visualization  Semantics  Task analysis  Feature extraction  Annotations  Recurrent neural networks  Image-text matching  cross-modal retrieval  person search  graph neural network