CASIA OpenIR

Browse/Search results: 33 items in total, showing items 1-10

Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene (Journal article)
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, Volume: 20, Issue: 2, Pages: 19
Authors:  You, Sisi;  Zuo, Yukun;  Yao, Hantao;  Xu, Changsheng
Views/Downloads: 56/0  |  Submitted: 2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
GAN-Based Facial Attribute Manipulation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 12, Pages: 14590-14610
Authors:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF (15297 KB)  |  Views/Downloads: 33/10  |  Submitted: 2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation (Journal article)
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 1-13
Authors:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF (7741 KB)  |  Views/Downloads: 117/20  |  Submitted: 2023/05/03
Text-guided sounding-video generation  Video-audio representation  Contrastive learning  Transformer  
Auditory Feature Driven Model Predictive Control for Sound Source Approaching (Journal article)
International Journal of Control, Automation, and Systems, 2023, Volume: 22, Issue: 2, Pages: 1-14
Authors:  Wang, Zhiqing;  Zou, Wei;  Zhang, Wei;  Ma, Hongxuan;  Zhang, Chi;  Guo, Yuxin
Adobe PDF (7966 KB)  |  Views/Downloads: 166/46  |  Submitted: 2023/06/20
Source approaching control  interaural time difference  robotic audition  sound source localization  
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 3622-3633
Authors:  Liu, Chong;  Zhang, Yuqi;  Wang, Hongsong;  Chen, Weihua;  Wang, Fan;  Huang, Yan;  Shen, Yi-Dong;  Wang, Liang
Views/Downloads: 90/0  |  Submitted: 2023/11/17
Image-text retrieval  multimodal transformer  multimodal contrastive training  
Multi-View Multi-Label Fine-Grained Emotion Decoding From Human Brain Activity (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 15
Authors:  Fu, Kaicheng;  Du, Changde;  Wang, Shengpei;  He, Huiguang
Adobe PDF (4570 KB)  |  Views/Downloads: 242/61  |  Submitted: 2022/12/27
Decoding  Brain modeling  Functional magnetic resonance imaging  Predictive models  Emotion recognition  Dimensionality reduction  Pattern recognition  Fine-grained emotion decoding  multi-label learning  multi-view learning  product of experts (PoEs)  variational autoencoder  
Navigating Diverse Salient Features for Vehicle Re-Identification (Journal article)
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, Pages: 10
Authors:  Qian, Wen;  He, Zhiqun;  Chen, Chen;  Peng, Silong
Adobe PDF (795 KB)  |  Views/Downloads: 255/51  |  Submitted: 2022/09/19
Navigation  Task analysis  Image color analysis  Boosting  Feature extraction  Benchmark testing  Space vehicles  Vehicle re-identification  suppress-and-explore mode  grid-based salient navigation  cross-space constraints  
Learning adversarial point-wise domain alignment for stereo matching (Journal article)
NEUROCOMPUTING, 2022, Volume: 491, Pages: 564-574
Authors:  Zhang, Chenghao;  Meng, Gaofeng;  Xu, Richard Yi Da;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF (3885 KB)  |  Views/Downloads: 251/48  |  Submitted: 2022/09/19
Stereo Matching  Domain adaptation  Point-wise linear transformation  Adversarial learning  
Tell, Imagine, and Search: End-to-end Learning for Composing Text and Image to Image Retrieval (Journal article)
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, Volume: 18, Issue: 2, Pages: 23
Authors:  Zhang, Feifei;  Xu, Mingliang;  Xu, Changsheng
Views/Downloads: 206/0  |  Submitted: 2022/06/10
Composing text and image to image retrieval  end-to-end  image generation  generative adversarial network  global-local  
An Efficient Sampling-Based Attention Network for Semantic Segmentation (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 2850-2863
Authors:  He, Xingjian;  Liu, Jing;  Wang, Weining;  Lu, Hanqing
Adobe PDF (3252 KB)  |  Views/Downloads: 336/75  |  Submitted: 2022/06/10
Stochastic processes  Sampling methods  Semantics  Image segmentation  Computational complexity  Pattern recognition  Convolution  Semantic segmentation  stochastic sampling-based attention  deterministic sampling-based attention