CASIA OpenIR

浏览/检索结果: 共48条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Positive Unlabeled Fake News Detection via Multi-Modal Masked Transformer Network 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 234-244
作者:  Wang, Jinguang;  Qian, Shengsheng;  Hu, Jun;  Hong, Richang
收藏  |  浏览/下载:31/0  |  提交时间:2024/03/26
Fake news detection  multi-modal learning  social media  
Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 卷号: 32, 页码: 151-163
作者:  Xu, Jiaming;  Cui, Jian;  Hao, Yunzhe;  Xu, Bo
收藏  |  浏览/下载:78/0  |  提交时间:2024/02/22
Cocktail party problem  target speaker separation  multi-cue guided separation  semi-supervised learning  
Multi-modal spatial relational attention networks for visual question answering 期刊论文
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 13
作者:  Yao, Haibo;  Wang, Lipeng;  Cai, Chengtao;  Sun, Yuxin;  Zhang, Zhi;  Luo, Yongkang
收藏  |  浏览/下载:85/0  |  提交时间:2024/02/22
Visual question answering  Spatial relation  Attention mechanism  Pre -training strategy  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:80/21  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
A Graded Assessment System for Parkinsons Upper-Limb Bradykinesia Based on a Temporal Convolutional Network Model 期刊论文
IEEE SENSORS JOURNAL, 2023, 卷号: 23, 期号: 23, 页码: 29283-29292
作者:  Tong, Lina;  Liu, Dai-Song;  Peng, Liang;  Hao, Hong-Lin;  Wang, Chen
Adobe PDF(9425Kb)  |  收藏  |  浏览/下载:56/5  |  提交时间:2024/02/21
Bradykinesia grade  inertial sensors  Parkinson's disease (PD)  temporal convolutional network (TCN)  wearable device  
Emotion-Aware Music Driven Movie Montage 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 卷号: 38, 期号: 3, 页码: 540-553
作者:  Liu, Wu-Qin;  Lin, Min-Xuan;  Huang, Hai-Bin;  Ma, Chong-Yang;  Song, Yu;  Dong, Wei-Ming;  Xu, Chang-Sheng
收藏  |  浏览/下载:126/0  |  提交时间:2023/12/21
movie montage  emotion analysis  audio-visual modality  contrastive learning  
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 2, 页码: 19
作者:  You, Sisi;  Zuo, Yukun;  Yao, Hantao;  Xu, Changsheng
收藏  |  浏览/下载:88/0  |  提交时间:2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
Zero-Shot Predicate Prediction for Scene Graph Parsing 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 3140-3153
作者:  Li, Yiming;  Yang, Xiaoshan;  Huang, Xuhui;  Ma, Zhe;  Xu, Changsheng
收藏  |  浏览/下载:152/0  |  提交时间:2023/11/17
Deep learning  zero-shot  scene graph  
Tri-HGNN: Learning triple policies fused hierarchical graph neural networks for pedestrian trajectory prediction 期刊论文
PATTERN RECOGNITION, 2023, 卷号: 143, 页码: 11
作者:  Zhu, Wenjun;  Liu, Yanghong;  Wang, Peng;  Zhang, Mengyi;  Wang, Tian;  Yi, Yang
收藏  |  浏览/下载:93/0  |  提交时间:2023/11/17
Trajectory prediction  Hierarchical policy  Graph neural networks  
Multi-Source Knowledge Reasoning Graph Network for Multi-Modal Commonsense Inference 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 卷号: 19, 期号: 4, 页码: 17
作者:  Ma, Xuan;  Yang, Xiaoshan;  Xu, Changsheng
收藏  |  浏览/下载:93/0  |  提交时间:2023/11/17
Knowledge reasoning  multi-modal commonsense inference  graph neural network