CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 2, 页码: 19
作者:  You, Sisi;  Zuo, Yukun;  Yao, Hantao;  Xu, Changsheng
收藏  |  浏览/下载:96/0  |  提交时间:2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
Multi-level consistency regularization for domain adaptive object detection 期刊论文
NEURAL COMPUTING & APPLICATIONS, 2023, 页码: 18003–18018
作者:  Tian, Kun;  Zhang, Chenghao;  Wang, Ying;  Xiang, Shiming
Adobe PDF(2628Kb)  |  收藏  |  浏览/下载:53/5  |  提交时间:2023/11/17
Consistency regularization  Object detection  Domain adaptation  
Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2986-2997
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(5681Kb)  |  收藏  |  浏览/下载:413/4  |  提交时间:2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability  
Joint Expression Synthesis and Representation Learning for Facial Expression Recognition 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 3, 页码: 1681-1695
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(4827Kb)  |  收藏  |  浏览/下载:267/3  |  提交时间:2022/06/06
Face recognition  Task analysis  Generative adversarial networks  Image synthesis  Image recognition  Faces  Training  Facial expression recognition  facial image synthesis  generative adversarial network  representation learning  
Pay attention to doctor & ndash;patient dialogues: Multi-modal knowledge graph attention image-text embedding for COVID-19 diagnosis 期刊论文
INFORMATION FUSION, 2021, 卷号: 75, 页码: 168-185
作者:  Zheng, Wenbo;  Yan, Lan;  Gou, Chao;  Zhang, Zhi-Cheng;  Zhang, Jun Jason;  Hu, Ming;  Wang, Fei-Yue
收藏  |  浏览/下载:282/0  |  提交时间:2021/11/02
COVID-19 diagnose  Knowledge attention mechanism  Knowledge-based representation learning  Knowledge embedding  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 2386-2397
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(2165Kb)  |  收藏  |  浏览/下载:355/47  |  提交时间:2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
Knowledge-driven Egocentric Multimodal Activity Recognition 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 卷号: 16, 期号: 4, 页码: 21
作者:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyu;  Sang, Jitao;  Xu, Changsheng
Adobe PDF(1875Kb)  |  收藏  |  浏览/下载:428/68  |  提交时间:2021/03/08
Egocentric videos  wearable sensors  graph neural networks  
Lightweight Two-Stream Convolutional Neural Network for SAR Target Recognition 期刊论文
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 卷号: 18, 期号: 0, 页码: 1-5
作者:  Huang, Xiayuan;  Yang, Qiao;  Qiao, Hong
浏览  |  Adobe PDF(736Kb)  |  收藏  |  浏览/下载:197/75  |  提交时间:2020/10/13
Lightweight  synthetic aperture radar (SAR) target recognition  two-stream convolutional neural network (CNN)  
Self-Attention Based Visual-Tactile Fusion Learning for Predicting Grasp Outcomes 期刊论文
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 卷号: 5, 期号: 4, 页码: 5827-5834
作者:  Cui, Shaowei;  Wang, Rui;  Wei, Junhang;  Hu, Jingyi;  Wang, Shuo
Adobe PDF(1535Kb)  |  收藏  |  浏览/下载:358/64  |  提交时间:2020/08/31
Grasping  perception for grasping and manipulation  multi-modal perception  force and tactile sensing