CASIA OpenIR

浏览/检索结果: 共19条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Pavement Defect Detection with Deep Learning: A Comprehensive Survey 期刊论文
IEEE Transactions on Intelligent Vehicles, 2023, 卷号: 9, 期号: 3, 页码: 4292 - 4311
作者:  Lili Fan;  Dandan Wang;  Junhao Wang;  Yunjie Li;  Yifeng Cao;  Yi Liu;  Xiaoming Chen;  Yutong Wang
Adobe PDF(6287Kb)  |  收藏  |  浏览/下载:52/13  |  提交时间:2024/06/06
Deep learning  pavement defect detection  computer vision  image processing  3D image  
A Multi-Modal Classification Method for Early Diagnosis of Mild Cognitive Impairment and Alzheimer's Disease Using Three Paradigms With Various Task Difficulties 期刊论文
IEEE Transactions on Neural Systems and Rehabilitation Engineering, 2024, 卷号: 32, 页码: 1456-1465
作者:  Chen Sheng;  Zhang Chutian;  Yang Hongjun;  Peng Liang;  Xie Haiqun;  Lv Zeping;  Hou Zeng-Guang
Adobe PDF(10190Kb)  |  收藏  |  浏览/下载:56/16  |  提交时间:2024/05/29
Dementia  multi-modal  machine learning  domain-adversarial neural network  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:104/28  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
A Graded Assessment System for Parkinsons Upper-Limb Bradykinesia Based on a Temporal Convolutional Network Model 期刊论文
IEEE SENSORS JOURNAL, 2023, 卷号: 23, 期号: 23, 页码: 29283-29292
作者:  Tong, Lina;  Liu, Dai-Song;  Peng, Liang;  Hao, Hong-Lin;  Wang, Chen
Adobe PDF(9425Kb)  |  收藏  |  浏览/下载:78/9  |  提交时间:2024/02/21
Bradykinesia grade  inertial sensors  Parkinson's disease (PD)  temporal convolutional network (TCN)  wearable device  
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 2, 页码: 19
作者:  You, Sisi;  Zuo, Yukun;  Yao, Hantao;  Xu, Changsheng
收藏  |  浏览/下载:100/0  |  提交时间:2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 3622-3633
作者:  Liu, Chong;  Zhang, Yuqi;  Wang, Hongsong;  Chen, Weihua;  Wang, Fan;  Huang, Yan;  Shen, Yi-Dong;  Wang, Liang
收藏  |  浏览/下载:151/0  |  提交时间:2023/11/17
Index Terms-Image-text retrieval  multimodal transformer  multimodal contrastive training  
Subgraph-aware graph structure revision for spatial-temporal graph modeling 期刊论文
NEURAL NETWORKS, 2022, 卷号: 154, 页码: 190-202
作者:  Wang, Yuhu;  Zhang, Chunxia;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(1781Kb)  |  收藏  |  浏览/下载:209/14  |  提交时间:2023/01/09
Graph structure learning  Graph neural network  Spatial-temporal graph modeling  
Learning adversarial point-wise domain alignment for stereo matching 期刊论文
NEUROCOMPUTING, 2022, 卷号: 491, 页码: 564-574
作者:  Zhang, Chenghao;  Meng, Gaofeng;  Xu, Richard Yi Da;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(3885Kb)  |  收藏  |  浏览/下载:387/59  |  提交时间:2022/09/19
Stereo Matching  Domain adaptation  Point-wise linear transformation  Adversarial learning  
Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2986-2997
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(5681Kb)  |  收藏  |  浏览/下载:425/7  |  提交时间:2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability  
Holographic Feature Learning of Egocentric-Exocentric Videos for Multi-Domain Action Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2273-2286
作者:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyun;  Xu, Changsheng
Adobe PDF(2409Kb)  |  收藏  |  浏览/下载:387/76  |  提交时间:2022/07/25
Videos  Feature extraction  Visualization  Task analysis  Computational modeling  Target recognition  Prototypes  Egocentric videos  exocentric videos  holographic feature  multi-domain  action recognition