CASIA OpenIR

浏览/检索结果: 共25条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Pavement Defect Detection with Deep Learning: A Comprehensive Survey 期刊论文
IEEE Transactions on Intelligent Vehicles, 2023, 卷号: 9, 期号: 3, 页码: 4292 - 4311
作者:  Lili Fan;  Dandan Wang;  Junhao Wang;  Yunjie Li;  Yifeng Cao;  Yi Liu;  Xiaoming Chen;  Yutong Wang
Adobe PDF(6287Kb)  |  收藏  |  浏览/下载:50/13  |  提交时间:2024/06/06
Deep learning  pavement defect detection  computer vision  image processing  3D image  
Covariant Peak Constraint for Accurate Keypoint Detection and Keypoint-Specific Descriptor Learning 期刊论文
IEEE Transactions on Multimedia, 2024, 卷号: 26, 页码: 5383 - 5397
作者:  Fu Yujie;  Zhang Pengju;  Tang Fulin;  Wu Yihong
Adobe PDF(15150Kb)  |  收藏  |  浏览/下载:48/11  |  提交时间:2024/05/28
Image Matching  Local Feature Extraction  Covariant Peak Constraint  Conditional Neural Reprojection Error  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:99/27  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Human Parsing With Part-Aware Relation Modeling 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 25, 页码: 2601-2612
作者:  Zhang, Xiaomei;  Chen, Yingying;  Tang, Ming;  Wang, Jinqiao;  Zhu, Xiangyu;  Lei, Zhen
Adobe PDF(6053Kb)  |  收藏  |  浏览/下载:173/24  |  提交时间:2023/11/17
Human parsing  modeling  part-aware relation  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:169/36  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2986-2997
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(5681Kb)  |  收藏  |  浏览/下载:418/5  |  提交时间:2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability  
Holographic Feature Learning of Egocentric-Exocentric Videos for Multi-Domain Action Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2273-2286
作者:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyun;  Xu, Changsheng
Adobe PDF(2409Kb)  |  收藏  |  浏览/下载:380/75  |  提交时间:2022/07/25
Videos  Feature extraction  Visualization  Task analysis  Computational modeling  Target recognition  Prototypes  Egocentric videos  exocentric videos  holographic feature  multi-domain  action recognition  
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 0
作者:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF(3550Kb)  |  收藏  |  浏览/下载:367/93  |  提交时间:2022/01/27
video question answering  attention mechanism  metric learning  
Unsupervised Video Summarization via Relation-Aware Assignment Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3203-3214
作者:  Gao, Junyu;  Yang, Xiaoshan;  Zhang, Yingying;  Xu, Changsheng
Adobe PDF(3649Kb)  |  收藏  |  浏览/下载:348/69  |  提交时间:2021/11/03
Feature extraction  Training  Optimization  Semantics  Recurrent neural networks  Task analysis  Graph neural network  unsupervised learning  video summarization  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 2386-2397
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(2165Kb)  |  收藏  |  浏览/下载:356/47  |  提交时间:2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy