CASIA OpenIR

Browse/Search Results: 31 items in total, showing items 1-10

IterDepth: Iterative Residual Refinement for Outdoor Self-Supervised Multi-Frame Monocular Depth Estimation [Journal article]
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 1, Pages: 329-341
Authors: Feng, Cheng; Chen, Zhen; Zhang, Congxuan; Hu, Weiming; Li, Bing; Lu, Feng
Views/Downloads: 12/0  |  Submitted: 2024/03/26
Estimation  Iterative methods  Cameras  Task analysis  Feature extraction  Decoding  Training  Monocular depth estimation  iterative refinement  self-supervised learning  deep learning  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation [Journal article]
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 1-13
Authors: Liu, Jiawei; Wang, Weining; Chen, Sihan; Zhu, Xinxin; Liu, Jing
Adobe PDF (7741Kb)  |  Views/Downloads: 117/20  |  Submitted: 2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Learning adversarial point-wise domain alignment for stereo matching [Journal article]
NEUROCOMPUTING, 2022, Volume: 491, Pages: 564-574
Authors: Zhang, Chenghao; Meng, Gaofeng; Xu, Richard Yi Da; Xiang, Shiming; Pan, Chunhong
Adobe PDF (3885Kb)  |  Views/Downloads: 248/48  |  Submitted: 2022/09/19
Stereo Matching  Domain adaptation  Point-wise linear transformation  Adversarial learning  
Meta Graph Transformer: A Novel Framework for Spatial-Temporal Traffic Prediction [Journal article]
NEUROCOMPUTING, 2022, Volume: 491, Pages: 544-563
Authors: Ye, Xue; Fang, Shen; Sun, Fang; Zhang, Chunxia; Xiang, Shiming
Adobe PDF (3491Kb)  |  Views/Downloads: 209/24  |  Submitted: 2022/09/19
Traffic prediction  Spatial-temporal modeling  Meta-learning  Attention mechanism  Deep learning  
PSNet: Perspective-sensitive convolutional network for object detection [Journal article]
NEUROCOMPUTING, 2022, Volume: 468, Pages: 384-395
Authors: Zhang, Xin; Liu, Yicheng; Huo, Chunlei; Xu, Nuo; Wang, Lingfeng; Pan, Chunhong
Adobe PDF (8656Kb)  |  Views/Downloads: 285/36  |  Submitted: 2021/12/28
Object detection  Perspective-sensitive  Structural neural network  
Holographic Feature Learning of Egocentric-Exocentric Videos for Multi-Domain Action Recognition [Journal article]
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, Volume: 24, Pages: 2273-2286
Authors: Huang, Yi; Yang, Xiaoshan; Gao, Junyun; Xu, Changsheng
Adobe PDF (2409Kb)  |  Views/Downloads: 301/61  |  Submitted: 2022/07/25
Videos  Feature extraction  Visualization  Task analysis  Computational modeling  Target recognition  Prototypes  Egocentric videos  exocentric videos  holographic feature  multi-domain  action recognition  
Human Parsing With Part-Aware Relation Modeling [Journal article]
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, Volume: 25, Pages: 2601-2612
Authors: Zhang, Xiaomei; Chen, Yingying; Tang, Ming; Wang, Jinqiao; Zhu, Xiangyu; Lei, Zhen
Adobe PDF (6053Kb)  |  Views/Downloads: 100/5  |  Submitted: 2023/11/17
Human parsing  modeling  part-aware relation  
Dynamic camera configuration learning for high-confidence active object detection [Journal article]
NEUROCOMPUTING, 2021, Volume: 466, Pages: 113-127
Authors: Xu, Nuo; Huo, Chunlei; Zhang, Xin; Cao, Yong; Meng, Gaofeng; Pan, Chunhong
Adobe PDF (4412Kb)  |  Views/Downloads: 261/50  |  Submitted: 2021/12/28
Object detection  Active object detection  Deep reinforcement learning  Camera control  
Richly Activated Graph Convolutional Network for Robust Skeleton-Based Action Recognition [Journal article]
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, Volume: 31, Issue: 5, Pages: 1915-1925
Authors: Song, Yi-Fan; Zhang, Zhang; Shan, Caifeng; Wang, Liang
Adobe PDF (3381Kb)  |  Views/Downloads: 347/55  |  Submitted: 2021/06/15
Skeleton  Robustness  Noise measurement  Three-dimensional displays  Degradation  Standards  Feature extraction  Action recognition  skeleton  activation map  graph convolutional network  occlusion  jittering  
End-to-end video text detection with online tracking [Journal article]
PATTERN RECOGNITION, 2021, Volume: 113, Pages: 12
Authors: Yu, Hongyuan; Huang, Yan; Pi, Lihong; Zhang, Chengquan; Li, Xuan; Wang, Liang
Adobe PDF (4997Kb)  |  Views/Downloads: 296/54  |  Submitted: 2021/05/06
End-to-end  Video text detection  Online tracking