CASIA OpenIR

浏览/检索结果: 共22条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 4, 页码: 2041-2055
作者:  Chen, Yuxin;  Zhang, Ziqi;  Qi, Zhongang;  Yuan, Chunfeng;  Wang, Jie;  Shan, Ying;  Li, Bing;  Hu, Weiming;  Qie, Xiaohu;  Wu, Jianping
Adobe PDF(13765Kb)  |  收藏  |  浏览/下载:37/1  |  提交时间:2024/05/30
Chinese video captioning evaluation  dual-reconstruction transformer  
IterDepth: Iterative Residual Refinement for Outdoor Self-Supervised Multi-Frame Monocular Depth Estimation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 1, 页码: 329-341
作者:  Feng, Cheng;  Chen, Zhen;  Zhang, Congxuan;  Hu, Weiming;  Li, Bing;  Lu, Feng
收藏  |  浏览/下载:58/0  |  提交时间:2024/03/26
Estimation  Iterative methods  Cameras  Task analysis  Feature extraction  Decoding  Training  Monocular depth estimation  iterative refinement  self-supervised learning  deep learning  
DomainFeat: Learning Local Features With Domain Adaptation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 1, 页码: 46-59
作者:  Xu, Rongtao;  Wang, Changwei;  Xu, Shibiao;  Meng, Weiliang;  Zhang, Yuyang;  Fan, Bin;  Zhang, Xiaopeng
Adobe PDF(6039Kb)  |  收藏  |  浏览/下载:85/11  |  提交时间:2024/03/26
Feature extraction  Location awareness  Visualization  Robustness  Image matching  Detectors  Decoding  Local features  domain adaptation  cross-domain data  consistency loss  
Exploring Explicitly Disentangled Features for Domain Generalization 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 11, 页码: 6360-6373
作者:  Li, Jingwei;  Li, Yuan;  Wang, Huanjie;  Liu, Chengbao;  Tan, Jie
Adobe PDF(2432Kb)  |  收藏  |  浏览/下载:122/11  |  提交时间:2023/12/21
Domain generalization  feature disentanglement  Fourier transform  data augmentation  
Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 5009-5021
作者:  Ji, Ruyi;  Li, Jiaying;  Zhang, Libo;  Liu, Jing;  Wu, Yanjun
Adobe PDF(4636Kb)  |  收藏  |  浏览/下载:162/13  |  提交时间:2023/11/16
Transformer  multi-grained assembly  fine-grained visual classification  
3D Mapping and 6D Pose Computation for Real Time Augmented Reality on Cylindrical Objects 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 卷号: 30, 期号: 9, 页码: 2887-2899
作者:  Tang FL(唐付林);  Wu YH(吴毅红);  Hou XH(侯晓辉);  Lin HB(林海滨)
Adobe PDF(4394Kb)  |  收藏  |  浏览/下载:165/49  |  提交时间:2023/04/25
Category-Level 6D Object Pose Estimation With Structure Encoder and Reasoning Attention 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 10, 页码: 6728-6740
作者:  Liu, Jierui;  Cao, Zhiqiang;  Tang, Yingbo;  Liu, Xilong;  Tan, Min
Adobe PDF(22124Kb)  |  收藏  |  浏览/下载:289/11  |  提交时间:2022/11/14
Shape  Three-dimensional displays  Cognition  Pose estimation  Feature extraction  Decoding  Solid modeling  Category-level  6D object pose estimation  structure encoder  reasoning attention  
Learning Semantic-Aware Spatial-Temporal Attention for Interpretable Action Recognition 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 8, 页码: 5213-5224
作者:  Fu, Jie;  Gao, Junyu;  Xu, Changsheng
收藏  |  浏览/下载:361/0  |  提交时间:2022/09/19
Visualization  Semantics  Task analysis  Three-dimensional displays  Feature extraction  Solid modeling  Predictive models  Semantic-aware  spatial-temporal attention  interpretable  action recognition  
Joint Expression Synthesis and Representation Learning for Facial Expression Recognition 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 3, 页码: 1681-1695
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(4827Kb)  |  收藏  |  浏览/下载:263/2  |  提交时间:2022/06/06
Face recognition  Task analysis  Generative adversarial networks  Image synthesis  Image recognition  Faces  Training  Facial expression recognition  facial image synthesis  generative adversarial network  representation learning  
RGBT Tracking by Trident Fusion Network 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 2, 页码: 579-592
作者:  Zhu, Yabin;  Li, Chenglong;  Tang, Jin;  Luo, Bin;  Wang, Liang
收藏  |  浏览/下载:238/0  |  提交时间:2022/06/06
Feature extraction  Convolution  Target tracking  Training  Aggregates  Visualization  Benchmark testing  RGBT tracking  feature aggregation  feature pruning  trident architecture