CASIA OpenIR

浏览/检索结果: 共25条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
EFCPose: End-to-End Multi-Person Pose Estimation with Fully Convolutional Heads 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 页码: early access
作者:  Wang Haixin;  Zhou Lu;  Chen Yingying;  Wang Jinqiao
Adobe PDF(4407Kb)  |  收藏  |  浏览/下载:37/12  |  提交时间:2024/06/03
Dual-Path Transformer for 3D Human Pose Estimation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 5, 页码: 3260-3270
作者:  Zhou Lu;  Chen Yingying;  Wang Jinqiao
Adobe PDF(2410Kb)  |  收藏  |  浏览/下载:46/20  |  提交时间:2024/06/03
Transformers  Three-dimensional displays  Pose estimation  Task analysis  Solid modeling  Feature extraction  Benchmark testing  3D human pose estimation  transformer  motion  distillation  
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 4, 页码: 2041-2055
作者:  Chen, Yuxin;  Zhang, Ziqi;  Qi, Zhongang;  Yuan, Chunfeng;  Wang, Jie;  Shan, Ying;  Li, Bing;  Hu, Weiming;  Qie, Xiaohu;  Wu, Jianping
Adobe PDF(13765Kb)  |  收藏  |  浏览/下载:44/1  |  提交时间:2024/05/30
Chinese video captioning evaluation  dual-reconstruction transformer  
Distribution Unified and Probability Space Aligned Teacher-Student Learning for Imbalanced Visual Recognition 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 4, 页码: 2414-2425
作者:  Zhang, Shaoyu;  Chen, Chen;  Xie, Qiong;  Sun, Haigang;  Dong, Fei;  Peng, Silong
Adobe PDF(2511Kb)  |  收藏  |  浏览/下载:51/10  |  提交时间:2024/05/30
Class-imbalanced learning  distribution mismatch  probability space mismatch  teacher-student learning  
DomainFeat: Learning Local Features With Domain Adaptation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 1, 页码: 46-59
作者:  Xu, Rongtao;  Wang, Changwei;  Xu, Shibiao;  Meng, Weiliang;  Zhang, Yuyang;  Fan, Bin;  Zhang, Xiaopeng
Adobe PDF(6039Kb)  |  收藏  |  浏览/下载:93/13  |  提交时间:2024/03/26
Feature extraction  Location awareness  Visualization  Robustness  Image matching  Detectors  Decoding  Local features  domain adaptation  cross-domain data  consistency loss  
Involving Distinguished Temporal Graph Convolutional Networks for Skeleton-Based Temporal Action Segmentation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 1, 页码: 647-660
作者:  Li, Yun-Heng;  Liu, Kai-Yuan;  Liu, Sheng-Lan;  Feng, Lin;  Qiao, Hong
收藏  |  浏览/下载:76/0  |  提交时间:2024/03/26
Feature extraction  Motion segmentation  Correlation  Convolution  Topology  Convolutional neural networks  Solid modeling  Skeleton-based temporal action segmentation  enhanced spatial graph structure  segmented encoding  
Exploring Explicitly Disentangled Features for Domain Generalization 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 11, 页码: 6360-6373
作者:  Li, Jingwei;  Li, Yuan;  Wang, Huanjie;  Liu, Chengbao;  Tan, Jie
Adobe PDF(2432Kb)  |  收藏  |  浏览/下载:132/14  |  提交时间:2023/12/21
Domain generalization  feature disentanglement  Fourier transform  data augmentation  
Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 5009-5021
作者:  Ji, Ruyi;  Li, Jiaying;  Zhang, Libo;  Liu, Jing;  Wu, Yanjun
Adobe PDF(4636Kb)  |  收藏  |  浏览/下载:173/16  |  提交时间:2023/11/16
Transformer  multi-grained assembly  fine-grained visual classification  
MoEP-AE: Autoencoding Mixtures of Exponential Power Distributions for Open-Set Recognition 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 1, 页码: 312-325
作者:  Jiayin, Sun;  Hong, Wang;  Qiulei, Dong
Adobe PDF(3639Kb)  |  收藏  |  浏览/下载:147/14  |  提交时间:2023/03/20
open-set recognition  autoencoder  scale mixture distribution  exponential power distribution  
Category-Level 6D Object Pose Estimation With Structure Encoder and Reasoning Attention 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 10, 页码: 6728-6740
作者:  Liu, Jierui;  Cao, Zhiqiang;  Tang, Yingbo;  Liu, Xilong;  Tan, Min
Adobe PDF(22124Kb)  |  收藏  |  浏览/下载:299/13  |  提交时间:2022/11/14
Shape  Three-dimensional displays  Cognition  Pose estimation  Feature extraction  Decoding  Solid modeling  Category-level  6D object pose estimation  structure encoder  reasoning attention