CASIA OpenIR

Browse/Search results: 20 items in total, showing items 1-10

EFCPose: End-to-End Multi-Person Pose Estimation with Fully Convolutional Heads (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Pages: early access
Authors:  Wang Haixin;  Zhou Lu;  Chen Yingying;  Wang Jinqiao
Adobe PDF (4407Kb)  |  Views/Downloads: 35/11  |  Submitted: 2024/06/03
Dual-Path Transformer for 3D Human Pose Estimation (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 5, Pages: 3260-3270
Authors:  Zhou Lu;  Chen Yingying;  Wang Jinqiao
Adobe PDF (2410Kb)  |  Views/Downloads: 44/19  |  Submitted: 2024/06/03
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 4, Pages: 2041-2055
Authors:  Chen, Yuxin;  Zhang, Ziqi;  Qi, Zhongang;  Yuan, Chunfeng;  Wang, Jie;  Shan, Ying;  Li, Bing;  Hu, Weiming;  Qie, Xiaohu;  Wu, Jianping
Adobe PDF (13765Kb)  |  Views/Downloads: 42/1  |  Submitted: 2024/05/30
Chinese video captioning evaluation  dual-reconstruction transformer  
Involving Distinguished Temporal Graph Convolutional Networks for Skeleton-Based Temporal Action Segmentation (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 1, Pages: 647-660
Authors:  Li, Yun-Heng;  Liu, Kai-Yuan;  Liu, Sheng-Lan;  Feng, Lin;  Qiao, Hong
Views/Downloads: 73/0  |  Submitted: 2024/03/26
Feature extraction  Motion segmentation  Correlation  Convolution  Topology  Convolutional neural networks  Solid modeling  Skeleton-based temporal action segmentation  enhanced spatial graph structure  segmented encoding  
Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 9, Pages: 5009-5021
Authors:  Ji, Ruyi;  Li, Jiaying;  Zhang, Libo;  Liu, Jing;  Wu, Yanjun
Adobe PDF (4636Kb)  |  Views/Downloads: 171/16  |  Submitted: 2023/11/16
Transformer  multi-grained assembly  fine-grained visual classification  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 9, Pages: 4616-4629
Authors:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF (1644Kb)  |  Views/Downloads: 166/26  |  Submitted: 2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
Learning Semantic-Aware Spatial-Temporal Attention for Interpretable Action Recognition (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, Volume: 32, Issue: 8, Pages: 5213-5224
Authors:  Fu, Jie;  Gao, Junyu;  Xu, Changsheng
Views/Downloads: 371/0  |  Submitted: 2022/09/19
Visualization  Semantics  Task analysis  Three-dimensional displays  Feature extraction  Solid modeling  Predictive models  Semantic-aware  spatial-temporal attention  interpretable  action recognition  
Joint Expression Synthesis and Representation Learning for Facial Expression Recognition (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, Volume: 32, Issue: 3, Pages: 1681-1695
Authors:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF (4827Kb)  |  Views/Downloads: 269/3  |  Submitted: 2022/06/06
Face recognition  Task analysis  Generative adversarial networks  Image synthesis  Image recognition  Faces  Training  Facial expression recognition  facial image synthesis  generative adversarial network  representation learning  
Richly Activated Graph Convolutional Network for Robust Skeleton-Based Action Recognition (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, Volume: 31, Issue: 5, Pages: 1915-1925
Authors:  Song, Yi-Fan;  Zhang, Zhang;  Shan, Caifeng;  Wang, Liang
Adobe PDF (3381Kb)  |  Views/Downloads: 419/67  |  Submitted: 2021/06/15
Skeleton  Robustness  Noise measurement  Three-dimensional displays  Degradation  Standards  Feature extraction  Action recognition  skeleton  activation map  graph convolutional network  occlusion  jittering  
Temporal-Spatial Mapping for Action Recognition (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, Volume: 30, Issue: 3, Pages: 748-759
Authors:  Song, Xiaolin;  Lan, Cuiling;  Zeng, Wenjun;  Xing, Junliang;  Sun, Xiaoyan;  Yang, Jingyu
Views/Downloads: 260/0  |  Submitted: 2020/06/02
Two dimensional displays  Three-dimensional displays  Feature extraction  Optical imaging  Computational modeling  Streaming media  Kernel  Temporal-spatial mapping (TSM)  action recognition  deep learning