CASIA OpenIR

浏览/检索结果: 共23条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 4, 页码: 2041-2055
作者:  Chen, Yuxin;  Zhang, Ziqi;  Qi, Zhongang;  Yuan, Chunfeng;  Wang, Jie;  Shan, Ying;  Li, Bing;  Hu, Weiming;  Qie, Xiaohu;  Wu, Jianping
Adobe PDF(13765Kb)  |  收藏  |  浏览/下载:26/0  |  提交时间:2024/05/30
Chinese video captioning evaluation  dual-reconstruction transformer  
IterDepth: Iterative Residual Refinement for Outdoor Self-Supervised Multi-Frame Monocular Depth Estimation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 1, 页码: 329-341
作者:  Feng, Cheng;  Chen, Zhen;  Zhang, Congxuan;  Hu, Weiming;  Li, Bing;  Lu, Feng
收藏  |  浏览/下载:54/0  |  提交时间:2024/03/26
Estimation  Iterative methods  Cameras  Task analysis  Feature extraction  Decoding  Training  Monocular depth estimation  iterative refinement  self-supervised learning  deep learning  
DomainFeat: Learning Local Features With Domain Adaptation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 1, 页码: 46-59
作者:  Xu, Rongtao;  Wang, Changwei;  Xu, Shibiao;  Meng, Weiliang;  Zhang, Yuyang;  Fan, Bin;  Zhang, Xiaopeng
Adobe PDF(6039Kb)  |  收藏  |  浏览/下载:78/10  |  提交时间:2024/03/26
Feature extraction  Location awareness  Visualization  Robustness  Image matching  Detectors  Decoding  Local features  domain adaptation  cross-domain data  consistency loss  
Semantic-Context Graph Network for Point-Based 3D Object Detection 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 11, 页码: 6474-6486
作者:  Dong, Shuwei;  Kong, Xiaoyu;  Pan, Xingjia;  Tang, Fan;  Li, Wei;  Chang, Yi;  Dong, Weiming
收藏  |  浏览/下载:126/0  |  提交时间:2023/12/21
3D object detection  graph neural networks  information entanglement  
Exploring Explicitly Disentangled Features for Domain Generalization 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 11, 页码: 6360-6373
作者:  Li, Jingwei;  Li, Yuan;  Wang, Huanjie;  Liu, Chengbao;  Tan, Jie
Adobe PDF(2432Kb)  |  收藏  |  浏览/下载:112/8  |  提交时间:2023/12/21
Domain generalization  feature disentanglement  Fourier transform  data augmentation  
Jointing Recurrent Across-Channel and Spatial Attention for Multi-Object Tracking With Block-Erasing Data Augmentation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 8, 页码: 4054-4069
作者:  Deng, Keyu;  Zhang, Congxuan;  Chen, Zhen;  Hu, Weiming;  Li, Bing;  Lu, Feng
收藏  |  浏览/下载:122/0  |  提交时间:2023/11/17
Multi-object tracking  one shot  multiattention feature learning  block erasing strategy  object occlusions  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:146/18  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
A Unified Framework for High Fidelity Face Swap and Expression Reenactment 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 6, 页码: 3673-3684
作者:  Peng, Bo;  Fan, Hongxing;  Wang, Wei;  Dong, Jing;  Lyu, Siwei
收藏  |  浏览/下载:191/0  |  提交时间:2022/07/25
Faces  Task analysis  Videos  Shape  Three-dimensional displays  Face recognition  Information integrity  Face swap  expression reenactment  3DMM  video manipulation  
Density-Aware Haze Image Synthesis by Self-Supervised Content-Style Disentanglement 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 7, 页码: 4552-4572
作者:  Zhang, Chi;  Lin, Zihang;  Xu, Liheng;  Li, Zongliang;  Tang, Wei;  Liu, Yuehu;  Meng, Gaofeng;  Wang, Le;  Li, Li
收藏  |  浏览/下载:241/0  |  提交时间:2022/07/25
Feature extraction  Image synthesis  Scattering  Generative adversarial networks  Atmospheric modeling  Training  Testing  Haze synthesis  unsupervised image-to-image translation  self-supervised disentanglement  
Detecting Compressed Deepfake Videos in Social Networks Using Frame-Temporality Two-Stream Convolutional Network 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 3, 页码: 1089-1102
作者:  Hu, Juan;  Liao, Xin;  Wang, Wei;  Qin, Zheng
收藏  |  浏览/下载:216/0  |  提交时间:2022/06/06
Videos  Information integrity  Feature extraction  Streaming media  Faces  Forensics  Social networking (online)  Video forensics  compressed Deepfake videos  frame-level stream  temporality-level stream