CASIA OpenIR

浏览/检索结果: 共26条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Compositional Prompting Video-language Models to Understand Procedure in Instructional Videos 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 249-262
作者:  Guyue Hu;  Bin He;  Hanwang Zhang
Adobe PDF(2167Kb)  |  收藏  |  浏览/下载:23/11  |  提交时间:2024/04/23
Prompt learning  video-language pretrained models  instructional videos  procedure understanding  knowledge distilling  
Vectorized Evidential Learning for Weakly-Supervised Temporal Action Localization 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 15949-15963
作者:  Gao, Junyu;  Chen, Mengyuan;  Xu, Changsheng
收藏  |  浏览/下载:31/0  |  提交时间:2024/03/26
Uncertainty  Location awareness  Reliability  Videos  Noise measurement  Estimation  Deep learning  Weakly-supervised learning  temporal action localization  evidential deep learning  uncertainty estimation  
Learning Proposal-Aware Re-Ranking for Weakly-Supervised Temporal Action Localization 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 1, 页码: 207-220
作者:  Hu, Yufan;  Fu, Jie;  Chen, Mengyuan;  Gao, Junyu;  Dong, Jianfeng;  Fan, Bin;  Liu, Hongmin
收藏  |  浏览/下载:27/0  |  提交时间:2024/03/26
Proposals  Feature extraction  Location awareness  Videos  Measurement  Task analysis  Optimization  weakly-supervised temporal action localization  Proposal-aware reranking  
Object Affinity Learning: Towards Annotation-Free Instance Segmentation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 11, 页码: 13959-13973
作者:  Wang, Yuqi;  Chen, Yuntao;  Zhang, Zhaoxiang
收藏  |  浏览/下载:82/0  |  提交时间:2023/12/21
Videos  Motion segmentation  Visualization  Three-dimensional displays  Task analysis  Object detection  Geometry  Object affinity learning  geometric information  annotation-free instance segmentation  
Motion Decoupling Network for Intra-Operative Motion Estimation Under Occlusion 期刊论文
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 卷号: 42, 期号: 10, 页码: 2924-2935
作者:  Bian, Gui-Bin;  Zhang, Li;  Chen, He;  Li, Zhen;  Fu, Pan;  Yue, Wen-Qian;  Luo, Yu-Wen;  Ge, Pei-Cong;  Liu, Wei-Peng
收藏  |  浏览/下载:125/0  |  提交时间:2023/12/21
Optical flow  Surgery  Instruments  Estimation  Task analysis  Videos  Motion estimation  Computer-assisted surgery  motion estimation  optical flow  self-supervised learning  surgical images  
A Unified Framework for High Fidelity Face Swap and Expression Reenactment 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 6, 页码: 3673-3684
作者:  Peng, Bo;  Fan, Hongxing;  Wang, Wei;  Dong, Jing;  Lyu, Siwei
收藏  |  浏览/下载:171/0  |  提交时间:2022/07/25
Faces  Task analysis  Videos  Shape  Three-dimensional displays  Face recognition  Information integrity  Face swap  expression reenactment  3DMM  video manipulation  
Holographic Feature Learning of Egocentric-Exocentric Videos for Multi-Domain Action Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2273-2286
作者:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyun;  Xu, Changsheng
Adobe PDF(2409Kb)  |  收藏  |  浏览/下载:340/65  |  提交时间:2022/07/25
Videos  Feature extraction  Visualization  Task analysis  Computational modeling  Target recognition  Prototypes  Egocentric videos  exocentric videos  holographic feature  multi-domain  action recognition  
VidSfM: Robust and Accurate Structure-From-Motion for Monocular Videos 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 2449-2462
作者:  Cui, Hainan;  Tu, Diantao;  Tang, Fulin;  Xu, Pengfei;  Liu, Hongmin;  Shen, Shuhan
收藏  |  浏览/下载:250/0  |  提交时间:2022/06/06
Cameras  Image reconstruction  Videos  Simultaneous localization and mapping  Video sequences  Robustness  Scalability  Structure from motion  image reconstruction  computational geometry  computer vision  
Detecting Compressed Deepfake Videos in Social Networks Using Frame-Temporality Two-Stream Convolutional Network 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 3, 页码: 1089-1102
作者:  Hu, Juan;  Liao, Xin;  Wang, Wei;  Qin, Zheng
收藏  |  浏览/下载:203/0  |  提交时间:2022/06/06
Videos  Information integrity  Feature extraction  Streaming media  Faces  Forensics  Social networking (online)  Video forensics  compressed Deepfake videos  frame-level stream  temporality-level stream  
Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 338-351
作者:  Zheng, Aihua;  Hu, Menglan;  Jiang, Bo;  Huang, Yan;  Yan, Yan;  Luo, Bin
收藏  |  浏览/下载:239/0  |  提交时间:2022/03/17
Visualization  Task analysis  Measurement  Speech recognition  Videos  Location awareness  Image recognition  Adversarial learning  audio-visual matching  cross-modal learning  metric learning