CASIA OpenIR

Browse/Search Results: 31 items in total, showing items 1-10

Diff-pcg: diffusion point cloud generation conditioned on continuous normalizing flow [Journal article]
The Visual Computer, 2024, Pages: 1-15
Authors:  Yu T(余挺);  Meng WL(孟维亮);  Wu ZQ(吴仲琦);  Guo JW(郭建伟);  Zhang XP(张晓鹏)
Adobe PDF(2471Kb)  |  Views/Downloads: 36/10  |  Submitted: 2024/06/11
3D shape generation  Diffusion model  Continuous normalizing flow  Point cloud  
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation [Journal article]
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 4, Pages: 2041-2055
Authors:  Chen, Yuxin;  Zhang, Ziqi;  Qi, Zhongang;  Yuan, Chunfeng;  Wang, Jie;  Shan, Ying;  Li, Bing;  Hu, Weiming;  Qie, Xiaohu;  Wu, Jianping
Adobe PDF(13765Kb)  |  Views/Downloads: 44/1  |  Submitted: 2024/05/30
Chinese video captioning evaluation  dual-reconstruction transformer  
Local feature matching using deep learning: A survey [Journal article]
INFORMATION FUSION, 2024, Volume: 107, Pages: 25
Authors:  Xu, Shibiao;  Chen, Shunpeng;  Xu, Rongtao;  Wang, Changwei;  Lu, Peng;  Guo, Li
Views/Downloads: 31/0  |  Submitted: 2024/05/30
Local feature matching  Image matching  Deep learning  Survey  
Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization [Journal article]
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Issue: 0, Pages: 1045-1058
Authors:  Changwei Wang;  Rongtao Xu;  Shibiao Xu;  Weiliang Meng;  Ruisheng Wang;  Xiaopeng Zhang
Adobe PDF(3269Kb)  |  Views/Downloads: 67/23  |  Submitted: 2024/05/29
Weakly supervised object localization  intrinsic discrimination and consistency  deep metric learning  geometric transformation consistency  
Learning Proposal-Aware Re-Ranking for Weakly-Supervised Temporal Action Localization [Journal article]
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 1, Pages: 207-220
Authors:  Hu, Yufan;  Fu, Jie;  Chen, Mengyuan;  Gao, Junyu;  Dong, Jianfeng;  Fan, Bin;  Liu, Hongmin
Views/Downloads: 60/0  |  Submitted: 2024/03/26
Proposals  Feature extraction  Location awareness  Videos  Measurement  Task analysis  Optimization  weakly-supervised temporal action localization  Proposal-aware reranking  
General vs. Long-Tailed Age Estimation: An Approach to Kill Two Birds With One Stone [Journal article]
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 6155-6167
Authors:  Bao, Zenghao;  Tan, Zichang;  Li, Jun;  Wan, Jun;  Ma, Xibo;  Lei, Zhen
Adobe PDF(1634Kb)  |  Views/Downloads: 59/2  |  Submitted: 2024/02/22
General age estimation  long-tailed age estimation  class-wise mean absolute error  
Key-Part Attention Retrieval for Robotic Object Recognition [Journal article]
TSINGHUA SCIENCE AND TECHNOLOGY, 2024, Volume: 29, Issue: 3, Pages: 644-655
Authors:  Liu, Jierui;  Cao, Zhiqiang;  Tang, Yingbo
Adobe PDF(2164Kb)  |  Views/Downloads: 117/17  |  Submitted: 2024/02/22
Training  Visualization  Image recognition  Cameras  Object recognition  Convolutional neural networks  Data mining  key-part attention  retrieval  robotic object recognition  
Reducing Vision-Answer Biases for Multiple-Choice VQA [Journal article]
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 4621-4634
Authors:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(2684Kb)  |  Views/Downloads: 93/6  |  Submitted: 2023/11/17
Multiple-choice VQA  vision-answer bias  causal intervention  counterfactual interaction learning  
A Novel Biologically Inspired Structural Model for Feature Correspondence [Journal article]
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, Volume: 15, Issue: 2, Pages: 844-854
Authors:  Lu, Yan-Feng;  Yang, Xu;  Li, Yi;  Yu, Qian;  Liu, Zhi-Yong;  Qiao, Hong
Adobe PDF(4447Kb)  |  Views/Downloads: 187/14  |  Submitted: 2023/11/17
Visualization  Biological system modeling  Biology  Brain modeling  Biological information theory  Task analysis  Strain  Appearance feature descriptor  biologically inspired model  feature correspondence  feature representation  graph matching (GM)  graph structure  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models [Journal article]
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 9, Pages: 4616-4629
Authors:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  Views/Downloads: 167/26  |  Submitted: 2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection