CASIA OpenIR

浏览/检索结果: 共24条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Diff-pcg: diffusion point cloud generation conditioned on continuous normalizing flow 期刊论文
The Visual Computer, 2024, 页码: 1-15
作者:  Yu T(余挺);  Meng WL(孟维亮);  Wu ZQ(吴仲琦);  Guo JW(郭建伟);  Zhang XP(张晓鹏)
Adobe PDF(2471Kb)  |  收藏  |  浏览/下载:39/11  |  提交时间:2024/06/11
3D shape generation  Diffusion model  Continuous normalizing flow  Point cloud  
HIDE: Hierarchical iterative decoding enhancement for multi-view 3D human parameter regression 期刊论文
Computer Animation and Virtual Worlds, 2024, 卷号: 35, 期号: 35, 页码: 3
作者:  Lin WT(林伟涛);  Zhang JG(张吉光);  Meng WL(孟维亮);  Liu XL(刘湘龙);  Zhang XP(张晓鹏)
Adobe PDF(11125Kb)  |  收藏  |  浏览/下载:32/6  |  提交时间:2024/06/11
3D human mesh recovery  body modeling  computer vision  deep learning  
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 4, 页码: 2041-2055
作者:  Chen, Yuxin;  Zhang, Ziqi;  Qi, Zhongang;  Yuan, Chunfeng;  Wang, Jie;  Shan, Ying;  Li, Bing;  Hu, Weiming;  Qie, Xiaohu;  Wu, Jianping
Adobe PDF(13765Kb)  |  收藏  |  浏览/下载:49/1  |  提交时间:2024/05/30
Chinese video captioning evaluation  dual-reconstruction transformer  
Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 期号: 0, 页码: 1045 - 1058
作者:  Changwei Wang;  Rongtao Xu;  Shibiao Xu;  Weiliang Meng;  Ruisheng Wang;  Xiaopeng Zhang
Adobe PDF(3269Kb)  |  收藏  |  浏览/下载:68/23  |  提交时间:2024/05/29
Weakly supervised object localization  intrinsic discrimination and consistency  deep metric learning  geometric transformation consistency  
Hierarchical Distribution-Based Tightly-Coupled LiDAR Inertial Odometry 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 卷号: 9, 期号: 1, 页码: 1423-1435
作者:  Wang, Chengpeng;  Cao, Zhiqiang;  Li, Jianjie;  Yu, Junzhi;  Wang, Shuo
Adobe PDF(3553Kb)  |  收藏  |  浏览/下载:56/14  |  提交时间:2024/05/28
3D LiDAR inertial odometry,  distribution  filtering  optimization  point cloud constraint degeneration  
General vs. Long-Tailed Age Estimation: An Approach to Kill Two Birds With One Stone 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 6155-6167
作者:  Bao, Zenghao;  Tan, Zichang;  Li, Jun;  Wan, Jun;  Ma, Xibo;  Lei, Zhen
Adobe PDF(1634Kb)  |  收藏  |  浏览/下载:61/3  |  提交时间:2024/02/22
General age estimation  long-tailed age estimation  class-wise mean absolute error  
Key-Part Attention Retrieval for Robotic Object Recognition 期刊论文
TSINGHUA SCIENCE AND TECHNOLOGY, 2024, 卷号: 29, 期号: 3, 页码: 644-655
作者:  Liu, Jierui;  Cao, Zhiqiang;  Tang, Yingbo
Adobe PDF(2164Kb)  |  收藏  |  浏览/下载:120/18  |  提交时间:2024/02/22
Training  Visualization  Image recognition  Cameras  Object recognition  Convolutional neural networks  Data mining  key-part attention  retrieval  robotic object recognition  
Multi-Correlation Siamese Transformer Network With Dense Connection for 3D Single Object Tracking 期刊论文
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 卷号: 8, 期号: 12, 页码: 8066-8073
作者:  Feng, Shihao;  Liang, Pengpeng;  Gao, Jin;  Cheng, Erkang
Adobe PDF(2745Kb)  |  收藏  |  浏览/下载:132/10  |  提交时间:2023/12/21
3D object tracking  Point cloud  Transformer  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:169/26  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
Weakly-Supervised Video Object Grounding Via Learning Uni-Modal Associations 期刊论文
IEEE Transactions on Multimedia, 2022, 卷号: 25, 页码: 1-12
作者:  Wang, Wei;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(5406Kb)  |  收藏  |  浏览/下载:143/42  |  提交时间:2023/04/25
Visualization  Grounding  Task analysis  Prototypes  Annotations  Uncertainty  Proposals  Cross-modal retrieval  weakly-supervised learning  video object grounding  uni-modal association