CASIA OpenIR

浏览/检索结果: 共19条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Diff-pcg: diffusion point cloud generation conditioned on continuous normalizing flow 期刊论文
The Visual Computer, 2024, 页码: 1-15
作者:  Yu T(余挺);  Meng WL(孟维亮);  Wu ZQ(吴仲琦);  Guo JW(郭建伟);  Zhang XP(张晓鹏)
Adobe PDF(2471Kb)  |  收藏  |  浏览/下载:37/10  |  提交时间:2024/06/11
3D shape generation  Diffusion model  Continuous normalizing flow  Point cloud  
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 4, 页码: 2041-2055
作者:  Chen, Yuxin;  Zhang, Ziqi;  Qi, Zhongang;  Yuan, Chunfeng;  Wang, Jie;  Shan, Ying;  Li, Bing;  Hu, Weiming;  Qie, Xiaohu;  Wu, Jianping
Adobe PDF(13765Kb)  |  收藏  |  浏览/下载:45/1  |  提交时间:2024/05/30
Chinese video captioning evaluation  dual-reconstruction transformer  
Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 期号: 0, 页码: 1045 - 1058
作者:  Changwei Wang;  Rongtao Xu;  Shibiao Xu;  Weiliang Meng;  Ruisheng Wang;  Xiaopeng Zhang
Adobe PDF(3269Kb)  |  收藏  |  浏览/下载:68/23  |  提交时间:2024/05/29
Weakly supervised object localization  intrinsic discrimination and consistency  deep metric learning  geometric transformation consistency  
Key-Part Attention Retrieval for Robotic Object Recognition 期刊论文
TSINGHUA SCIENCE AND TECHNOLOGY, 2024, 卷号: 29, 期号: 3, 页码: 644-655
作者:  Liu, Jierui;  Cao, Zhiqiang;  Tang, Yingbo
Adobe PDF(2164Kb)  |  收藏  |  浏览/下载:117/17  |  提交时间:2024/02/22
Training  Visualization  Image recognition  Cameras  Object recognition  Convolutional neural networks  Data mining  key-part attention  retrieval  robotic object recognition  
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 2, 页码: 19
作者:  You, Sisi;  Zuo, Yukun;  Yao, Hantao;  Xu, Changsheng
收藏  |  浏览/下载:98/0  |  提交时间:2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
Multi-Correlation Siamese Transformer Network With Dense Connection for 3D Single Object Tracking 期刊论文
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 卷号: 8, 期号: 12, 页码: 8066-8073
作者:  Feng, Shihao;  Liang, Pengpeng;  Gao, Jin;  Cheng, Erkang
Adobe PDF(2745Kb)  |  收藏  |  浏览/下载:130/9  |  提交时间:2023/12/21
3D object tracking  Point cloud  Transformer  
Reducing Vision-Answer Biases for Multiple-Choice VQA 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4621-4634
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(2684Kb)  |  收藏  |  浏览/下载:94/7  |  提交时间:2023/11/17
Multiple-choice VQA  vision-answer bias  causal intervention  counterfactual interaction learning  
Multi-View Multi-Label Fine-Grained Emotion Decoding From Human Brain Activity 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-15
作者:  Fu, Kaicheng;  Du, Changde;  Wang, Shengpei;  He, Huiguang
Adobe PDF(4570Kb)  |  收藏  |  浏览/下载:325/76  |  提交时间:2022/12/27
Fine-grained Emotion Decoding  Multi-view Learning  Multi-label Learning  Variational Autoencoder  Product of Experts  
Intraoperative Glioma Grading Using Neural Architecture Search and Multi-Modal Imaging 期刊论文
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 卷号: 41, 期号: 10, 页码: 2570-2581
作者:  Xiao, Anqi;  Shen, Biluo;  Shi, Xiaojing;  Zhang, Zhe;  Zhang, Zeyu;  Tian, Jie;  Ji, Nan;  Hu, Zhenhua
收藏  |  浏览/下载:260/0  |  提交时间:2022/11/14
Imaging  Computer architecture  Fluorescence  Feature extraction  Surgery  Biomedical imaging  Medical diagnostic imaging  Deep learning  glioma grading  intraoperative imaging  multi-modal imaging  neural architecture search  NIR-II fluorescence imaging  
Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2986-2997
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(5681Kb)  |  收藏  |  浏览/下载:421/6  |  提交时间:2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability