CASIA OpenIR

浏览/检索结果: 共178条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Depth-Guided Vision Transformer With Normalizing Flows for Monocular 3D Object Detection 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 3, 页码: 673-689
作者:  Cong Pan;  Junran Peng;  Zhaoxiang Zhang
Adobe PDF(37784Kb)  |  收藏  |  浏览/下载:49/20  |  提交时间:2024/02/19
Monocular 3D object detection  normalizing flows  Swin Transformer  
Visual Semantic Segmentation Based on Few/Zero-Shot Learning: An Overview 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 5, 页码: 1106-1126
作者:  Wenqi Ren;  Yang Tang;  Qiyu Sun;  Chaoqiang Zhao;  Qing-Long Han
Adobe PDF(12695Kb)  |  收藏  |  浏览/下载:11/2  |  提交时间:2024/04/10
Computer vision  deep learning  few-shot learning  low-shot learning  semantic segmentation  zero-shot learning  
End-to-End Paired Ambisonic-Binaural Audio Rendering 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 2, 页码: 502-513
作者:  Yin Zhu;  Qiuqiang Kong;  Junjie Shi;  Shilei Liu;  Xuzhou Ye;  Ju-Chiang Wang;  Hongming Shan;  Junping Zhang
Adobe PDF(9612Kb)  |  收藏  |  浏览/下载:39/12  |  提交时间:2024/01/23
Ambisonic  attention  binaural rendering  neural network  
Comprehensive Relation Modelling for Image Paragraph Generation 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 369-382
作者:  Xianglu Zhu;  Zhang Zhang;  Wei Wang;  Zilei Wang
Adobe PDF(1963Kb)  |  收藏  |  浏览/下载:4/2  |  提交时间:2024/04/23
Image paragraph generation, visual relationship, scene graph, graph convolutional network (GCN), long short-term memory  
DCAT: Dual Cross-Attention-Based Transformer for Change Detection 期刊论文
Remote Sensing, 2023, 卷号: 15, 期号: 9, 页码: 2395
作者:  Yuan Zhou;  Chunlei Huo;  Jiahang Zhu;  Leigang Huo;  Chunhong Pan
Adobe PDF(47919Kb)  |  收藏  |  浏览/下载:129/18  |  提交时间:2023/06/16
change detection  transformer  dual cross-attention  remote sensing  
Coarse-to-Fine Video Instance Segmentation With Factorized Conditional Appearance Flows 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 5, 页码: 1192-1208
作者:  Zheyun Qin;  Xiankai Lu;  Xiushan Nie;  Dongfang Liu;  Yilong Yin;  Wenguan Wang
Adobe PDF(42794Kb)  |  收藏  |  浏览/下载:79/23  |  提交时间:2023/04/26
Embedding learning  generative model  normalizing flows  video instance segmentation (VIS)  
Axial Assembled Correspondence Network for Few-Shot Semantic Segmentation 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 3, 页码: 711-721
作者:  Yu Liu;  Bin Jiang;  Jiaming Xu
Adobe PDF(3290Kb)  |  收藏  |  浏览/下载:156/36  |  提交时间:2023/03/02
Artificial intelligence  computer vision  deep convolutional neural network  few-shot semantic segmentation  
Parallel Learning: Overview and Perspective for Computational Learning Across Syn2Real and Sim2Real 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 3, 页码: 603-631
作者:  Qinghai Miao;  Yisheng Lv;  Min Huang;  Xiao Wang;  Fei-Yue Wang
Adobe PDF(11937Kb)  |  收藏  |  浏览/下载:802/144  |  提交时间:2023/03/02
Machine learning  parallel learning  parallel systems  sim-to-real  syn-to-real  virtual-to-real  
Auditory Feature Driven Model Predictive Control for Sound Source Approaching 期刊论文
International Journal of Control, Automation, and Systems, 2023, 卷号: 22, 期号: 2, 页码: 1-14
作者:  Wang, Zhiqing;  Zou, Wei;  Zhang, Wei;  Ma, Hongxuan;  Zhang, Chi;  Guo, Yuxin
Adobe PDF(7966Kb)  |  收藏  |  浏览/下载:169/47  |  提交时间:2023/06/20
Source approaching control, interaural time difference, robotic audition, sound source localization.  
Skeleton-aware Implicit Function for Single-view Human Reconstruction 期刊论文
CAAI Transactions on Intelligence Technology, 2023, 页码: 379-389
作者:  Pengpeng Liu;  Guixuan Zhang;  Shuwu Zhang;  Yuanhao Li;  Zhi Zeng
Adobe PDF(1470Kb)  |  收藏  |  浏览/下载:111/29  |  提交时间:2024/01/12
Body Pose, Human Reconstruction, Implicit Function, Parametric Body Model, Single-view