CASIA OpenIR

Browse/Search results: 290 items in total, showing items 1-10

TextFormer: A Query-based End-to-end Text Spotter with Mixed Supervision (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 4, Pages: 704-717
Authors: Yukun Zhai; Xiaoqiang Zhang; Xiameng Qin; Sanyuan Zhao; Xingping Dong; Jianbing Shen
Adobe PDF (2312Kb) | Views/Downloads: 15/5 | Submitted: 2024/07/18
Keywords: End-to-end text spotting; arbitrarily-shaped texts; transformer; mixed supervision; multitask modeling
Towards Domain-agnostic Depth Completion (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 4, Pages: 652-669
Authors: Guangkai Xu; Wei Yin; Jianming Zhang; Oliver Wang; Simon Niklaus; Simon Chen; Jia-Wang Bian
Adobe PDF (17196Kb) | Views/Downloads: 10/2 | Submitted: 2024/07/18
Keywords: Monocular depth estimation; depth completion; zero-shot generalization; scene reconstruction; neural network
Segment Anything Is Not Always Perfect: An Investigation of SAM on Different Real-world Applications (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 4, Pages: 617-630
Authors: Wei Ji; Jingjing Li; Qi Bi; Tingwei Liu; Wenbo Li; Li Cheng
Adobe PDF (11623Kb) | Views/Downloads: 5/2 | Submitted: 2024/07/18
Keywords: Segment anything model (SAM); visual perception; segmentation; foundational model; computer vision
Multi-Level Pixel-Wise Correspondence Learning for 6DoF Face Pose Estimation (Journal article)
IEEE Transactions on Multimedia, 2024, Pages: IEEE Xplore
Authors: Xu M (徐淼); Xiangyu Zhu; Yueying Kao; Zhiwen Chen; Jiangjing Lyu; Zhen Lei
Adobe PDF (7908Kb) | Views/Downloads: 36/4 | Submitted: 2024/07/16
Automation 5.0: The Key to Industry 5.0 (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 8, Pages: 1723-1727
Authors: Ljubo Vlacic; Hailong Huang; Mariagrazia Dotoli; Yutong Wang; Petros A. Ioannou; Lili Fan; Xingxia Wang; Raffaele Carli; Chen Lv; Lingxi Li; Xiaoxiang Na; Qing-Long Ha; Fei-Yue Wang
Adobe PDF (2574Kb) | Views/Downloads: 9/5 | Submitted: 2024/07/16
Memory-Adaptive Vision-and-Language Navigation (Journal article)
Pattern Recognition, 2024, Volume: 153, Pages: 110511
Authors: Keji He; Ya Jing; Yan Huang; Zhihe Lu; Dong An; Liang Wang
Adobe PDF (3831Kb) | Views/Downloads: 46/18 | Submitted: 2024/06/26
Keywords: Vision-and-Language Navigation; Memory bank; History noises; Memory-Adaptive Model
MRFTrans: Multimodal Representation Fusion Transformer for Monocular 3D Semantic Scene Completion (Journal article)
Information Fusion, 2024, Pages: 102493
Authors: Xu RT (许镕涛); Jiguang Zhang; Jiaxi Sun; Changwei Wang; Yifan Wu; Shibiao Xu; Weiliang Meng; Xiaopeng Zhang
Adobe PDF (3764Kb) | Views/Downloads: 36/8 | Submitted: 2024/06/24
Digital Twin Driven Measurement in Robotic Flexible Printed Circuit Assembly (Journal article)
IEEE Transactions on Instrumentation & Measurement, 2023, Volume: 72, Pages: 5007812
Authors: Yang Minghao; Huang Zhenping; Sun Yangchang; Zhao Yongjia; Sun Ruize; Sun Qi; Chen JinLong; Qiang BaoHua; Wang JingHong; Sun FuChun
Adobe PDF (39985Kb) | Views/Downloads: 34/8 | Submitted: 2024/06/24
BioDrone: A Bionic Drone-Based Single Object Tracking Benchmark for Robust Vision (Journal article)
International Journal of Computer Vision, 2024, Volume: 132, Pages: 1659-1684
Authors: Xin Zhao; Shiyu Hu; Yipei Wang; Zhang Jing; Yimin Hu; Rongshuai Liu; Haibin Ling; Yin Li; Renshu Li; Kun Liu; Jiadong Li
Adobe PDF (9076Kb) | Views/Downloads: 30/8 | Submitted: 2024/06/21
Improving diversity of speech-driven gesture generation with memory networks as dynamic dictionaries (Journal article)
CAAI Transactions on Intelligence Technology, 2024, Pages: 1-15
Authors: Zeyu Zhao; Nan Gao; Zhi Zeng; Guixuan Zhang; Jie Liu; Shuwu Zhang
Adobe PDF (2067Kb) | Views/Downloads: 53/19 | Submitted: 2024/06/20