CASIA OpenIR

Browse/Search Results:  1-10 of 111

Transformer-Based Geometric Primitive Detection and Analysis (基于Transformer的几何基元检测与分析) (Thesis)
2024
Authors:  周威
Adobe PDF (10295Kb)  |  View/Download: 15/0  |  Submit date: 2024/07/21
Primitive detection  relation analysis  keypoints  Transformer
TextFormer: A Query-based End-to-end Text Spotter with Mixed Supervision (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 4, Pages: 704-717
Authors:  Yukun Zhai;  Xiaoqiang Zhang;  Xiameng Qin;  Sanyuan Zhao;  Xingping Dong;  Jianbing Shen
Adobe PDF (2312Kb)  |  View/Download: 14/5  |  Submit date: 2024/07/18
End-to-end text spotting  arbitrarily-shaped texts  transformer  mixed supervision  multitask modeling  
Vision Transformers with Hierarchical Attention (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 4, Pages: 670-683
Authors:  Yun Liu;  Yu-Huan Wu;  Guolei Sun;  Le Zhang;  Ajad Chhatkuli;  Luc Van Gool
Adobe PDF (1358Kb)  |  View/Download: 19/6  |  Submit date: 2024/07/18
Vision transformer  hierarchical attention  global attention  local attention  scene understanding  
Rethinking Global Context in Crowd Counting (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 4, Pages: 640-651
Authors:  Guolei Sun;  Yun Liu;  Thomas Probst;  Danda Pani Paudel;  Nikola Popovic;  Luc Van Gool
Adobe PDF (2388Kb)  |  View/Download: 11/4  |  Submit date: 2024/07/18
Crowd counting  vision transformer  global context  attention  density map  
ChatGPT-Based Scenario Engineer: A New Framework on Scenario Generation for Trajectory Prediction (Journal article)
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, Volume: 9, Issue: 3, Pages: 4422-4431
Authors:  Li, Xuan;  Liu, Enlu;  Shen, Tianyu;  Huang, Jun;  Wang, Fei-Yue
View/Download: 17/0  |  Submit date: 2024/07/03
Parallel driving  scenarios engineering  foundation model  vehicle operating system  generative pre-trained transformer  trajectory prediction  
Image captioning: Semantic selection unit with stacked residual attention (Journal article)
IMAGE AND VISION COMPUTING, 2024, Volume: 144, Pages: 12
Authors:  Song, Lifei;  Li, Fei;  Wang, Ying;  Liu, Yu;  Wang, Yuanhua;  Xiang, Shiming
View/Download: 8/0  |  Submit date: 2024/07/03
Image captioning  Semantic attributes  Semantic selection unit  Transformer  Stacked residual attention  
Completed Part Transformer for Person Re-Identification (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 2303-2313
Authors:  Zhang, Zhong;  He, Di;  Liu, Shuang;  Xiao, Baihua;  Durrani, Tariq S.
View/Download: 11/0  |  Submit date: 2024/07/03
Person ReID  transformer  adaptive refined tokens  
Efficient Stereo Matching Using Swin Transformer and Multilevel Feature Consistency in Autonomous Mobile Systems (Journal article)
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, Pages: 9
Authors:  Su, Xiaojie;  Liu, Shimin;  Li, Rui;  Bing, Zhenshan;  Knoll, Alois
View/Download: 7/0  |  Submit date: 2024/07/03
Costs  Feature extraction  Transformers  Computational modeling  Task analysis  Image reconstruction  Unsupervised learning  Disparity estimation  feature consistency  stereo matching  transformer  
GFFNet: Global Feature Fusion Network for Semantic Segmentation of Large-Scale Remote Sensing Images (Journal article)
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, Volume: 17, Pages: 4222-4234
Authors:  Cao, Yong;  Huo, Chunlei;  Xiang, Shiming;  Pan, Chunhong
View/Download: 19/0  |  Submit date: 2024/07/03
Cross feature fusion (CFF)  global context learning  group transformer  semantic segmentation  
Mask2Former with Improved Query for Semantic Segmentation in Remote-Sensing Images (Journal article)
MATHEMATICS, 2024, Volume: 12, Issue: 5, Pages: 24
Authors:  Guo, Shichen;  Yang, Qi;  Xiang, Shiming;  Wang, Shuwen;  Wang, Xuezhi
View/Download: 8/0  |  Submit date: 2024/07/03
semantic segmentation  remote-sensing image  transformer  Mask2Former  query