CASIA OpenIR

浏览/检索结果: 共12条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Vision Transformers with Hierarchical Attention 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 670-683
作者:  Yun Liu;   Yu-Huan Wu;   Guolei Sun;    Le Zhang;  Ajad Chhatkuli;   Luc Van Gool
Adobe PDF(1358Kb)  |  收藏  |  浏览/下载:6/3  |  提交时间:2024/07/18
Vision transformer  hierarchical attention  global attention  local attention  scene understanding  
Rethinking Global Context in Crowd Counting 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 640-651
作者:  Guolei Sun;   Yun Liu;   Thomas Probst;   Danda Pani Paudel;  Nikola Popovic;   Luc Van Gool
Adobe PDF(2388Kb)  |  收藏  |  浏览/下载:6/3  |  提交时间:2024/07/18
Crowd counting  vision transformer  global context  attention  density map  
The Devil is in Details: Delving Into Lite FFN Design for Vision Transformers 会议论文
, Seoul, Korea, 2024-4-14
作者:  Chen, Zhiyang;  Zhu, Yousong;  Li, Zhaowen;  Yang, Fan;  Zhao, Chaoyang;  Wang, Jinqiao;  Tang, Ming
Adobe PDF(407Kb)  |  收藏  |  浏览/下载:56/16  |  提交时间:2024/05/28
Vision Transformer  Light-Weight Structure  Feed-Forward Networks  
Masked Vision-language Transformer in Fashion 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 421-434
作者:  Ge-Peng Ji;  Mingchen Zhuge;  Dehong Gao;  Deng-Ping Fan;  Christos Sakaridis;  Luc Van Gool
Adobe PDF(2779Kb)  |  收藏  |  浏览/下载:25/8  |  提交时间:2024/04/23
Vision-language, masked image reconstruction, transformer, fashion, e-commercial  
Exploring the Brain-like Properties of Deep Neural Networks: A Neural Encoding Perspective 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 5, 页码: 439-455
作者:  Qiongyi Zhou;  Changde Du;  Huiguang He
Adobe PDF(7698Kb)  |  收藏  |  浏览/下载:55/12  |  提交时间:2024/04/23
Convolutional neural network (CNN)  vision transformer (ViT)  multi-modal networks  spatial-temporal networks  visual neural encoding  brain-like neural networks  
A Weakly-Supervised Crowd Density Estimation Method Based on Two-Stage Linear Feature Calibration 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 4, 页码: 965-981
作者:  Yong-Chao Li;  Rui-Sheng Jia;  Ying-Xiang Hu;  Hong-Mei Sun
Adobe PDF(10448Kb)  |  收藏  |  浏览/下载:72/31  |  提交时间:2024/03/18
Crowd density estimation  linear feature calibration  vision transformer  weakly-supervision learning  
FM-ViT: Flexible Modal Vision Transformers for Face Anti-Spoofing 期刊论文
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 卷号: 18, 页码: 4775-4786
作者:  Liu, Ajian;  Tan, Zichang;  Yu, Zitong;  Zhao, Chenxu;  Wan, Jun;  Liang, Yanyan;  Lei, Zhen;  Zhang, Du;  Li, Stan Z.;  Guo, Guodong
Adobe PDF(10966Kb)  |  收藏  |  浏览/下载:178/1  |  提交时间:2023/11/17
Face anti-spoofing  flexible-modal testing  vision transformer  mutual-attention  fusion-attention  
HTCViT: an effective network for image classification and segmentation based on natural disaster datasets 期刊论文
VISUAL COMPUTER, 2023, 页码: 13
作者:  Ma, Zhihao;  Li, Wei;  Zhang, Muyang;  Meng, Weiliang;  Xu, Shibiao;  Zhang, Xiaopeng
Adobe PDF(8610Kb)  |  收藏  |  浏览/下载:138/8  |  提交时间:2023/11/17
Natural disaster image analysis  Vision transformer  Convolution  Hierarchical  
A Closer Look at Self-Supervised Lightweight Vision Transformers 会议论文
, Honolulu, Hawaii, USA, 2023-7
作者:  Wang, Shaoru;  Gao, Jin;  Li, Zeming;  Zhang, Xiaoqin;  Weiming, Hu
Adobe PDF(3478Kb)  |  收藏  |  浏览/下载:259/74  |  提交时间:2023/09/20
Vision Transformer  Self-supervised Learning  Lightweight Networks  Knowledge Distillation  
BViT: Broad Attention-Based Vision Transformer 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2023, 页码: 1 - 12
作者:  Nannan Li;  Yaran Chen;  Weifan Li;  Zixiang Ding;  Dongbin Zhao;  Shuai Nie
Adobe PDF(2171Kb)  |  收藏  |  浏览/下载:232/63  |  提交时间:2023/06/27
Broad attention  broad connection  image classification  parameter-free attention  vision transformer