CASIA OpenIR

浏览/检索结果: 共10条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
The Devil is in Details: Delving Into Lite FFN Design for Vision Transformers 会议论文
, Seoul, Korea, 2024-4-14
作者:  Chen, Zhiyang;  Zhu, Yousong;  Li, Zhaowen;  Yang, Fan;  Zhao, Chaoyang;  Wang, Jinqiao;  Tang, Ming
Adobe PDF(407Kb)  |  收藏  |  浏览/下载:3/2  |  提交时间:2024/05/28
Vision Transformer  Light-Weight Structure  Feed-Forward Networks  
Masked Vision-language Transformer in Fashion 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 421-434
作者:  Ge-Peng Ji;  Mingchen Zhuge;  Dehong Gao;  Deng-Ping Fan;  Christos Sakaridis;  Luc Van Gool
Adobe PDF(2779Kb)  |  收藏  |  浏览/下载:7/2  |  提交时间:2024/04/23
Vision-language, masked image reconstruction, transformer, fashion, e-commercial  
Exploring the Brain-like Properties of Deep Neural Networks: A Neural Encoding Perspective 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 5, 页码: 439-455
作者:  Qiongyi Zhou;  Changde Du;  Huiguang He
Adobe PDF(7698Kb)  |  收藏  |  浏览/下载:23/5  |  提交时间:2024/04/23
Convolutional neural network (CNN)  vision transformer (ViT)  multi-modal networks  spatial-temporal networks  visual neural encoding  brain-like neural networks  
A Weakly-Supervised Crowd Density Estimation Method Based on Two-Stage Linear Feature Calibration 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 4, 页码: 965-981
作者:  Yong-Chao Li;  Rui-Sheng Jia;  Ying-Xiang Hu;  Hong-Mei Sun
Adobe PDF(10448Kb)  |  收藏  |  浏览/下载:35/16  |  提交时间:2024/03/18
Crowd density estimation  linear feature calibration  vision transformer  weakly-supervision learning  
FM-ViT: Flexible Modal Vision Transformers for Face Anti-Spoofing 期刊论文
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 卷号: 18, 页码: 4775-4786
作者:  Liu, Ajian;  Tan, Zichang;  Yu, Zitong;  Zhao, Chenxu;  Wan, Jun;  Liang, Yanyan;  Lei, Zhen;  Zhang, Du;  Li, Stan Z.;  Guo, Guodong
收藏  |  浏览/下载:132/0  |  提交时间:2023/11/17
Face anti-spoofing  flexible-modal testing  vision transformer  mutual-attention  fusion-attention  
HTCViT: an effective network for image classification and segmentation based on natural disaster datasets 期刊论文
VISUAL COMPUTER, 2023, 页码: 13
作者:  Ma, Zhihao;  Li, Wei;  Zhang, Muyang;  Meng, Weiliang;  Xu, Shibiao;  Zhang, Xiaopeng
收藏  |  浏览/下载:98/0  |  提交时间:2023/11/17
Natural disaster image analysis  Vision transformer  Convolution  Hierarchical  
A Closer Look at Self-Supervised Lightweight Vision Transformers 会议论文
, Honolulu, Hawaii, USA, 2023-7
作者:  Wang, Shaoru;  Gao, Jin;  Li, Zeming;  Zhang, Xiaoqin;  Weiming, Hu
Adobe PDF(3478Kb)  |  收藏  |  浏览/下载:208/66  |  提交时间:2023/09/20
Vision Transformer  Self-supervised Learning  Lightweight Networks  Knowledge Distillation  
BViT: Broad Attention-Based Vision Transformer 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2023, 页码: 1 - 12
作者:  Nannan Li;  Yaran Chen;  Weifan Li;  Zixiang Ding;  Dongbin Zhao;  Shuai Nie
Adobe PDF(2171Kb)  |  收藏  |  浏览/下载:197/52  |  提交时间:2023/06/27
Broad attention  broad connection  image classification  parameter-free attention  vision transformer  
Computational knowledge vision: paradigmatic knowledge based prescriptive learning and reasoning for perception and vision 期刊论文
ARTIFICIAL INTELLIGENCE REVIEW, 2022, 页码: 36
作者:  Zheng, Wenbo;  Yan, Lan;  Gou, Chao;  Wang, Fei-Yue
收藏  |  浏览/下载:184/0  |  提交时间:2022/06/06
Computer vision  Knowledge engineering  Deep learning  Graph learning  Meta-learning  Transformer  Artificial intelligence (AI)  
Transformers in computational visual media: A survey 期刊论文
Computational Visual Media, 2021, 卷号: 8, 期号: 1, 页码: 33-62
作者:  Xu,Yifan;  Wei,Huapeng;  Lin,Minxuan;  Deng,Yingying;  Sheng,Kekai;  Zhang,Mengdan;  Tang,Fan;  Dong,Weiming;  Huang,Feiyue;  Xu,Changsheng
Adobe PDF(5366Kb)  |  收藏  |  浏览/下载:284/36  |  提交时间:2021/12/28
visual transformer  computational visual media (CVM)  high-level vision  low-level vision  image generation  multi-modal learning