CASIA OpenIR

Browse/Search results: 1048 items in total, showing items 1-10

Key-Part Attention Retrieval for Robotic Object Recognition (Journal article)
TSINGHUA SCIENCE AND TECHNOLOGY, 2024, Volume: 29, Issue: 3, Pages: 644-655
Authors: Liu, Jierui; Cao, Zhiqiang; Tang, Yingbo
Views/Downloads: 39/0  |  Submitted: 2024/02/22
Training  Visualization  Image recognition  Cameras  Object recognition  Convolutional neural networks  Data mining  key-part attention  retrieval  robotic object recognition  
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene (Journal article)
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, Volume: 20, Issue: 2, Pages: 19
Authors: You, Sisi; Zuo, Yukun; Yao, Hantao; Xu, Changsheng
Views/Downloads: 55/0  |  Submitted: 2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
Reparameterizing and dynamically quantizing image features for image generation (Journal article)
PATTERN RECOGNITION, 2024, Volume: 146, Pages: 11
Authors: Sun, Mingzhen; Wang, Weining; Zhu, Xinxin; Liu, Jing
Adobe PDF(3612Kb)  |  Views/Downloads: 93/8  |  Submitted: 2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Efficient Remote Sensing Image Super-Resolution via Lightweight Diffusion Models (Journal article)
IEEE Geoscience and Remote Sensing Letters, 2024, Volume: 21, Pages: 1-5
Authors: An T(安泰); Xue B(薛斌); Huo CL(霍春雷); Xiang SM(向世明); Pan CH(潘春洪)
Adobe PDF(30422Kb)  |  Views/Downloads: 98/22  |  Submitted: 2024/01/17
Remote sensing super-resolution  lightweight diffusion models  cross-attention mechanism  satellite imagery  
The Image Data and Backbone in Weakly Supervised Fine-Grained Visual Categorization: A Revisit and Further Thinking (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 1, Pages: 2-16
Authors: Ye, Shuo; Wang, Yu; Peng, Qinmu; You, Xinge; Chen, C. L. Philip
Views/Downloads: 15/0  |  Submitted: 2024/03/26
Fine-grained visual categorization  deep learning  weakly supervised learning  
CGFormer: ViT-Based Network for Identifying Computer-Generated Images With Token Labeling (Journal article)
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, Volume: 19, Pages: 235-250
Authors: Quan, Weize; Deng, Pengfei; Wang, Kai; Yan, Dong-Ming
Views/Downloads: 28/0  |  Submitted: 2024/02/22
CG image forensics  transformer  token labeling  generalization  robustness  
Attentional Composition Networks for Long-Tailed Human Action Recognition (Journal article)
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, Volume: 20, Issue: 1, Pages: 18
Authors: Wang, Haoran; Wang, Yajie; Yu, Baosheng; Zhan, Yibing; Yuan, Chunfeng; Yang, Wankou
Views/Downloads: 86/0  |  Submitted: 2023/11/15
Compositional learning  long tail  few-shot  zero-shot  action recognition  
Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network With Graph Representation Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 14
Authors: Qi, Xingqun; Sun, Muyi; Wang, Zijian; Liu, Jiaming; Li, Qi; Zhao, Fang; Zhang, Shanghang; Shan, Caifeng
Adobe PDF(6718Kb)  |  Views/Downloads: 74/29  |  Submitted: 2024/02/22
Face photo-sketch synthesis  generative adversarial network  graph representation learning  intraclass and interclass  iterative cycle training (ICT)  
RTDOD: A large-scale RGB-thermal domain-incremental object detection dataset for UAVs (Journal article)
IMAGE AND VISION COMPUTING, 2023, Volume: 140, Pages: 9
Authors: Feng, Hangtao; Zhang, Lu; Zhang, Siqi; Wang, Dong; Yang, Xu; Liu, Zhiyong
Views/Downloads: 50/0  |  Submitted: 2024/02/22
Domain-incremental object detection  Dataset  RGB-T dataset  Object detection dataset  UAVs dataset  Object detection  
Multi-modal spatial relational attention networks for visual question answering (Journal article)
IMAGE AND VISION COMPUTING, 2023, Volume: 140, Pages: 13
Authors: Yao, Haibo; Wang, Lipeng; Cai, Chengtao; Sun, Yuxin; Zhang, Zhi; Luo, Yongkang
Views/Downloads: 45/0  |  Submitted: 2024/02/22
Visual question answering  Spatial relation  Attention mechanism  Pre-training strategy