CASIA OpenIR

浏览/检索结果: 共1467条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Towards Prior Gap and Representation Gap for Long-tailed Recognition, Pattern Recognition 期刊论文
Pattern Recognition, 2023, 卷号: 133, 期号: 109012, 页码: 109012
作者:  Zhang Ming-Liang;  Zhang Xu-Yao;  Wang Chang;  Liu Cheng-Lin
Adobe PDF(2258Kb)  |  收藏  |  浏览/下载:72/14  |  提交时间:2024/04/03
Long-tailed learning  Prior gap  Representation gap  Image recognition  
Deep convolutional neural network based on self-distillation for tool wear recognition 期刊论文
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 卷号: 132, 页码: 12
作者:  Pan, Yi;  Hao, Ling;  He, Jianliang;  Ding, Kun;  Yu, Qiang;  Wang, Yulin
收藏  |  浏览/下载:28/0  |  提交时间:2024/03/27
Tool fault diagnosis  Mobile inspection robots  Self-distillation  Industry 4.0  Deep learning  
The Image Data and Backbone in Weakly Supervised Fine-Grained Visual Categorization: A Revisit and Further Thinking 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 1, 页码: 2-16
作者:  Ye, Shuo;  Wang, Yu;  Peng, Qinmu;  You, Xinge;  Chen, C. L. Philip
收藏  |  浏览/下载:29/0  |  提交时间:2024/03/26
Fine-grained visual categorization  deep learning  weakly supervised learning  
Scene text recognition via dual character counting-aware visual and semantic modeling network 期刊论文
SCIENCE CHINA-INFORMATION SCIENCES, 2024, 卷号: 67, 期号: 3, 页码: 2
作者:  Xiao, Ke;  Zhu, Anna;  Iwana, Brian Kenji;  Liu, Cheng-Lin
收藏  |  浏览/下载:59/0  |  提交时间:2024/03/13
VQAPT: A New visual question answering model for personality traits in social media images 期刊论文
PATTERN RECOGNITION LETTERS, 2023, 卷号: 175, 页码: 66-73
作者:  Biswas, Kunal;  Shivakumara, Palaiahnakote;  Pal, Umapada;  Liu, Cheng-Lin;  Lu, Yue
收藏  |  浏览/下载:42/0  |  提交时间:2024/02/22
Personality trait images  Multimodal concept  Text recognition  Social media images  Natural language processing  Visual question answering  
A New Lightweight Script Independent Scene Text Style Transfer Network 期刊论文
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 页码: 29
作者:  Shivakumara, Palaiahnakote;  Roy, Ayush;  Nandanwar, Lokesh;  Pal, Umapada;  Lu, Yue;  Liu, Cheng-Lin
收藏  |  浏览/下载:31/0  |  提交时间:2024/02/22
Text detection  style transfer  CNN models  multi-lingual transfer  
Why You Cannot Rank First: Modifications for Benchmarking Six-Degree-of-Freedom Visual Localization Algorithms 期刊论文
SENSORS, 2023, 卷号: 23, 期号: 23, 页码: 18
作者:  Han, Sheng;  Gao, Wei;  Hu, Zhanyi
收藏  |  浏览/下载:32/0  |  提交时间:2024/02/22
visual localization  benchmark enhancement  pose compensation  sequential interpolation  ties resolution  
Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 卷号: 32, 页码: 151-163
作者:  Xu, Jiaming;  Cui, Jian;  Hao, Yunzhe;  Xu, Bo
收藏  |  浏览/下载:51/0  |  提交时间:2024/02/22
Cocktail party problem  target speaker separation  multi-cue guided separation  semi-supervised learning  
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 9
作者:  Yi, Guofeng;  Fan, Cunhang;  Zhu, Kang;  Lv, Zhao;  Liang, Shan;  Wen, Zhengqi;  Pei, Guanxiong;  Li, Taihao;  Tao, Jianhua
收藏  |  浏览/下载:73/0  |  提交时间:2024/02/22
Multimodal sentiment analysis  Vision-language  Multimodal fusion  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:43/13  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation