CASIA OpenIR

浏览/检索结果: 共38条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Transformer-based stroke relation encoding for online handwriting and sketches 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 148, 页码: 13
作者:  Liu, Jing-Yu;  Zhang, Yan-Ming;  Yin, Fei;  Liu, Cheng-Lin
收藏  |  浏览/下载:32/0  |  提交时间:2024/02/22
Online stroke classification  Handwritten document analysis  Diagram recognition  Sketch semantic segmentation  Position encoding in transformer  
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 9
作者:  Yi, Guofeng;  Fan, Cunhang;  Zhu, Kang;  Lv, Zhao;  Liang, Shan;  Wen, Zhengqi;  Pei, Guanxiong;  Li, Taihao;  Tao, Jianhua
收藏  |  浏览/下载:45/0  |  提交时间:2024/02/22
Multimodal sentiment analysis  Vision-language  Multimodal fusion  
ICaps-ResLSTM: Improved capsule network and residual LSTM for EEG emotion recognition 期刊论文
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 卷号: 87, 页码: 9
作者:  Fan, Cunhang;  Xie, Heng;  Tao, Jianhua;  Li, Yongwei;  Pei, Guanxiong;  Li, Taihao;  Lv, Zhao
收藏  |  浏览/下载:77/0  |  提交时间:2023/11/15
Electroencephalogram  Emotion recognition  Capsule network  Residual Long-Short Term Memory  
Deep representation learning for domain generalization with information bottleneck principle 期刊论文
PATTERN RECOGNITION, 2023, 卷号: 143, 页码: 12
作者:  Zhang, Jiao;  Zhang, Xu-Yao;  Wang, Chuang;  Liu, Cheng-Lin
收藏  |  浏览/下载:72/0  |  提交时间:2023/11/17
Domain generalization  Information bottleneck  Representation learning  
Subband fusion of complex spectrogram for fake speech detection 期刊论文
SPEECH COMMUNICATION, 2023, 卷号: 155, 页码: 8
作者:  Fan, Cunhang;  Xue, Jun;  Dong, Shunbo;  Ding, Mingming;  Yi, Jiangyan;  Li, Jinpeng;  Lv, Zhao
收藏  |  浏览/下载:4/0  |  提交时间:2024/03/26
Automatic speaker verification  Complex spectrogram  Fake speech detection  Phase information  Subband  
DyGAT: Dynamic stroke classification of online handwritten documents and sketches 期刊论文
PATTERN RECOGNITION, 2023, 卷号: 141, 页码: 12
作者:  Yang, Yu-Ting;  Zhang, Yan-Ming;  Yun, Xiao-Long;  Yin, Fei;  Liu, Cheng-Lin
收藏  |  浏览/下载:70/0  |  提交时间:2023/11/17
Stroke classification  Sketch semantic segmentation  Document layout analysis  Diagram recognition  Streaming recognition  
Two-stage deep spectrum fusion for noise-robust end-to-end speech recognition 期刊论文
APPLIED ACOUSTICS, 2023, 卷号: 212, 页码: 10
作者:  Fan, Cunhang;  Ding, Mingming;  Yi, Jiangyan;  Li, Jinpeng;  Lv, Zhao
收藏  |  浏览/下载:24/0  |  提交时间:2023/11/16
Robust end-to-end ASR  Speech enhancement  Masking and mapping  Speech distortion  Deep spectrum fusion  
Adversarial training with distribution normalization and margin balance 期刊论文
PATTERN RECOGNITION, 2023, 卷号: 136, 页码: 11
作者:  Cheng, Zhen;  Zhu, Fei;  Zhang, Xu-Yao;  Liu, Cheng-Lin
收藏  |  浏览/下载:201/0  |  提交时间:2023/01/09
Adversarial robustness  Adversarial training  Distribution normalization  Margin balance  
Towards prior gap and representation gap for long-tailed recognition 期刊论文
PATTERN RECOGNITION, 2023, 卷号: 133, 页码: 12
作者:  Zhang, Ming-Liang;  Zhang, Xu-Yao;  Wang, Chuang;  Liu, Cheng-Lin
收藏  |  浏览/下载:236/0  |  提交时间:2022/11/14
Long-tailed learning  Prior gap  Representation gap  Image recognition  
Table Structure Recognition and Form Parsing by End-to-End Object Detection and Relation Parsing 期刊论文
PATTERN RECOGNITION, 2022, 卷号: 132, 页码: 14
作者:  Li, Xiao-Hui;  Yin, Fei;  Dai, He-Sen;  Liu, Cheng-Lin
收藏  |  浏览/下载:208/0  |  提交时间:2022/11/14
Table detection  Table structure recognition  Template -free form parsing  Graph neural network  End -to -end training