CASIA OpenIR

浏览/检索结果: 共31条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Spatial reconstructed local attention Res2Net with F0 subband for fake speech detection 期刊论文
NEURAL NETWORKS, 2024, 卷号: 175, 页码: 11
作者:  Fan, Cunhang;  Xue, Jun;  Tao, Jianhua;  Yi, Jiangyan;  Wang, Chenglong;  Zheng, Chengshi;  Lv, Zhao
收藏  |  浏览/下载:4/0  |  提交时间:2024/07/04
ASVspoof  Fake speech detection  Fundamental frequency  Res2Net  
HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition 期刊论文
Information Fusion, 2024, 卷号: 108, 页码: 1-20
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2281Kb)  |  收藏  |  浏览/下载:44/10  |  提交时间:2024/05/31
Audio-Visual Emotion Recognition  Self-supervised learning  Masked autoencoder  Contrastive learning  
Multi-task safe reinforcement learning for navigating intersections in dense traffic 期刊论文
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 卷号: 360, 期号: 17, 页码: 13737-13760
作者:  Liu, Yuqi;  Gao, Yinfeng;  Zhang, Qichao;  Ding, Dawei;  Zhao, Dongbin
Adobe PDF(3095Kb)  |  收藏  |  浏览/下载:74/13  |  提交时间:2024/02/22
Transformer-based stroke relation encoding for online handwriting and sketches 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 148, 页码: 13
作者:  Liu, Jing-Yu;  Zhang, Yan-Ming;  Yin, Fei;  Liu, Cheng-Lin
Adobe PDF(1999Kb)  |  收藏  |  浏览/下载:91/3  |  提交时间:2024/02/22
Online stroke classification  Handwritten document analysis  Diagram recognition  Sketch semantic segmentation  Position encoding in transformer  
Hierarchical graph attention network for temporal knowledge graph reasoning 期刊论文
NEUROCOMPUTING, 2023, 卷号: 550, 页码: 126390
作者:  Shao, Pengpeng;  He, Jiayi;  Li, Guanjun;  Zhang, Dawei;  Tao, Jianhua
Adobe PDF(589Kb)  |  收藏  |  浏览/下载:171/13  |  提交时间:2023/11/17
Temporal knowledge graphs  Graph attention network  Reasoning  
GCNet: Graph Completion Network for Incomplete Multimodal Learning in Conversation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 7, 页码: 8419-8432
作者:  Lian, Zheng;  Chen, Lan;  Sun, Licai;  Liu, Bin;  Tao, Jianhua
Adobe PDF(3959Kb)  |  收藏  |  浏览/下载:171/6  |  提交时间:2023/11/17
Oral communication  Correlation  Data models  Task analysis  Feature extraction  Tensors  Benchmark testing  Conversational data  graph complete network (GCNet)  incomplete multimodal learning  speaker-sensitive modeling  temporal-sensitive modeling  
DyGAT: Dynamic stroke classification of online handwritten documents and sketches 期刊论文
PATTERN RECOGNITION, 2023, 卷号: 141, 页码: 12
作者:  Yang, Yu-Ting;  Zhang, Yan-Ming;  Yun, Xiao-Long;  Yin, Fei;  Liu, Cheng-Lin
Adobe PDF(3180Kb)  |  收藏  |  浏览/下载:152/2  |  提交时间:2023/11/17
Stroke classification  Sketch semantic segmentation  Document layout analysis  Diagram recognition  Streaming recognition  
SMIN: Semi-Supervised Multi-Modal Interaction Network for Conversational Emotion Recognition 期刊论文
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 卷号: 14, 期号: 3, 页码: 2415-2429
作者:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2103Kb)  |  收藏  |  浏览/下载:148/9  |  提交时间:2023/11/15
Emotion recognition  Feature extraction  Training  Acoustics  Semisupervised learning  Benchmark testing  Hidden Markov models  Semi-supervised multi-modal interaction network (SMIN)  conversational emotion recognition  semi-supervised learning  intra-modal interaction  cross-modal interaction  
Table Structure Recognition and Form Parsing by End-to-End Object Detection and Relation Parsing 期刊论文
PATTERN RECOGNITION, 2022, 卷号: 132, 页码: 14
作者:  Li, Xiao-Hui;  Yin, Fei;  Dai, He-Sen;  Liu, Cheng-Lin
Adobe PDF(4030Kb)  |  收藏  |  浏览/下载:277/5  |  提交时间:2022/11/14
Table detection  Table structure recognition  Template -free form parsing  Graph neural network  End -to -end training  
Frequency Feature Pyramid Network With Global-Local Consistency Loss for Crowd-and-Vehicle Counting in Congested Scenes 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 卷号: 23, 期号: 7, 页码: 9654-9664
作者:  Yu, Xiaoyuan;  Liang, Yanyan;  Lin, Xuxin;  Wan, Jun;  Wang, Tian;  Dai, Hong-Ning
收藏  |  浏览/下载:223/0  |  提交时间:2022/11/14
Context prediction  frequency feature pyramid  discrete cosine transformation  global-local consistency loss