CASIA OpenIR

Browse/Search results: 83 items in total, showing items 1-10

WavDepressionNet: Automatic Depression Level Prediction via Raw Speech Signals [Journal article]
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, Volume: 15, Issue: 1, Pages: 285-296
Authors: Niu, Mingyue; Tao, Jianhua; Li, Yongwei; Qin, Yong; Li, Ya
Views/Downloads: 5/0 | Submitted: 2024/07/03
Keywords: Assessment block; depression level prediction; representation block; speech signals; WavDepressionNet
Modal Contrastive Learning Based End-to-End Text Image Machine Translation [Journal article]
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, Volume: 32, Issue: 32, Pages: 2153-2165
Authors: Ma, Cong; Han, Xu; Wu, Linghui; Zhang, Yaping; Zhao, Yang; Zhou, Yu; Zong, Chengqing
Adobe PDF(6551Kb) | Views/Downloads: 29/15 | Submitted: 2024/06/26
Keywords: Transformers; Machine translation; Decoding; Semantics; Pipelines; Text recognition; Task analysis; Text image machine translation; contrastive learning; text image recognition; machine translation
An Approach for Handwritten Chinese Text Recognition Unifying Character Segmentation and Recognition [Journal article]
Pattern Recognition, 2024, Article number: 110373
Authors: MingMing Yu(于明明); Zhang H(张恒); Fei Yin(殷飞); Cheng-Lin Liu(刘成林)
Adobe PDF(5849Kb) | Views/Downloads: 43/16 | Submitted: 2024/06/24
Spiking Neural Network for Ultralow-Latency and High-Accurate Object Detection [Journal article]
IEEE Transactions on Neural Networks and Learning Systems, 2024, DOI: 10.1109/TNNLS.2024.3372613
Authors: Jinye Qu; Zeyu Gao; Tielin Zhang; Yanfeng Lu; Huajin Tang; Hong Qiao
Adobe PDF(2939Kb) | Views/Downloads: 40/17 | Submitted: 2024/06/06
Keywords: Low latency; object detection; spiking neural network (SNN); timesteps compression
Health and Senior Care Video Moment Localization With Procedure Knowledge Distillation [Conference paper]
Istanbul, Turkiye, Dec 5-8
Authors: Chaochen Wu; Meiyun Zuo; Guan Luo; Yuna Jiang
Adobe PDF(3140Kb) | Views/Downloads: 41/17 | Submitted: 2024/06/05
A Double-Hurdle Quantification Model for Freezing of Gait of Parkinson's Patients [Journal article]
IEEE Transactions on Biomedical Engineering, 2024, Pages: 1-12
Authors: Ningcun Xu; Chen Wang; Liang Peng; Xiao-Hu Zhou; Jingyao Chen; Zhi Cheng; Zeng-Guang Hou
Adobe PDF(875Kb) | Views/Downloads: 42/14 | Submitted: 2024/06/04
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis [Journal article]
IEEE Transactions on Affective Computing, 2023, Volume: 15, Issue: 1, Pages: 1-17
Authors: Licai Sun; Zheng Lian; Bin Liu; Jianhua Tao
Adobe PDF(2371Kb) | Views/Downloads: 65/18 | Submitted: 2024/05/31
Keywords: Transformers; Robustness; Semantics; Data models; Computational modeling; Videos; Training; Multimodal sentiment analysis; unaligned and incomplete data; efficient multimodal Transformer; dual-level feature restoration; robustness
HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition [Journal article]
Information Fusion, 2024, Volume: 108, Pages: 1-20
Authors: Licai Sun; Zheng Lian; Bin Liu; Jianhua Tao
Adobe PDF(2281Kb) | Views/Downloads: 53/12 | Submitted: 2024/05/31
Keywords: Audio-Visual Emotion Recognition; Self-supervised learning; Masked autoencoder; Contrastive learning
ProSyno: Context-Free Prompt Learning for Synonym Discovery [Journal article]
Frontiers of Computer Science, 2024, Pages: 1-14
Authors: Song Zhang; Lei He; Dong Wang; Hongyun Bao; Suncong Zheng; Yuqiao Liu; Baihua Xiao; Jiayue Li; Dongyuan Lu; Nan Zheng
Adobe PDF(19187Kb) | Views/Downloads: 42/7 | Submitted: 2024/05/31
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation [Journal article]
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 4, Pages: 2041-2055
Authors: Chen, Yuxin; Zhang, Ziqi; Qi, Zhongang; Yuan, Chunfeng; Wang, Jie; Shan, Ying; Li, Bing; Hu, Weiming; Qie, Xiaohu; Wu, Jianping
Adobe PDF(13765Kb) | Views/Downloads: 44/1 | Submitted: 2024/05/30
Keywords: Chinese video captioning evaluation; dual-reconstruction transformer