CASIA OpenIR

Browse/Search results: 145 items in total, showing items 1-10

WavDepressionNet: Automatic Depression Level Prediction via Raw Speech Signals  Journal article
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, Volume: 15, Issue: 1, Pages: 285-296
Authors:  Niu, Mingyue;  Tao, Jianhua;  Li, Yongwei;  Qin, Yong;  Li, Ya
Views/Downloads: 5/0  |  Submitted: 2024/07/03
Assessment block  depression level prediction  representation block  speech signals  WavDepressionNet  
Modal Contrastive Learning Based End-to-End Text Image Machine Translation  Journal article
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, Volume: 32, Issue: 32, Pages: 2153-2165
Authors:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  Views/Downloads: 27/14  |  Submitted: 2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
An Approach for Handwritten Chinese Text Recognition Unifying Character Segmentation and Recognition  Journal article
Pattern Recognition, 2024, Pages: 110373
Authors:  MingMing Yu(于明明);  Zhang H(张恒);  Fei Yin(殷飞);  Cheng-Lin Liu(刘成林)
Adobe PDF(5849Kb)  |  Views/Downloads: 42/15  |  Submitted: 2024/06/24
Improving Generalization of Deepfake Detectors by Imposing Gradient Regularization  Journal article
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, Volume: 19, Issue: 2024, Pages: 5345-5356
Authors:  Weinan Guan;  Wei Wang;  Jing Dong;  Bo Peng
Adobe PDF(1989Kb)  |  Views/Downloads: 43/14  |  Submitted: 2024/06/21
Deepfake detection  forgery texture patterns  
A robust transformer-based pipeline of 3D cell alignment, denoise and instance segmentation on electron microscopy sequence images  Journal article
Journal of Plant Physiology, 2024, Pages: 154236
Authors:  Jiazheng, Liu;  Yafeng, Zheng;  Limei, Lin;  Jingyue, Guo;  Yanan, Lv;  Jingbin, Yuan;  Hao, Zhai;  Xi, Chen;  Lijun, Shen;  LinLin, Li;  Shunong, Bai;  Hua, Han
Adobe PDF(15549Kb)  |  Views/Downloads: 34/10  |  Submitted: 2024/06/11
Spiking Neural Network for Ultralow-Latency and High-Accurate Object Detection  Journal article
IEEE Transactions on Neural Networks and Learning Systems, 2024, Pages: 10.1109/TNNLS.2024.3372613
Authors:  Jinye Qu;  Zeyu Gao;  Tielin Zhang;  Yanfeng Lu;  Huajin Tang;  Hong Qiao
Adobe PDF(2939Kb)  |  Views/Downloads: 40/17  |  Submitted: 2024/06/06
Low latency  object detection  spiking neural network (SNN)  timesteps compression  
Health and Senior Care Video Moment Localization With Procedure Knowledge Distillation  Conference paper
Istanbul, Turkiye, Dec 5-8
Authors:  Chaochen Wu;  Meiyun Zuo;  Guan Luo;  Yuna Jiang
Adobe PDF(3140Kb)  |  Views/Downloads: 40/17  |  Submitted: 2024/06/05
A Double-Hurdle Quantification Model for Freezing of Gait of Parkinson's Patients  Journal article
IEEE Transactions on Biomedical Engineering, 2024, Pages: 1-12
Authors:  Ningcun Xu;  Chen Wang;  Liang Peng;  Xiao-Hu Zhou;  Jingyao Chen;  Zhi Cheng;  Zeng-Guang Hou
Adobe PDF(875Kb)  |  Views/Downloads: 41/13  |  Submitted: 2024/06/04
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis  Journal article
IEEE Transactions on Affective Computing, 2023, Volume: 15, Issue: 1, Pages: 1-17
Authors:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  Views/Downloads: 62/18  |  Submitted: 2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition  Journal article
Information Fusion, 2024, Volume: 108, Pages: 1-20
Authors:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2281Kb)  |  Views/Downloads: 50/11  |  Submitted: 2024/05/31
Audio-Visual Emotion Recognition  Self-supervised learning  Masked autoencoder  Contrastive learning