CASIA OpenIR

浏览/检索结果: 共28条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
SSCFormer: Push the Limit of Chunk-Wise Conformer for Streaming ASR Using Sequentially Sampled Chunks and Chunked Causal Convolution 期刊论文
IEEE SIGNAL PROCESSING LETTERS, 2024, 卷号: 31, 页码: 421-425
作者:  Wang, Fangyuan;  Xu, Bo;  Xu, Bo
收藏  |  浏览/下载:13/0  |  提交时间:2024/07/03
Convolution  Complexity theory  Computational modeling  Decoding  Training  Kernel  Transformers  Conformer  streaming ASR  sequentially sampled chunks  chunked causal convolution  linear complexity  
Global and local multi-modal feature mutual learning for retinal vessel segmentation 期刊论文
Pattern Recognition, 2024, 卷号: 151, 页码: 110376
作者:  Xin Zhao;  Zhang Jing;  Qiaozhe Li;  Tengfei Zhao;  Yi Li;  Zifeng Wu
Adobe PDF(4182Kb)  |  收藏  |  浏览/下载:47/18  |  提交时间:2024/06/21
Mutual learning  Multi-modal learning  OCTA images  Retinal vessel segmentation  
Bidirectional Sentence Ordering with Interactive Decoding 期刊论文
ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, 卷号: 22, 期号: 2, 页码: 1-15
作者:  Guirong Bai;  Shizhu HE;  Kang Liu;  Jun Zhao
Adobe PDF(1080Kb)  |  收藏  |  浏览/下载:44/16  |  提交时间:2024/06/20
SSCFormer: Push the Limit of Chunk-Wise Conformer for Streaming ASR Using Sequentially Sampled Chunks and Chunked Causal Convolution 期刊论文
IEEE SIGNAL PROCESSING LETTERS, 2024, 页码: 421-425
作者:  Wang FY(王方圆);  Xu B(徐博);  Xu B(徐波)
Adobe PDF(1843Kb)  |  收藏  |  浏览/下载:58/14  |  提交时间:2024/06/12
Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 卷号: 32, 页码: 151-163
作者:  Xu, Jiaming;  Cui, Jian;  Hao, Yunzhe;  Xu, Bo
收藏  |  浏览/下载:105/0  |  提交时间:2024/02/22
Cocktail party problem  target speaker separation  multi-cue guided separation  semi-supervised learning  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:186/38  |  提交时间:2023/06/21
BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-term Pose Forecasting 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: Early Access, 期号: Early Access, 页码: Early Access
作者:  Shentong Mo;  Xin M(辛淼)
Adobe PDF(2209Kb)  |  收藏  |  浏览/下载:111/23  |  提交时间:2023/04/25
long-term forecasting  spatial-temporal graph transformer  Bayesian transformer  uncertainty estimation  
A Hierarchical Architecture for Multisymptom Assessment of Early Parkinson's Disease via Wearable Sensors 期刊论文
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 卷号: 14, 期号: 4, 页码: 1553-1563
作者:  Wang, Chen;  Peng, Liang;  Hou, Zeng-Guang;  Li, Yanfeng;  Tan, Ying;  Hao, Honglin
Adobe PDF(1912Kb)  |  收藏  |  浏览/下载:323/16  |  提交时间:2023/03/20
Diseases  Machine learning  Hidden Markov models  Accelerometers  Monitoring  Gyroscopes  Parkinson's disease  Wearable computing  Sensor systems  multilevel fusion  multisymptom assessment  Parkinson's disease (PD)  wearable sensor system  
Train from scratch: Single-stage joint training of speech separation and recognition 期刊论文
COMPUTER SPEECH AND LANGUAGE, 2022, 卷号: 76, 页码: 15
作者:  Shi, Jing;  Chang, Xuankai;  Watanabe, Shinji;  Xu, Bo
收藏  |  浏览/下载:254/0  |  提交时间:2022/07/25
Cocktail party problem  Speech separation  Multi-speaker speech recognition  End-to-end  Joint-training  
The Consistent Extended Kalman Filter Design for Maneuvering Target Tracking and Its Application on Hand Position Tracking 期刊论文
Guidance, Navigation and Control, 2022, 页码: 已接受未发表
作者:  Lin, Tian;  Yang, Xu;  Wenchao, Xue;  Long, Cheng
Adobe PDF(12758Kb)  |  收藏  |  浏览/下载:198/48  |  提交时间:2022/06/14