CASIA OpenIR

Browse/Search results: 440 records in total, showing 1-10

SceneFake: An initial dataset and benchmarks for scene fake audio detection (Journal article)
PATTERN RECOGNITION, 2024, Volume: 152, Pages: 12
Authors:  Yi, Jiangyan;  Wang, Chenglong;  Tao, Jianhua;  Zhang, Chu Yuan;  Fan, Cunhang;  Tian, Zhengkun;  Ma, Haoxin;  Fu, Ruibo
Views/Downloads: 2/0  |  Submitted: 2024/07/04
Scene manipulation  Fake audio detection  Speech enhancement  SceneFake dataset  
WavDepressionNet: Automatic Depression Level Prediction via Raw Speech Signals (Journal article)
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, Volume: 15, Issue: 1, Pages: 285-296
Authors:  Niu, Mingyue;  Tao, Jianhua;  Li, Yongwei;  Qin, Yong;  Li, Ya
Views/Downloads: 0/0  |  Submitted: 2024/07/03
Assessment block  depression level prediction  representation block  speech signals  WavDepressionNet  
Emotion selectable end-to-end text-based speech editing (Journal article)
ARTIFICIAL INTELLIGENCE, 2024, Volume: 329, Pages: 16
Authors:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zhang, Chu Yuan
Views/Downloads: 1/0  |  Submitted: 2024/07/03
Emotion selectable  Text-based speech editing  Emotion decoupling  Mask prediction  Few-shot learning  Text-to-speech  
SSCFormer: Push the Limit of Chunk-Wise Conformer for Streaming ASR Using Sequentially Sampled Chunks and Chunked Causal Convolution (Journal article)
IEEE SIGNAL PROCESSING LETTERS, 2024, Volume: 31, Pages: 421-425
Authors:  Wang, Fangyuan;  Xu, Bo;  Xu, Bo
Views/Downloads: 0/0  |  Submitted: 2024/07/03
Convolution  Complexity theory  Computational modeling  Decoding  Training  Kernel  Transformers  Conformer  streaming ASR  sequentially sampled chunks  chunked causal convolution  linear complexity  
Memory-Adaptive Vision-and-Language Navigation (Journal article)
Pattern Recognition, 2024, Volume: 153, Pages: 110511
Authors:  Keji He;  Ya Jing;  Yan Huang;  Zhihe Lu;  Dong An;  Liang Wang
Adobe PDF (3831 KB)  |  Views/Downloads: 25/9  |  Submitted: 2024/06/26
Vision-and-Language Navigation  Memory bank  History noises  Memory-Adaptive Model  
Modal Contrastive Learning Based End-to-End Text Image Machine Translation (Journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, Volume: 32, Issue: 32, Pages: 2153-2165
Authors:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF (6551 KB)  |  Views/Downloads: 17/8  |  Submitted: 2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
An Approach for Handwritten Chinese Text Recognition Unifying Character Segmentation and Recognition (Journal article)
Pattern Recognition, 2024, Pages: 110373
Authors:  MingMing Yu(于明明);  Zhang H(张恒);  Fei Yin(殷飞);  Cheng-Lin Liu(刘成林)
Adobe PDF (5849 KB)  |  Views/Downloads: 23/8  |  Submitted: 2024/06/24
Invisible Intruders: Label-Consistent Backdoor Attack using Re-parameterized Noise Trigger (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 14, Issue: 8, Pages: 1-13
Authors:  Bo Wang;  Fei Yu;  Fei Wei;  Yi Li;  Wei Wang
Adobe PDF (1364 KB)  |  Views/Downloads: 35/12  |  Submitted: 2024/06/21
Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, Volume: 46, Issue: 3, Pages: 1881-1897
Authors:  Gao, Jin;  Lu, Yan;  Qi, Xiaojuan;  Kou, Yutong;  Li, Bing;  Li, Liang;  Yu, Shan;  Hu, Weiming
Adobe PDF (915 KB)  |  Views/Downloads: 22/7  |  Submitted: 2024/06/21
Visualization  Training  Adaptation models  Data models  Optimization  Task analysis  Robustness  Online learning  few-shot online adaptation  visual tracking  continual learning  recursive least-squares estimation  
Improving Generalization of Deepfake Detectors by Imposing Gradient Regularization (Journal article)
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, Volume: 19, Issue: 2024, Pages: 5345-5356
Authors:  Weinan Guan;  Wei Wang;  Jing Dong;  Bo Peng
Adobe PDF (1989 KB)  |  Views/Downloads: 29/7  |  Submitted: 2024/06/21
Deepfake detection  forgery texture patterns