CASIA OpenIR

浏览/检索结果: 共18条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Subband fusion of complex spectrogram for fake speech detection 期刊论文
SPEECH COMMUNICATION, 2023, 卷号: 155, 页码: 8
作者:  Fan, Cunhang;  Xue, Jun;  Dong, Shunbo;  Ding, Mingming;  Yi, Jiangyan;  Li, Jinpeng;  Lv, Zhao
收藏  |  浏览/下载:22/0  |  提交时间:2024/03/26
Automatic speaker verification  Complex spectrogram  Fake speech detection  Phase information  Subband  
Mixed-Supervised Scene Text Detection With Expectation-Maximization Algorithm 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 5513-5528
作者:  Zhao, Mengbiao;  Feng, Wei;  Yin, Fei;  Zhang, Xu-Yao;  Liu, Cheng-Lin
Adobe PDF(5999Kb)  |  收藏  |  浏览/下载:339/35  |  提交时间:2022/09/19
Costs  Annotations  Training  Labeling  Detectors  Data models  Benchmark testing  Mixed-supervised learning  scene text detection  weak supervision forms  expectation-maximization algorithm  
Adaptable Global Network for Whole-Brain Segmentation with Symmetry Consistency Loss 期刊论文
COGNITIVE COMPUTATION, 2022, 页码: 14
作者:  Zhao, Yuan-Xing;  Zhang, Yan-Ming;  Song, Ming;  Liu, Cheng-Lin
Adobe PDF(2496Kb)  |  收藏  |  浏览/下载:285/77  |  提交时间:2022/07/25
Whole-brain segmentation  Adaptable global network  Semi-supervised learning  Symmetry consistency loss  
Detecting Overlapped Objects in X-Ray Security Imagery by a Label-Aware Mechanism 期刊论文
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2022, 卷号: 17, 页码: 998-1009
作者:  Zhao, Cairong;  Zhu, Liang;  Dou, Shuguang;  Deng, Weihong;  Wang, Liang
收藏  |  浏览/下载:185/0  |  提交时间:2022/06/06
X-ray imaging  Security  Object detection  Visualization  Liquids  Containers  Inspection  Object detection  X-ray dataset  overlap  
Boost 3-D Object Detection via Point Clouds Segmentation and Fused 3-D GIoU-L-1 Loss 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 2, 页码: 762-773
作者:  Chen, Yaran;  Li, Haoran;  Gao, Ruiyuan;  Zhao, Dongbin
Adobe PDF(2082Kb)  |  收藏  |  浏览/下载:244/51  |  提交时间:2022/03/17
3-D object detection  generalized Intersection of Union (GIoU) loss  segmentation  
Semantic-diversity transfer network for generalized zero-shot learning via inner disagreement based OOD detector 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2021, 卷号: 229, 页码: 11
作者:  Liu, Bo;  Dong, Qiulei;  Hu, Zhanyi
Adobe PDF(1224Kb)  |  收藏  |  浏览/下载:345/72  |  提交时间:2021/11/04
Zero-shot learning  Visual-semantic embedding  Out-of-distribution detection  
EAT-NAS: elastic architecture transfer for accelerating large-scale neural architecture search 期刊论文
SCIENCE CHINA-INFORMATION SCIENCES, 2021, 卷号: 64, 期号: 9, 页码: 13
作者:  Fang, Jiemin;  Chen, Yukang;  Zhang, Xinbang;  Zhang, Qian;  Huang, Chang;  Meng, Gaofeng;  Liu, Wenyu;  Wang, Xinggang
Adobe PDF(377Kb)  |  收藏  |  浏览/下载:318/55  |  提交时间:2021/11/02
architecture transfer  neural architecture search  evolutionary algorithm  large-scale dataset  
Unconstrained end-to-end text reading with feature rectification 期刊论文
PATTERN RECOGNITION LETTERS, 2021, 卷号: 149, 页码: 1-8
作者:  Du, Chen;  Wang, Yanna;  Wang, Chunheng;  Xiao, Baihua;  Shi, Cunzhao
Adobe PDF(1133Kb)  |  收藏  |  浏览/下载:299/63  |  提交时间:2021/11/02
Text recognition  Text detection  Position-sensitive network  Features incompatibility  End-to-end  
An Iterative Co-Training Transductive Framework for Zero Shot Learning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 卷号: 30, 页码: 6943-6956
作者:  Liu, Bo;  Hu, Lihua;  Dong, Qiulei;  Hu, Zhanyi
Adobe PDF(2452Kb)  |  收藏  |  浏览/下载:262/57  |  提交时间:2021/11/02
Visualization  Semantics  Training  Feature extraction  Testing  Detectors  Predictive models  Zero-shot learning  transductive learning co-training  
CTNet: Conversational Transformer Network for Emotion Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 985-1000
作者:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2230Kb)  |  收藏  |  浏览/下载:356/59  |  提交时间:2021/05/06
Emotion recognition  Context modeling  Feature extraction  Fuses  Speech processing  Data models  Bidirectional control  Context-sensitive modeling  conversational transformer network (CTNet)  conversational emotion recognition  multimodal fusion  speaker-sensitive modeling