CASIA OpenIR

Browse/Search Results: 20 items in total, showing items 1-10

Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, Volume: 31, Pages: 2963-2973
Authors:  Yi, Jiangyan;  Tao, Jianhua;  Fu, Ruibo;  Wang, Tao;  Zhang, Chu Yuan;  Wang, Chenglong
Views/Downloads: 62/0  |  Submitted: 2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
Hierarchical graph attention network for temporal knowledge graph reasoning (Journal article)
NEUROCOMPUTING, 2023, Volume: 550, Pages: 126390
Authors:  Shao, Pengpeng;  He, Jiayi;  Li, Guanjun;  Zhang, Dawei;  Tao, Jianhua
Adobe PDF(589Kb)  |  Views/Downloads: 151/5  |  Submitted: 2023/11/17
Temporal knowledge graphs  Graph attention network  Reasoning  
GCNet: Graph Completion Network for Incomplete Multimodal Learning in Conversation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 7, Pages: 8419-8432
Authors:  Lian, Zheng;  Chen, Lan;  Sun, Licai;  Liu, Bin;  Tao, Jianhua
Adobe PDF(3959Kb)  |  Views/Downloads: 156/3  |  Submitted: 2023/11/17
Oral communication  Correlation  Data models  Task analysis  Feature extraction  Tensors  Benchmark testing  Conversational data  graph complete network (GCNet)  incomplete multimodal learning  speaker-sensitive modeling  temporal-sensitive modeling  
Cycle-Consistent Weakly Supervised Visual Grounding With Individual and Contextual Representations (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 5167-5180
Authors:  Zhang, Ruisong;  Wang, Chuang;  Liu, Cheng-Lin
Views/Downloads: 120/0  |  Submitted: 2023/11/16
Visualization  Grounding  Task analysis  Sports equipment  Image reconstruction  Transformers  Training  Weakly supervised learning  visual grounding  cycle consistency  individual and contextual representations  
SMIN: Semi-Supervised Multi-Modal Interaction Network for Conversational Emotion Recognition (Journal article)
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, Volume: 14, Issue: 3, Pages: 2415-2429
Authors:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2103Kb)  |  Views/Downloads: 130/5  |  Submitted: 2023/11/15
Emotion recognition  Feature extraction  Training  Acoustics  Semisupervised learning  Benchmark testing  Hidden Markov models  Semi-supervised multi-modal interaction network (SMIN)  conversational emotion recognition  semi-supervised learning  intra-modal interaction  cross-modal interaction  
PIRNet: Personality-Enhanced Iterative Refinement Network for Emotion Recognition in Conversation (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 12
Authors:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2013Kb)  |  Views/Downloads: 385/6  |  Submitted: 2022/09/19
Emotion recognition  Iterative methods  Context modeling  Psychology  Oral communication  Logic gates  Learning systems  Contextual information  emotion recognition in conversation (ERC)  iterative method  Personality-enhanced Iterative Refinement Network (PIRNet)  personality influence  
Tucker decomposition-based temporal knowledge graph completion (Journal article)
KNOWLEDGE-BASED SYSTEMS, 2022, Volume: 238, Pages: 9
Authors:  Shao, Pengpeng;  Zhang, Dawei;  Yang, Guohua;  Tao, Jianhua;  Che, Feihu;  Liu, Tong
Adobe PDF(611Kb)  |  Views/Downloads: 290/50  |  Submitted: 2022/06/10
Temporal knowledge graphs  Tucker decomposition  Reconstruction  Contrastive learning  
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 865-878
Authors:  Wang, Tao;  Fu, Ruibo;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi
Views/Downloads: 273/0  |  Submitted: 2022/06/06
Vocoders  Stochastic processes  Neural networks  Speech processing  Signal to noise ratio  Acoustics  Speech enhancement  Vocoder  speech synthesis  deterministic plus stochastic  multiband excitation  noise control  
Boost 3-D Object Detection via Point Clouds Segmentation and Fused 3-D GIoU-L1 Loss (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 2, Pages: 762-773
Authors:  Chen, Yaran;  Li, Haoran;  Gao, Ruiyuan;  Zhao, Dongbin
Adobe PDF(2082Kb)  |  Views/Downloads: 268/58  |  Submitted: 2022/03/17
3-D object detection  generalized Intersection of Union (GIoU) loss  segmentation  
F0-Noise-Robust Glottal Source and Vocal Tract Analysis Based on ARX-LF Model (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Volume: 29, Pages: 3375-3383
Authors:  Li, Yongwei;  Tao, Jianhua;  Erickson, Donna;  Liu, Bin;  Akagi, Masato
Views/Downloads: 140/0  |  Submitted: 2021/12/28
Speech recognition  Iterative methods  Production  Estimation  Brain modeling  Shape  Low-frequency noise  Glottal source  vocal tract  source-filter model  ARX-LF model