CASIA OpenIR

Browse/Search Results: 4 records in total, showing 1-4

Learning Hierarchical Video Graph Networks for One-Stop Video Delivery (Journal article)
ACM Transactions on Multimedia Computing, Communications, and Applications, 2022, Volume: 18, Issue: 1, Pages: 1-23
Authors: Song, Yaguang; Gao, Junyu; Yang, Xiaoshan; Xu, Changsheng
Adobe PDF (7608 KB) | Views/Downloads: 122/38 | Submitted: 2023/04/25
Cross modal  video retrieval  deep learning  graph neural networks  
Deep Audio-Visual Learning: A Survey (Journal article)
International Journal of Automation and Computing, 2021, Volume: 18, Issue: 3, Pages: 351-376
Authors: Hao Zhu; Man-Di Luo; Rui Wang; Ai-Hua Zheng; Ran He
Adobe PDF (1864 KB) | Views/Downloads: 188/35 | Submitted: 2021/05/24
Deep audio-visual learning  audio-visual separation and localization  correspondence learning  generative models  representation learning  
CTNet: Conversational Transformer Network for Emotion Recognition (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Volume: 29, Pages: 985-1000
Authors: Lian, Zheng; Liu, Bin; Tao, Jianhua
Adobe PDF (2230 KB) | Views/Downloads: 314/58 | Submitted: 2021/05/06
Emotion recognition  Context modeling  Feature extraction  Fuses  Speech processing  Data models  Bidirectional control  Context-sensitive modeling  conversational transformer network (CTNet)  conversational emotion recognition  multimodal fusion  speaker-sensitive modeling  
Centroid-aware local discriminative metric learning in speaker verification (Journal article)
PATTERN RECOGNITION, 2017, Volume: 72, Pages: 176-185
Authors: Sheng, Kekai; Dong, Weiming; Li, Wei; Razik, Joseph; Huang, Feiyue; Hu, Baogang
Adobe PDF (2013 KB) | Views/Downloads: 406/106 | Submitted: 2018/01/02
Text-independent ASV  Centroid-aware Balanced Boosting Sampling  Adaptive Neighborhood Component Analysis  Linear Magnet