CASIA OpenIR

Browse/Search results: 405 items in total, showing items 1-10

Audio Mixing Inversion via Embodied Self-supervised Learning (Journal article)
Machine Intelligence Research, 2024, Vol. 21, No. 1, pp. 55-62
Authors:  Haotian Zhou;  Feng Yu;  Xihong Wu
Adobe PDF(1288Kb)  |  Views/Downloads: 4/3  |  Submitted: 2024/04/23
Audio mixing inversion, intelligent audio mixing, self-supervised learning, audio signal processing, deep learning  
Exploring Variational Auto-encoder Architectures, Configurations, and Datasets for Generative Music Explainable AI (Journal article)
Machine Intelligence Research, 2024, Vol. 21, No. 1, pp. 29-45
Authors:  Nick Bryan-Kinns;  Bingyuan Zhang;  Songyan Zhao;  Berker Banar
Adobe PDF(1683Kb)  |  Views/Downloads: 3/2  |  Submitted: 2024/04/23
Variational auto-encoder, explainable AI (XAI), generative music, musical features, datasets  
Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art (Journal article)
Machine Intelligence Research, 2024, Vol. 21, No. 1, pp. 4-28
Authors:  Mengting Liu;  Ying Zhou;  Yuwei Wu;  Feng Gao
Adobe PDF(14438Kb)  |  Views/Downloads: 1/0  |  Submitted: 2024/04/23
Artificial intelligence (AI) art, audio-visual, artificial intelligence generated content (AIGC), multimodal, artistic evaluation  
Multimodal Biometric Fusion Algorithm Based on Ranking Partition Collision Theory (Journal article)
Machine Intelligence Research, 2023, Vol. 20, No. 6, pp. 884-896
Authors:  Zhuorong Li;  Yunqi Tang
Adobe PDF(2010Kb)  |  Views/Downloads: 1/1  |  Submitted: 2024/04/23
Image processing, convolutional neural network, multimodal, biometrics, fusion  
Speech Emotion Recognition Using Cascaded Attention Network with Joint Loss for Discrimination of Confusions (Journal article)
Machine Intelligence Research, 2023, Vol. 20, No. 4, pp. 595-604
Authors:  Yang Liu;  Haoqin Sun;  Wenbo Guan;  Yuqi Xia;  Zhen Zhao
Adobe PDF(1966Kb)  |  Views/Downloads: 2/1  |  Submitted: 2024/04/23
Speech emotion recognition (SER), 3-dimensional (3D) feature, cascaded attention network (CAN), triplet loss, joint loss  
Federated Learning on Multimodal Data: A Comprehensive Survey (Journal article)
Machine Intelligence Research, 2023, Vol. 20, No. 4, pp. 539-553
Authors:  Yi-Ming Lin;  Yuan Gao;  Mao-Guo Gong;  Si-Jia Zhang;  Yuan-Qiao Zhang;  Zhi-Yuan Li
Adobe PDF(1253Kb)  |  Views/Downloads: 5/0  |  Submitted: 2024/04/23
Federated learning, multimodal learning, heterogeneous data, edge computing, collaborative learning  
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey (Journal article)
Machine Intelligence Research, 2023, Vol. 20, No. 4, pp. 447-482
Authors:  Xiao Wang;  Guangyao Chen;  Guangwu Qian;  Pengcheng Gao;  Xiao-Yong Wei;  Yaowei Wang;  Yonghong Tian;  Wen Gao
Adobe PDF(3540Kb)  |  Views/Downloads: 5/0  |  Submitted: 2024/04/23
Multi-modal (MM), pre-trained model (PTM), information fusion, representation learning, deep learning  
Masked Vision-language Transformer in Fashion (Journal article)
Machine Intelligence Research, 2023, Vol. 20, No. 3, pp. 421-434
Authors:  Ge-Peng Ji;  Mingchen Zhuge;  Dehong Gao;  Deng-Ping Fan;  Christos Sakaridis;  Luc Van Gool
Adobe PDF(2779Kb)  |  Views/Downloads: 4/2  |  Submitted: 2024/04/23
Vision-language, masked image reconstruction, transformer, fashion, e-commercial  
Deep Learning-based Moving Object Segmentation: Recent Progress and Research Prospects (Journal article)
Machine Intelligence Research, 2023, Vol. 20, No. 3, pp. 335-369
Authors:  Rui Jiang;  Ruixiang Zhu;  Hu Su;  Yinlin Li;  Yuan Xie;  Wei Zou
Adobe PDF(9061Kb)  |  Views/Downloads: 2/0  |  Submitted: 2024/04/23
Moving object segmentation (MOS), change detection, background subtraction, deep learning (DL), video understanding  
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning (Journal article)
Machine Intelligence Research, 2023, Vol. 20, No. 3, pp. 318-334
Authors:  Wai-Chung Kwan;  Hong-Ru Wang;  Hui-Min Wang;  Kam-Fai Wong
Adobe PDF(2211Kb)  |  Views/Downloads: 0/0  |  Submitted: 2024/04/23
Dialogue policy learning (DPL), task-oriented dialogue system (TOD), reinforcement learning (RL), dialogue system, Markov decision process