CASIA OpenIR

Browse/Search results: 75 items in total, showing items 1-10

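Research on Text-to-Image Generation Methods in Unconstrained Scenarios (非受限场景下文本到图像的生成方法研究) [Thesis]
2024
Author: Sun Jianxin (孙建新)
Adobe PDF(32226Kb) | Views/Downloads: 3/0 | Submitted: 2024/06/04
Generative adversarial networks, diffusion models, text-to-image generation, face image editing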
Transformer-based Spiking Neural Networks for Multimodal Audio-Visual Classification [Journal article]
IEEE Transactions on Cognitive and Developmental Systems, 2023, DOI: 10.1109/TCDS.2023.3327081
Authors: Guo LY (郭凌月); Zeyu Gao; Jinye Qu; Suiwu Zheng; Runhao Jiang; Yanfeng Lu; Hong Qiao
Adobe PDF(3922Kb) | Views/Downloads: 8/2 | Submitted: 2024/05/28
Audio Mixing Inversion via Embodied Self-supervised Learning [Journal article]
Machine Intelligence Research, 2024, Volume: 21, Issue: 1, Pages: 55-62
Authors: Haotian Zhou; Feng Yu; Xihong Wu
Adobe PDF(1288Kb) | Views/Downloads: 18/5 | Submitted: 2024/04/23
Audio mixing inversion, intelligent audio mixing, self-supervised learning, audio signal processing, deep learning
A Review of Predictive and Contrastive Self-supervised Learning for Medical Images [Journal article]
Machine Intelligence Research, 2023, Volume: 20, Issue: 4, Pages: 483-513
Authors: Wei-Chien Wang; Euijoon Ahn; Dagan Feng; Jinman Kim
Adobe PDF(2691Kb) | Views/Downloads: 23/6 | Submitted: 2024/04/23
Self-supervised learning (SSL), contrastive learning, deep learning, medical image analysis, computer vision
VLP: A Survey on Vision-language Pre-training [Journal article]
Machine Intelligence Research, 2023, Volume: 20, Issue: 1, Pages: 38-56
Authors: Fei-Long Chen; Du-Zhen Zhang; Ming-Lun Han; Xiu-Yi Chen; Jing Shi; Shuang Xu; Bo Xu
Adobe PDF(1427Kb) | Views/Downloads: 21/6 | Submitted: 2024/04/23
Vision and language, pre-training, transformers, multimodal learning, representation learning
EEG-based Emotion Recognition Using Multiple Kernel Learning [Journal article]
Machine Intelligence Research, 2022, Volume: 19, Issue: 5, Pages: 472-484
Authors: Qian Cai; Guo-Chong Cui; Hai-Xian Wang
Adobe PDF(1939Kb) | Views/Downloads: 17/7 | Submitted: 2024/04/23
Emotion recognition, electroencephalography (EEG), multiple kernel learning, machine learning, brain computer interface
Opportunities and challenges for biometrics [Monograph]
Switzerland: Springer, 2020
Authors: Sun, Zhenan; Li, Qi; Liu, Yunfan; Zhu, Yuhao
Adobe PDF(590Kb) | Views/Downloads: 84/34 | Submitted: 2024/02/23
Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments [Journal article]
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, Volume: 32, Pages: 151-163
Authors: Xu, Jiaming; Cui, Jian; Hao, Yunzhe; Xu, Bo
Views/Downloads: 61/0 | Submitted: 2024/02/22
Cocktail party problem, target speaker separation, multi-cue guided separation, semi-supervised learning
Second-Order Global Attention Networks for Graph Classification and Regression [Conference paper]
Beijing, China, August 27-28, 2022
Authors: Hu Fenyu; Cui Zeyu; Wu Shu; Liu Qiang; Wu Jinlin; Wang Liang; Tan Tieniu
Adobe PDF(69424Kb) | Views/Downloads: 198/69 | Submitted: 2023/07/06
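Research on Models and Algorithms for Annotation-Limited Video Human Action Understanding (标注受限视频人体行为理解模型与算法研究) [Thesis]
2023
Author: Li Ding (李定)
Adobe PDF(8391Kb) | Views/Downloads: 155/8 | Submitted: 2023/06/28
Annotation-limited learning, human action understanding, active learning, video clip retrieval, self-supervised learning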