CASIA OpenIR

浏览/检索结果: 共109条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Parsing Objects at a Finer Granularity: A Survey 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 3, 页码: 431-451
作者:  Yifan Zhao;  Jia Li;  Yonghong Tian
Adobe PDF(1743Kb)  |  收藏  |  浏览/下载:40/16  |  提交时间:2024/05/23
Finer granularity, visual parsing, part segmentation, fine-grained object recognition, part relationship  
基于i向量和变分自编码相对生成对抗网络的语音转换 期刊论文
自动化学报, 2022, 卷号: 48, 期号: 7, 页码: 1824-1833
作者:  李燕萍;  曹盼;  左宇涛;  张燕;  钱博
Adobe PDF(5653Kb)  |  收藏  |  浏览/下载:45/21  |  提交时间:2024/05/20
语音转换  相对生成对抗网络  i向量  非平行文本  变分自编码器  多对多  
Transformer: A General Framework from Machine Translation to Others 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 514-538
作者:  Yang Zhao;  Jiajun Zhang;  Chengqing Zong
Adobe PDF(1415Kb)  |  收藏  |  浏览/下载:53/15  |  提交时间:2024/04/23
Neural machine translation, Transformer, document neural machine translation (NMT), multimodal NMT, low-resource NMT  
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 447-482
作者:  Xiao Wang;  Guangyao Chen;  Guangwu Qian;  Pengcheng Gao;  Xiao-Yong Wei;  Yaowei Wang;  Yonghong Tian;  Wen Gao
Adobe PDF(3540Kb)  |  收藏  |  浏览/下载:71/16  |  提交时间:2024/04/23
Multi-modal (MM), pre-trained model (PTM), information fusion, representation learning, deep learning  
虹膜呈现攻击检测综述 期刊论文
自动化学报, 2024, 卷号: 50, 期号: 2, 页码: 241-281
作者:  王财勇;  刘星雨;  房美玲;  赵光哲;  何召锋;  孙哲南
Adobe PDF(26163Kb)  |  收藏  |  浏览/下载:67/19  |  提交时间:2024/04/12
虹膜识别  虹膜呈现攻击检测  虹膜合成  泛化性  可解释性  
Key-Part Attention Retrieval for Robotic Object Recognition 期刊论文
TSINGHUA SCIENCE AND TECHNOLOGY, 2024, 卷号: 29, 期号: 3, 页码: 644-655
作者:  Liu, Jierui;  Cao, Zhiqiang;  Tang, Yingbo
Adobe PDF(2164Kb)  |  收藏  |  浏览/下载:117/17  |  提交时间:2024/02/22
Training  Visualization  Image recognition  Cameras  Object recognition  Convolutional neural networks  Data mining  key-part attention  retrieval  robotic object recognition  
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:179/27  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Machine Learning With Data Assimilation and Uncertainty Quantification for Dynamical Systems: A Review 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 6, 页码: 1361-1387
作者:  Sibo Cheng;  César Quilodrán-Casas;  Said Ouala;  Alban Farchi;  Che Liu;  Pierre Tandeo;  Ronan Fablet;  Didier Lucor;  Bertrand Iooss;  Julien Brajard;  Dunhui Xiao;  Tijana Janjic;  Weiping Ding;  Yike Guo;  Alberto Carrassi;  Marc Bocquet;  Rossella Arcucci
Adobe PDF(17725Kb)  |  收藏  |  浏览/下载:125/33  |  提交时间:2023/05/29
Data assimilation (DA)  deep learning  machine learning (ML)  reduced-order-modelling  uncertainty quantification (UQ)  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:169/36  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
ECSS: High-Embedding-Capacity Audio Watermarking with Diversity Reception 期刊论文
ENTROPY, 2022, 卷号: 24, 期号: 12, 页码: 23
作者:  Wu, Shiqiang;  Huang, Ying;  Guan, Hu;  Zhang, Shuwu;  Liu, Jie
Adobe PDF(988Kb)  |  收藏  |  浏览/下载:221/10  |  提交时间:2023/02/22
digital audio watermarking  embedding capacity  spread spectrum  diversity reception