CASIA OpenIR

浏览/检索结果: 共101条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:96/9  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:34/10  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
基于视听融合的目标定位与趋近导航 学位论文
, 2023
作者:  王智清
Adobe PDF(49485Kb)  |  收藏  |  浏览/下载:118/10  |  提交时间:2023/06/02
听觉特征,声源定位,趋近控制,机器人运动,多目标定位,视听融合  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:118/20  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Temporal sparse adversarial attack on sequence-based gait recognition 期刊论文
PATTERN RECOGNITION, 2023, 卷号: 133, 页码: 11
作者:  He, Ziwen;  Wang, Wei;  Dong, Jing;  Tan, Tieniu
Adobe PDF(1435Kb)  |  收藏  |  浏览/下载:287/52  |  提交时间:2022/11/21
Adversarial attack  Gait recognition  Temporal sparsity  
Graph-Enhanced Emotion Neural Decoding 期刊论文
IEEE Transactions on Medical Imaging, 2023, 页码: 1-1
作者:  Huang ZY(黄中昱);  Du CD(杜长德);  Wang YH;  Fu KC(付铠成);  He HG(何晖光)
Adobe PDF(6049Kb)  |  收藏  |  浏览/下载:235/52  |  提交时间:2023/05/05
Dual-stream Representation Fusion Learning for accurate medical image segmentation 期刊论文
Engineering Applications of Artificial Intelligence, 2023, 卷号: 123, 页码: 106402
作者:  Xu RT(许镕涛);  Wang CW(王常维);  Xu SB(徐士彪);  Meng WL(孟维亮);  Zhang XP(张晓鹏)
Adobe PDF(1893Kb)  |  收藏  |  浏览/下载:208/54  |  提交时间:2023/05/18
BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-term Pose Forecasting 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: Early Access, 期号: Early Access, 页码: Early Access
作者:  Shentong Mo;  Xin M(辛淼)
Adobe PDF(2209Kb)  |  收藏  |  浏览/下载:86/14  |  提交时间:2023/04/25
long-term forecasting  spatial-temporal graph transformer  Bayesian transformer  uncertainty estimation  
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2963-2973
作者:  Yi, Jiangyan;  Tao, Jianhua;  Fu, Ruibo;  Wang, Tao;  Zhang, Chu Yuan;  Wang, Chenglong
收藏  |  浏览/下载:42/0  |  提交时间:2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
A Framework and Operational Procedures for Metaverses-Based Industrial Foundation Models 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 页码: 10
作者:  Wang, Jiangong;  Tian, Yonglin;  Wang, Yutong;  Yang, Jing;  Wang, Xingxia;  Wang, Sanjin;  Kwan, Oliver
Adobe PDF(3322Kb)  |  收藏  |  浏览/下载:125/33  |  提交时间:2023/02/22
Cyber-physical-social intelligence (CPSI)  cyber-physical-social systems (CPSSs)  industrial foundation models (IFMs)  intelligent enterprises  metaverses  operational processes  parallel intelligence