CASIA OpenIR

Browse/Search Results:  1-4 of 4 Help

  Show only claimed items
Selected(0)Clear Items/Page:    Sort:
WL-MSR: Watch and Listen for Multimodal Subtitle Recognition 会议论文
, Greece, 2023-6-4
Authors:  Liu, Jiawei;  Wang, Hao;  Wang, Weining;  He, Xingjian;  Liu, Jing
Adobe PDF(1673Kb)  |  Favorite  |  View/Download:173/39  |  Submit date:2023/07/06
文本指导的视频生成方法研究 学位论文
, 2023
Authors:  刘佳伟
Adobe PDF(15246Kb)  |  Favorite  |  View/Download:158/6  |  Submit date:2023/06/08
基于人工智能的内容生成  多模态  视频生成  
ED-T2V: An Efficient Training Framework for Diffusion-based Text-to-Video Generation 会议论文
, Queensland, Australia, 2023-6-18
Authors:  Liu, Jiawei;  Wang, Weining;  Liu, Wei;  He, Qian;  Liu, Jing
Adobe PDF(4537Kb)  |  Favorite  |  View/Download:208/44  |  Submit date:2023/05/04
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
Authors:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  Favorite  |  View/Download:162/32  |  Submit date:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer