CASIA OpenIR

Browse/Search results: 88 items in total, showing items 1-10

AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation (Journal article)
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, Pages: 1-15
Authors:  Sun, Jianxin;  Deng, Qiyao;  Li, Qi;  Sun, Muyi;  Liu, Yunfan;  Sun, Zhenan
Adobe PDF(16839Kb)  |  Views/Downloads: 41/8  |  Submitted: 2024/02/23
Visual Semantic Segmentation Based on Few/Zero-Shot Learning: An Overview (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 5, Pages: 1106-1126
Authors:  Wenqi Ren;  Yang Tang;  Qiyu Sun;  Chaoqiang Zhao;  Qing-Long Han
Adobe PDF(12695Kb)  |  Views/Downloads: 11/2  |  Submitted: 2024/04/10
Computer vision  deep learning  few-shot learning  low-shot learning  semantic segmentation  zero-shot learning  
Adaptively Enhancing Facial Expression Crucial Regions via a Local Non-local Joint Network (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 2, Pages: 331-348
Authors:  Guanghui Shi;  Shasha Mao;  Shuiping Gou;  Dandan Yan;  Licheng Jiao;  Lin Xiong
Adobe PDF(3926Kb)  |  Views/Downloads: 6/2  |  Submitted: 2024/04/23
Facial expression recognition, deep neural network, multiple network ensemble, attention network, facial crucial regions  
Comprehensive Relation Modelling for Image Paragraph Generation (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 2, Pages: 369-382
Authors:  Xianglu Zhu;  Zhang Zhang;  Wei Wang;  Zilei Wang
Adobe PDF(1963Kb)  |  Views/Downloads: 4/2  |  Submitted: 2024/04/23
Image paragraph generation, visual relationship, scene graph, graph convolutional network (GCN), long short-term memory  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering (Journal article)
IEEE Transactions on Multimedia, 2023, Pages: 1-15
Authors:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  Views/Downloads: 163/43  |  Submitted: 2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation (Journal article)
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 1-13
Authors:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  Views/Downloads: 118/20  |  Submitted: 2023/05/03
Text-guided sounding-video generation  Video-audio representation  Contrastive learning  Transformer  
Learning Video-Text Aligned Representations for Video Captioning (Journal article)
ACM Trans. Multimedia Comput. Commun. Appl., 2023, Pages: 1-21
Authors:  Yaya Shi;  Haiyang Xu;  Chunfeng Yuan;  Bing Li;  Weiming Hu;  Zhengjun Zha
Adobe PDF(3574Kb)  |  Views/Downloads: 184/68  |  Submitted: 2023/04/28
XANet: An Efficient Remote Sensing Image Segmentation Model Using Element-Wise Attention Enhancement and Multi-Scale Attention Fusion (Journal article)
REMOTE SENSING, 2023, Volume: 15, Issue: 1, Pages: 25
Authors:  Liang, Chenbin;  Xiao, Baihua;  Cheng, Bo;  Dong, Yunyun
Adobe PDF(63859Kb)  |  Views/Downloads: 269/25  |  Submitted: 2023/02/22
semantic segmentation  attention mechanism  cross-attention  feature fusion  
Integrating Relational Knowledge With Text Sequences for Script Event Prediction (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2023, Pages: early access
Authors:  Zikang Wang;  Linjing Li;  Daniel Zeng
Adobe PDF(3215Kb)  |  Views/Downloads: 268/83  |  Submitted: 2023/03/20
Parallel Learning: Overview and Perspective for Computational Learning Across Syn2Real and Sim2Real (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2023, Volume: 10, Issue: 3, Pages: 603-631
Authors:  Qinghai Miao;  Yisheng Lv;  Min Huang;  Xiao Wang;  Fei-Yue Wang
Adobe PDF(11937Kb)  |  Views/Downloads: 802/144  |  Submitted: 2023/03/02
Machine learning  parallel learning  parallel systems  sim-to-real  syn-to-real  virtual-to-real