CASIA OpenIR

浏览/检索结果: 共92条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Multi-modal spatial relational attention networks for visual question answering 期刊论文
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 13
作者:  Yao, Haibo;  Wang, Lipeng;  Cai, Chengtao;  Sun, Yuxin;  Zhang, Zhi;  Luo, Yongkang
收藏  |  浏览/下载:45/0  |  提交时间:2024/02/22
Visual question answering  Spatial relation  Attention mechanism  Pre -training strategy  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:33/10  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 12
作者:  Zheng, Wenbo;  Yan, Lan;  Wang, Fei-Yue
收藏  |  浏览/下载:70/0  |  提交时间:2023/12/21
Graph attention  graph reasoning  multimodal graph  self-attention  text-based visual question answering  
Semisupervised Progressive Representation Learning for Deep Multiview Clustering 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 15
作者:  Chen, Rui;  Tang, Yongqiang;  Xie, Yuan;  Feng, Wenlong;  Zhang, Wensheng
收藏  |  浏览/下载:78/0  |  提交时间:2023/11/17
Representation learning  Training  Data models  Task analysis  Complexity theory  Semisupervised learning  Optimization  Deep clustering  multiview clustering  progressive sample learning  semisupervised learning  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:117/20  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Optimization-Based Post-Training Quantization With Bit-Split and Stitching 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 2, 页码: 2119-2135
作者:  Wang, Peisong;  Chen, Weihan;  He, Xiangyu;  Chen, Qiang;  Liu, Qingshan;  Cheng, Jian
Adobe PDF(921Kb)  |  收藏  |  浏览/下载:154/41  |  提交时间:2023/03/20
Deep neural networks  compression  quantization  post-training quantization  
Can Digital Intelligence and Cyber-Physical-Social Systems Achieve Global Food Security and Sustainability? 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 11, 页码: 2070-2080
作者:  Yanfen Wang;  Mengzhen Kang;  Yali Liu;  Juanjuan Li;  Kai Xue;  Xiujuan Wang;  Jianqing Du;  Yonglin Tian;  Qinghua Ni;  Fei-Yue Wang
Adobe PDF(9632Kb)  |  收藏  |  浏览/下载:183/107  |  提交时间:2023/09/22
Carbon-water balance  decision-support  digital intelligence (DI)  foundation models  planning  
Class-Oriented Self-Learning Graph Embedding for Image Compact Representation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 1, 页码: 74-87
作者:  Hu, Liangchen;  Dai, Zhenlei;  Tian, Lei;  Zhang, Wensheng
收藏  |  浏览/下载:161/0  |  提交时间:2023/03/20
Sparse matrices  Manifolds  Machine learning algorithms  Laplace equations  Heuristic algorithms  Data models  Data mining  Adaptive graph learning  separability examination  marginal information preserving  L-2,L-p-norm sparsity  compact representation  
Zero-Shot Predicate Prediction for Scene Graph Parsing 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 3140-3153
作者:  Li, Yiming;  Yang, Xiaoshan;  Huang, Xuhui;  Ma, Zhe;  Xu, Changsheng
收藏  |  浏览/下载:111/0  |  提交时间:2023/11/17
Deep learning  zero-shot  scene graph  
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 3622-3633
作者:  Liu, Chong;  Zhang, Yuqi;  Wang, Hongsong;  Chen, Weihua;  Wang, Fan;  Huang, Yan;  Shen, Yi-Dong;  Wang, Liang
收藏  |  浏览/下载:88/0  |  提交时间:2023/11/17
Index Terms-Image-text retrieval  multimodal transformer  multimodal contrastive training