CASIA OpenIR

浏览/检索结果: 共10条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Vision Transformers with Hierarchical Attention 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 670-683
作者:  Yun Liu;   Yu-Huan Wu;   Guolei Sun;    Le Zhang;  Ajad Chhatkuli;   Luc Van Gool
Adobe PDF(1358Kb)  |  收藏  |  浏览/下载:22/6  |  提交时间:2024/07/18
Vision transformer  hierarchical attention  global attention  local attention  scene understanding  
Comprehensive Relation Modelling for Image Paragraph Generation 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 369-382
作者:  Xianglu Zhu;  Zhang Zhang;  Wei Wang;  Zilei Wang
Adobe PDF(1963Kb)  |  收藏  |  浏览/下载:68/24  |  提交时间:2024/04/23
Image paragraph generation, visual relationship, scene graph, graph convolutional network (GCN), long short-term memory  
Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 4-28
作者:  Mengting Liu;  Ying Zhou;  Yuwei Wu;  Feng Gao
Adobe PDF(14438Kb)  |  收藏  |  浏览/下载:66/9  |  提交时间:2024/04/23
Artificial intelligence (AI) art, audio-visual, artificial intelligence generated content (AIGC), multimodal, artistic evaluation  
Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 569-582
作者:  Haoyu Lu;  Yuqi Huo;  Mingyu Ding;  Nanyi Fei;  Zhiwu Lu
Adobe PDF(2928Kb)  |  收藏  |  浏览/下载:57/22  |  提交时间:2024/04/23
Image-text retrieval, multimodal modeling, contrastive learning, weak correlation, computer vision  
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 447-482
作者:  Xiao Wang;  Guangyao Chen;  Guangwu Qian;  Pengcheng Gao;  Xiao-Yong Wei;  Yaowei Wang;  Yonghong Tian;  Wen Gao
Adobe PDF(3540Kb)  |  收藏  |  浏览/下载:71/16  |  提交时间:2024/04/23
Multi-modal (MM), pre-trained model (PTM), information fusion, representation learning, deep learning  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Fei-Long Chen;  Du-Zhen Zhang;  Ming-Lun Han;  Xiu-Yi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1427Kb)  |  收藏  |  浏览/下载:56/17  |  提交时间:2024/04/23
Vision and language  pre-training  transformers  multimodal learning  representation learning  
Federated Learning with Privacy-preserving and Model IP-right-protection 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 19-37
作者:  Qiang Yang;  Anbu Huang;  Lixin Fan;  Chee Seng Chan;  Jian Han Lim;  Kam Woh Ng;  Ding Sheng Ong;  Bowen Li
Adobe PDF(2634Kb)  |  收藏  |  浏览/下载:36/10  |  提交时间:2024/04/23
Federated learning  privacy-preserving machine learning  security  decentralized learning  intellectual property protection  
Video Polyp Segmentation: A Deep Learning Perspective 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 6, 页码: 531-549
作者:  Ge-Peng Ji;  Guobao Xiao;  Yu-Cheng Chou;  Deng-Ping Fan;  Kai Zhao;  Geng Chen;  Luc Van Gool
Adobe PDF(9520Kb)  |  收藏  |  浏览/下载:62/13  |  提交时间:2024/04/23
Video polyp segmentation (VPS)  dataset  self-attention  colonoscopy  abdomen  
Causal Reasoning Meets Visual Representation Learning: A Prospective Study 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 6, 页码: 485-511
作者:  Yang Liu;  Yu-Shen Wei;  Hong Yan;  Guan-Bin Li;  Liang Lin
Adobe PDF(3224Kb)  |  收藏  |  浏览/下载:52/6  |  提交时间:2024/04/23
Causal reasoning  visual representation learning  reliable artificial intelligence  spatial-temporal data  multi-modal analysis  
Efficient Visual Recognition: A Survey on Recent Advances and Brain-inspired Methodologies 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 5, 页码: 366-411
作者:  Yang Wu;  Ding-Heng Wang;  Xiao-Tong Lu;  Fan Yang;  Man Yao;  Wei-Sheng Dong;  Jian-Bo Shi;  Guo-Qi Li
Adobe PDF(6780Kb)  |  收藏  |  浏览/下载:52/8  |  提交时间:2024/04/23
Visual recognition  deep neural networks (DNNS)  brain-inspired methodologies  network compression  dynamic inference  survey