CASIA OpenIR

浏览/检索结果: 共11条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:132/28  |  提交时间:2023/06/21
Vision Enhanced Generative Pre-trained Language Model for Multimodal Sentence Summarization 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 289-298
作者:  Liqiang Jing;  Yiren Li;  Junhao Xu;  Yongcan Yu;  Pei Shen;  Xuemeng Song
Adobe PDF(2389Kb)  |  收藏  |  浏览/下载:5/1  |  提交时间:2024/04/23
Multimodal sentence summarization (MMSS)  generative pre-trained language model (GPLM)  natural language generation  deep learning  artificial intelligence  
Transmission Line Insulator Defect Detection Based on Swin Transformer and Context 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 5, 页码: 729-740
作者:  Yu Xi;  Ke Zhou;  Ling-Wen Meng;  Bo Chen;  Hao-Min Chen;  Jing-Yi Zhang
Adobe PDF(18337Kb)  |  收藏  |  浏览/下载:3/0  |  提交时间:2024/04/23
Insulator defect, object detection, Swin transformer, data augmentation, context information  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Fei-Long Chen;  Du-Zhen Zhang;  Ming-Lun Han;  Xiu-Yi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1427Kb)  |  收藏  |  浏览/下载:2/1  |  提交时间:2024/04/23
Vision and language  pre-training  transformers  multimodal learning  representation learning  
Federated Learning with Privacy-preserving and Model IP-right-protection 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 19-37
作者:  Qiang Yang;  Anbu Huang;  Lixin Fan;  Chee Seng Chan;  Jian Han Lim;  Kam Woh Ng;  Ding Sheng Ong;  Bowen Li
Adobe PDF(2634Kb)  |  收藏  |  浏览/下载:6/2  |  提交时间:2024/04/23
Federated learning  privacy-preserving machine learning  security  decentralized learning  intellectual property protection  
Multimodal Pretraining from Monolingual to Multilingual 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 220-232
作者:  Liang Zhang;  Ludan Ruan;  Anwen Hu;  Qin Jin
Adobe PDF(3024Kb)  |  收藏  |  浏览/下载:2/1  |  提交时间:2024/04/23
Multilingual pretraining  multimodal pretraining  cross-lingual transfer  multilingual generation  cross-modal retrieval  
Towards Interpretable Defense Against Adversarial Attacks via Causal Inference 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 3, 页码: 209-226
作者:  Min Ren;  Yun-Long Wang;  Zhao-Feng He
Adobe PDF(5143Kb)  |  收藏  |  浏览/下载:1/0  |  提交时间:2024/04/23
Adversarial sample  adversarial defense  causal inference  interpretable machine learning  transformers  
Dense Face Network: A Dense Face Detector Based on Global Context and Visual Attention Mechanism 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 3, 页码: 247-256
作者:  Lin Song;  Jin-Fu Yang;  Qing-Zhen Shang;  Ming-Ai Li
Adobe PDF(1569Kb)  |  收藏  |  浏览/下载:5/2  |  提交时间:2024/04/23
Video Polyp Segmentation: A Deep Learning Perspective 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 6, 页码: 531-549
作者:  Ge-Peng Ji;  Guobao Xiao;  Yu-Cheng Chou;  Deng-Ping Fan;  Kai Zhao;  Geng Chen;  Luc Van Gool
Adobe PDF(9520Kb)  |  收藏  |  浏览/下载:6/1  |  提交时间:2024/04/23
Video polyp segmentation (VPS)  dataset  self-attention  colonoscopy  abdomen  
Glaucoma Detection with Retinal Fundus Images Using Segmentation and Classification 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 6, 页码: 563-580
作者:  Thisara Shyamalee;  Dulani Meedeniya
Adobe PDF(3581Kb)  |  收藏  |  浏览/下载:4/2  |  提交时间:2024/04/23
Attention U-Net  segmentation  classification  Inception-v3  visual geometry group 19 (VGG19)  residual neural network 50 (ResNet50)  glaucoma  fundus images