CASIA OpenIR

浏览/检索结果: 共84条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
基于边缘特征增强的任意形状文本检测网络 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 5, 页码: 1019-1030
作者:  白鹤翔;  王浩然
Adobe PDF(3157Kb)  |  收藏  |  浏览/下载:9/3  |  提交时间:2024/05/09
场景文本检测  任意形状  边缘区域  浅层特征  渐进尺度扩张网络  
一种基于成对字向量和噪声鲁棒学习的同义词挖掘算法 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 6, 页码: 1181-1194
作者:  张浩宇;  王戟
Adobe PDF(1420Kb)  |  收藏  |  浏览/下载:1/1  |  提交时间:2024/05/09
同义词挖掘  噪声标签学习  自然语言处理  成对字向量  信息抽取  
Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 569-582
作者:  Haoyu Lu;  Yuqi Huo;  Mingyu Ding;  Nanyi Fei;  Zhiwu Lu
Adobe PDF(2928Kb)  |  收藏  |  浏览/下载:11/2  |  提交时间:2024/04/23
Image-text retrieval, multimodal modeling, contrastive learning, weak correlation, computer vision  
Federated Learning on Multimodal Data: A Comprehensive Survey 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 539-553
作者:  Yi-Ming Lin;   Yuan Gao;  Mao-Guo Gong;  Si-Jia Zhang;  Yuan-Qiao Zhang;  Zhi-Yuan Li
Adobe PDF(1253Kb)  |  收藏  |  浏览/下载:11/5  |  提交时间:2024/04/23
Federated learning, multimodal learning, heterogeneous data, edge computing, collaborative learning  
Transformer: A General Framework from Machine Translation to Others 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 514-538
作者:  Yang Zhao;  Jiajun Zhang;  Chengqing Zong
Adobe PDF(1415Kb)  |  收藏  |  浏览/下载:14/5  |  提交时间:2024/04/23
Neural machine translation, Transformer, document neural machine translation (NMT), multimodal NMT, low-resource NMT  
A Review of Predictive and Contrastive Self-supervised Learning for Medical Images 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 483-513
作者:  Wei-Chien Wang;  Euijoon Ahn;  Dagan Feng;  Jinman Kim
Adobe PDF(2691Kb)  |  收藏  |  浏览/下载:13/5  |  提交时间:2024/04/23
Self-supervised learning (SSL), contrastive learning, deep learning, medical image analysis, computer vision  
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 447-482
作者:  Xiao Wang;  Guangyao Chen;  Guangwu Qian;  Pengcheng Gao;  Xiao-Yong Wei;  Yaowei Wang;  Yonghong Tian;  Wen Gao
Adobe PDF(3540Kb)  |  收藏  |  浏览/下载:17/3  |  提交时间:2024/04/23
Multi-modal (MM), pre-trained model (PTM), information fusion, representation learning, deep learning  
Masked Vision-language Transformer in Fashion 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 421-434
作者:  Ge-Peng Ji;  Mingchen Zhuge;  Dehong Gao;  Deng-Ping Fan;  Christos Sakaridis;  Luc Van Gool
Adobe PDF(2779Kb)  |  收藏  |  浏览/下载:7/2  |  提交时间:2024/04/23
Vision-language, masked image reconstruction, transformer, fashion, e-commercial  
Deep Learning-based Moving Object Segmentation: Recent Progress and Research Prospects 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 335-369
作者:  Rui Jiang;  Ruixiang Zhu;  Hu Su;  Yinlin Li;  Yuan Xie;  Wei Zou
Adobe PDF(9061Kb)  |  收藏  |  浏览/下载:8/0  |  提交时间:2024/04/23
Moving object segmentation (MOS), change detection, background subtraction, deep learning (DL), video understanding  
Vision Enhanced Generative Pre-trained Language Model for Multimodal Sentence Summarization 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 289-298
作者:  Liqiang Jing;  Yiren Li;  Junhao Xu;  Yongcan Yu;  Pei Shen;  Xuemeng Song
Adobe PDF(2389Kb)  |  收藏  |  浏览/下载:8/3  |  提交时间:2024/04/23
Multimodal sentence summarization (MMSS)  generative pre-trained language model (GPLM)  natural language generation  deep learning  artificial intelligence