CASIA OpenIR

浏览/检索结果: 共38条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
视觉语言导航研究进展 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 1, 页码: 1-14
作者:  司马双霖;  黄岩;  何科技;  安东;  袁辉;  王亮
Adobe PDF(6272Kb)  |  收藏  |  浏览/下载:12/2  |  提交时间:2024/05/09
视觉语言导航  视觉语言理解  跨模态匹配  具身智能  
Transformer: A General Framework from Machine Translation to Others 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 514-538
作者:  Yang Zhao;  Jiajun Zhang;  Chengqing Zong
Adobe PDF(1415Kb)  |  收藏  |  浏览/下载:12/4  |  提交时间:2024/04/23
Neural machine translation, Transformer, document neural machine translation (NMT), multimodal NMT, low-resource NMT  
Multimodal Pretraining from Monolingual to Multilingual 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 220-232
作者:  Liang Zhang;  Ludan Ruan;  Anwen Hu;  Qin Jin
Adobe PDF(3024Kb)  |  收藏  |  浏览/下载:9/4  |  提交时间:2024/04/23
Multilingual pretraining  multimodal pretraining  cross-lingual transfer  multilingual generation  cross-modal retrieval  
Cross-Lingual Text Image Recognition via Multi-Hierarchy Cross-Modal Mimic 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 4830-4841
作者:  Chen, Zhuo;  Yin, Fei;  Yang, Qing;  Liu, Cheng-Lin
收藏  |  浏览/下载:25/0  |  提交时间:2024/02/22
Cross-lingual text image recognition  cross-modal mimic  multihierarchy mimic  
Contrastive Adversarial Training for Multi-Modal Machine Translation 期刊论文
ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, 卷号: 22, 期号: 6, 页码: 157:1-18
作者:  Huang X(黄鑫);  Zhang JJ(张家俊);  Zong CQ(宗成庆)
Adobe PDF(2387Kb)  |  收藏  |  浏览/下载:203/59  |  提交时间:2023/06/26
contrastive learning  adversarial training  multi-modal machine translation  
Generating Emotion Descriptions for Fine Art Paintings via Multiple Painting Representations 期刊论文
IEEE Intelligent Systems, 2023, 卷号: 38, 期号: 3, 页码: 31-40
作者:  Lu, Yue;  Guo, Chao;  Dai, Xingyuan;  Wang, Fei-Yue
Adobe PDF(1045Kb)  |  收藏  |  浏览/下载:99/14  |  提交时间:2023/06/25
painting captioning  
Synchronous Inference for Multilingual Neural Machine Translation 期刊论文
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2022, 期号: 30, 页码: 1827
作者:  Wang, Qian;  Zhang, Jiajun;  Zong, Chengqing
Adobe PDF(1738Kb)  |  收藏  |  浏览/下载:150/48  |  提交时间:2022/12/19
A Bi-population Cooperative Optimization Algorithm Assisted by an Autoencoder for Medium-scale Expensive Problems 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 11, 页码: 1952-1966
作者:  Meiji Cui;  Li Li;  MengChu Zhou;  Jiankai Li;  Abdullah Abusorrah;  Khaled Sedraoui
Adobe PDF(4431Kb)  |  收藏  |  浏览/下载:200/40  |  提交时间:2022/10/09
Autoencoder  dimension reduction  evolutionary algorithm  medium-scale expensive problems  teaching-learning-based optimization  
Mixed-Supervised Scene Text Detection With Expectation-Maximization Algorithm 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 5513-5528
作者:  Zhao, Mengbiao;  Feng, Wei;  Yin, Fei;  Zhang, Xu-Yao;  Liu, Cheng-Lin
Adobe PDF(5999Kb)  |  收藏  |  浏览/下载:315/35  |  提交时间:2022/09/19
Costs  Annotations  Training  Labeling  Detectors  Data models  Benchmark testing  Mixed-supervised learning  scene text detection  weak supervision forms  expectation-maximization algorithm  
Visuals to Text: A Comprehensive Review on Automatic Image Captioning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 8, 页码: 1339-1365
作者:  Yue Ming;  Nannan Hu;  Chunxiao Fan;  Fan Feng;  Jiangwan Zhou;  Hui Yu
Adobe PDF(56128Kb)  |  收藏  |  浏览/下载:157/21  |  提交时间:2022/08/01
Artificial intelligence  attention mechanism  encoder-decoder framework  image captioning  multi-modal understanding  training strategies