CASIA OpenIR

浏览/检索结果: 共10条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Comprehensive Relation Modelling for Image Paragraph Generation 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 369-382
作者:  Xianglu Zhu;  Zhang Zhang;  Wei Wang;  Zilei Wang
Adobe PDF(1963Kb)  |  收藏  |  浏览/下载:4/2  |  提交时间:2024/04/23
Image paragraph generation, visual relationship, scene graph, graph convolutional network (GCN), long short-term memory  
State of the Art on Deep Learning-enhanced Rendering Methods 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 6, 页码: 799-821
作者:  Qi Wang;  Zhihua Zhong;  Yuchi Huo;  Hujun Bao;  Rui Wang
Adobe PDF(6540Kb)  |  收藏  |  浏览/下载:7/2  |  提交时间:2024/04/23
Neural rendering, computer graphics, scene representation, rendering, post-processing  
Masked Vision-language Transformer in Fashion 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 421-434
作者:  Ge-Peng Ji;  Mingchen Zhuge;  Dehong Gao;  Deng-Ping Fan;  Christos Sakaridis;  Luc Van Gool
Adobe PDF(2779Kb)  |  收藏  |  浏览/下载:4/2  |  提交时间:2024/04/23
Vision-language, masked image reconstruction, transformer, fashion, e-commercial  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Fei-Long Chen;  Du-Zhen Zhang;  Ming-Lun Han;  Xiu-Yi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1427Kb)  |  收藏  |  浏览/下载:3/2  |  提交时间:2024/04/23
Vision and language  pre-training  transformers  multimodal learning  representation learning  
Federated Learning with Privacy-preserving and Model IP-right-protection 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 19-37
作者:  Qiang Yang;  Anbu Huang;  Lixin Fan;  Chee Seng Chan;  Jian Han Lim;  Kam Woh Ng;  Ding Sheng Ong;  Bowen Li
Adobe PDF(2634Kb)  |  收藏  |  浏览/下载:7/2  |  提交时间:2024/04/23
Federated learning  privacy-preserving machine learning  security  decentralized learning  intellectual property protection  
Causal Reasoning Meets Visual Representation Learning: A Prospective Study 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 6, 页码: 485-511
作者:  Yang Liu;  Yu-Shen Wei;  Hong Yan;  Guan-Bin Li;  Liang Lin
Adobe PDF(3224Kb)  |  收藏  |  浏览/下载:4/0  |  提交时间:2024/04/23
Causal reasoning  visual representation learning  reliable artificial intelligence  spatial-temporal data  multi-modal analysis  
Visual Semantic Segmentation Based on Few/Zero-Shot Learning: An Overview 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 5, 页码: 1106-1126
作者:  Wenqi Ren;  Yang Tang;  Qiyu Sun;  Chaoqiang Zhao;  Qing-Long Han
Adobe PDF(12695Kb)  |  收藏  |  浏览/下载:12/2  |  提交时间:2024/04/10
Computer vision  deep learning  few-shot learning  low-shot learning  semantic segmentation  zero-shot learning  
Cyberbullying and Cyberviolence Detection: A Triangular User-Activity-Content View 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 8, 页码: 1384-1405
作者:  Shuwen Wang;  Xingquan Zhu;  Weiping Ding;  Amir Alipour Yengejeh
Adobe PDF(10999Kb)  |  收藏  |  浏览/下载:144/15  |  提交时间:2022/08/01
Classification  clustering  cyberbullying  natural language processing  social network  
Visuals to Text: A Comprehensive Review on Automatic Image Captioning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 8, 页码: 1339-1365
作者:  Yue Ming;  Nannan Hu;  Chunxiao Fan;  Fan Feng;  Jiangwan Zhou;  Hui Yu
Adobe PDF(56128Kb)  |  收藏  |  浏览/下载:151/21  |  提交时间:2022/08/01
Artificial intelligence  attention mechanism  encoder-decoder framework  image captioning  multi-modal understanding  training strategies  
MFSR: Maximum Feature Score Region-based Captions Locating in News Video Images 期刊论文
International Journal of Automation and Computing, 2018, 卷号: 15, 期号: 4, 页码: 454-461
作者:  Zhi-Heng Wang;  Chao Guo;  Hong-Min Liu;  Zhan-Qiang Huo
浏览  |  Adobe PDF(2780Kb)  |  收藏  |  浏览/下载:100/26  |  提交时间:2021/02/23
News video images  captions recognizing  captions locating  content understanding  maximum feature score region (MFSR).