CASIA OpenIR

浏览/检索结果: 共235条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Adaptively Enhancing Facial Expression Crucial Regions via a Local Non-local Joint Network 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 331-348
作者:  Guanghui Shi;  Shasha Mao;  Shuiping Gou;  Dandan Yan;  Licheng Jiao;  Lin Xiong
Adobe PDF(3926Kb)  |  收藏  |  浏览/下载:3/0  |  提交时间:2024/04/23
Facial expression recognition, deep neural network, multiple network ensemble, attention network, facial crucial regions  
A Comprehensive Overview of CFN From a Commonsense Perspective 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 239-256
作者:  Ru Li;  Yunxiao Zhao;  Zhiqiang Wang;  Xuefeng Su;  Shaoru Guo;  Yong Guan;  Xiaoqi Han;  Hongyan Zhao
Adobe PDF(2392Kb)  |  收藏  |  浏览/下载:1/0  |  提交时间:2024/04/23
Chinese FrameNet (CFN), commonsense, scenario commonsense, frame, knowledge  
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 447-482
作者:  Xiao Wang;  Guangyao Chen;  Guangwu Qian;  Pengcheng Gao;  Xiao-Yong Wei;  Yaowei Wang;  Yonghong Tian;  Wen Gao
Adobe PDF(3540Kb)  |  收藏  |  浏览/下载:2/0  |  提交时间:2024/04/23
Multi-modal (MM), pre-trained model (PTM), information fusion, representation learning, deep learning  
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 318-334
作者:  Wai-Chung Kwan;  Hong-Ru Wang;  Hui-Min Wang;  Kam-Fai Wong
Adobe PDF(2211Kb)  |  收藏  |  浏览/下载:0/0  |  提交时间:2024/04/23
Dialogue policy learning (DPL), task-oriented dialogue system (TOD), reinforcement learning (RL), dialogue system, Markov decision process  
Vision Enhanced Generative Pre-trained Language Model for Multimodal Sentence Summarization 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 289-298
作者:  Liqiang Jing;  Yiren Li;  Junhao Xu;  Yongcan Yu;  Pei Shen;  Xuemeng Song
Adobe PDF(2389Kb)  |  收藏  |  浏览/下载:3/0  |  提交时间:2024/04/23
Multimodal sentence summarization (MMSS)  generative pre-trained language model (GPLM)  natural language generation  deep learning  artificial intelligence  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Fei-Long Chen;  Du-Zhen Zhang;  Ming-Lun Han;  Xiu-Yi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1427Kb)  |  收藏  |  浏览/下载:1/0  |  提交时间:2024/04/23
Vision and language  pre-training  transformers  multimodal learning  representation learning  
Causal Reasoning Meets Visual Representation Learning: A Prospective Study 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 6, 页码: 485-511
作者:  Yang Liu;  Yu-Shen Wei;  Hong Yan;  Guan-Bin Li;  Liang Lin
Adobe PDF(3224Kb)  |  收藏  |  浏览/下载:3/0  |  提交时间:2024/04/23
Causal reasoning  visual representation learning  reliable artificial intelligence  spatial-temporal data  multi-modal analysis  
Clause-level Relationship-aware Math Word Problems Solver 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 5, 页码: 425-438
作者:  Chang-Yang Wu;  Xin Lin;  Zhen-Ya Huang;  Yu Yin;  Jia-Yu Liu;  Qi Liu;  Gang Zhou
Adobe PDF(2063Kb)  |  收藏  |  浏览/下载:2/0  |  提交时间:2024/04/23
Artificial intelligence (AI)  artificial neural network (ANN)  computational mathematics  machine intelligence  machine learning  
TwinNet: Twin Structured Knowledge Transfer Network for Weakly Supervised Action Localization 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 3, 页码: 227-246
作者:  Xiao-Yu Zhang;  Hai-Chao Shi;  Chang-Sheng Li;  Li-Xin Duan
Adobe PDF(3616Kb)  |  收藏  |  浏览/下载:2/0  |  提交时间:2024/04/23
Knowledge transfer  weakly supervised learning  self-attention mechanism  representation learning  action localization  
Visual Semantic Segmentation Based on Few/Zero-Shot Learning: An Overview 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 5, 页码: 1106-1126
作者:  Wenqi Ren;  Yang Tang;  Qiyu Sun;  Chaoqiang Zhao;  Qing-Long Han
Adobe PDF(12695Kb)  |  收藏  |  浏览/下载:10/1  |  提交时间:2024/04/10
Computer vision  deep learning  few-shot learning  low-shot learning  semantic segmentation  zero-shot learning