CASIA OpenIR

浏览/检索结果: 共517条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Text Difficulty Study: Do Machines Behave the Same as Humans Regarding Text Difficulty? 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 283-293
作者:  Bowen Chen;  Xiao Ding;  Yi Zhao;  Bo Fu;  Tingmao Lin;  Bing Qin;  Ting Liu
Adobe PDF(1796Kb)  |  收藏  |  浏览/下载:3/0  |  提交时间:2024/04/23
Cognition inspired natural language processing, psycholinguistics, explainability, text difficulty, curriculum learning  
GraphFlow+: Exploiting Conversation Flow in Conversational Machine Comprehension with Graph Neural Networks 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 272-282
作者:  Jing Hu;  Lingfei Wu;  Yu Chen;  Po Hu;  Mohammed J. Zaki
Adobe PDF(1612Kb)  |  收藏  |  浏览/下载:2/1  |  提交时间:2024/04/23
Conversational machine comprehension (MC), reading comprehension, question answering, graph neural networks (GNNs), natural language processing (NLP)  
Deep Video Harmonization by Improving Spatial-temporal Consistency 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 46-54
作者:  Xiuwen Chen;  Li Fang;  Long Ye;  Qin Zhang
Adobe PDF(3779Kb)  |  收藏  |  浏览/下载:0/0  |  提交时间:2024/04/23
Harmonization, temporal consistency, video editing, video composition, nonlocal similarity  
Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 4-28
作者:  Mengting Liu;  Ying Zhou;  Yuwei Wu;  Feng Gao
Adobe PDF(14438Kb)  |  收藏  |  浏览/下载:1/0  |  提交时间:2024/04/23
Artificial intelligence (AI) art, audio-visual, artificial intelligence generated content (AIGC), multimodal, artistic evaluation  
State of the Art on Deep Learning-enhanced Rendering Methods 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 6, 页码: 799-821
作者:  Qi Wang;  Zhihua Zhong;  Yuchi Huo;  Hujun Bao;  Rui Wang
Adobe PDF(6540Kb)  |  收藏  |  浏览/下载:2/1  |  提交时间:2024/04/23
Neural rendering, computer graphics, scene representation, rendering, post-processing  
Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 569-582
作者:  Haoyu Lu;  Yuqi Huo;  Mingyu Ding;  Nanyi Fei;  Zhiwu Lu
Adobe PDF(2928Kb)  |  收藏  |  浏览/下载:0/0  |  提交时间:2024/04/23
Image-text retrieval, multimodal modeling, contrastive learning, weak correlation, computer vision  
Transformer: A General Framework from Machine Translation to Others 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 514-538
作者:  Yang Zhao;  Jiajun Zhang;  Chengqing Zong
Adobe PDF(1415Kb)  |  收藏  |  浏览/下载:3/0  |  提交时间:2024/04/23
Neural machine translation, Transformer, document neural machine translation (NMT), multimodal NMT, low-resource NMT  
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 447-482
作者:  Xiao Wang;  Guangyao Chen;  Guangwu Qian;  Pengcheng Gao;  Xiao-Yong Wei;  Yaowei Wang;  Yonghong Tian;  Wen Gao
Adobe PDF(3540Kb)  |  收藏  |  浏览/下载:2/0  |  提交时间:2024/04/23
Multi-modal (MM), pre-trained model (PTM), information fusion, representation learning, deep learning  
Masked Vision-language Transformer in Fashion 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 421-434
作者:  Ge-Peng Ji;  Mingchen Zhuge;  Dehong Gao;  Deng-Ping Fan;  Christos Sakaridis;  Luc Van Gool
Adobe PDF(2779Kb)  |  收藏  |  浏览/下载:2/1  |  提交时间:2024/04/23
Vision-language, masked image reconstruction, transformer, fashion, e-commercial  
Compositional Prompting Video-language Models to Understand Procedure in Instructional Videos 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 249-262
作者:  Guyue Hu;  Bin He;  Hanwang Zhang
Adobe PDF(2167Kb)  |  收藏  |  浏览/下载:2/0  |  提交时间:2024/04/23
Prompt learning  video-language pretrained models  instructional videos  procedure understanding  knowledge distilling