DynamicRetriever: A Pre-trained Model-based IR System Without an Explicit Index
Yu-Jia Zhou¹
Journal: Machine Intelligence Research
ISSN: 2731-538X
Publication year: 2023
Volume: 20; Issue: 2; Pages: 276-288
Abstract: Web search provides a promising way for people to obtain information and has been studied extensively. With the surge of deep learning and large-scale pre-training techniques, various neural information retrieval models have been proposed, and they have demonstrated their power to improve search quality, especially ranking quality. All these existing search methods follow a common paradigm, index-retrieve-rerank: they first build an index of all documents based on document terms (a sparse inverted index) or representation vectors (a dense vector index), then retrieve candidate documents and rerank them with ranking models according to the similarity between the query and the documents. In this paper, we explore a new paradigm of information retrieval that uses only a pre-trained model and no explicit index. All knowledge of the documents is instead encoded into the model parameters, which can be regarded as a differentiable indexer and optimized in an end-to-end manner. Specifically, we propose a pre-trained model-based information retrieval (IR) system called DynamicRetriever, which directly returns document identifiers for a given query. Under this framework, we implement two variants to explore how to train the model from scratch and how to combine the advantages of dense retrieval models. Compared with existing search methods, the model-based IR system parameterizes the traditional static index with a pre-trained model, which turns the document semantic mapping into a dynamic and updatable process. Extensive experiments on the public search benchmark Microsoft machine reading comprehension (MS MARCO) verify the effectiveness and potential of the proposed paradigm for information retrieval.
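The idea of replacing an explicit index with model parameters can be illustrated with a toy sketch. This is not the authors' implementation: DynamicRetriever builds on a pre-trained language model, whereas the sketch below swaps in a tiny bag-of-words softmax classifier so that it runs with the Python standard library alone. The corpus, the document identifiers (`D0` to `D2`), and the names `featurize`, `train`, and `retrieve` are all hypothetical and chosen only for illustration.

```python
import math
import random

# Hypothetical toy corpus; docids and texts are invented for illustration.
DOCS = {
    "D0": "neural ranking models for web search",
    "D1": "inverted index compression techniques",
    "D2": "pre-trained language models for question answering",
}


def tokenize(text):
    return text.lower().split()


VOCAB = sorted({tok for text in DOCS.values() for tok in tokenize(text)})
TOKEN_ID = {tok: i for i, tok in enumerate(VOCAB)}
DOCIDS = sorted(DOCS)


def featurize(text):
    """Bag-of-words count vector over the toy vocabulary."""
    vec = [0.0] * len(VOCAB)
    for tok in tokenize(text):
        if tok in TOKEN_ID:
            vec[TOKEN_ID[tok]] += 1.0
    return vec


def softmax(scores):
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def score(W, x):
    """One linear score per docid."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def train(epochs=200, lr=0.5, seed=0):
    """Fit W so that document text maps to its docid.

    After training, document knowledge lives only in W; no inverted
    or vector index is built.
    """
    rng = random.Random(seed)
    W = [[rng.uniform(-0.01, 0.01) for _ in VOCAB] for _ in DOCIDS]
    examples = [(featurize(DOCS[d]), i) for i, d in enumerate(DOCIDS)]
    for _ in range(epochs):
        for x, y in examples:
            probs = softmax(score(W, x))
            for k, row in enumerate(W):
                # Gradient of softmax cross-entropy w.r.t. the score of docid k.
                grad = probs[k] - (1.0 if k == y else 0.0)
                for j, xj in enumerate(x):
                    if xj:
                        row[j] -= lr * grad * xj
    return W


def retrieve(W, query, top_k=1):
    """Return the top_k docids for a query directly from the model."""
    probs = softmax(score(W, featurize(query)))
    ranked = sorted(zip(DOCIDS, probs), key=lambda pair: pair[1], reverse=True)
    return [docid for docid, _ in ranked[:top_k]]


W = train()
print(retrieve(W, "index compression"))
```

In this sketch, adding or changing a document means further gradient updates to `W` rather than rebuilding an index, which loosely mirrors the "dynamic and updatable" property the abstract describes. A realistic system would also need query-to-docid training pairs and a far more expressive model.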
Keywords: Information retrieval (IR); document retrieval; model-based IR; pre-trained language model; differentiable search index
DOI: 10.1007/s11633-022-1373-9
Citation statistics
Times cited: 4 [WOS]
Document type: Journal article
Item identifier: http://ir.ia.ac.cn/handle/173211/51481
Collection: Academic Journals_Machine Intelligence Research
Author affiliations:
1. Gaoling School of Artificial Intelligence, Renmin University of China, Beijing 100872, China
2. Beijing Academy of Artificial Intelligence, Beijing 100084, China
Recommended citation
GB/T 7714: Yu-Jia Zhou. DynamicRetriever: A Pre-trained Model-based IR System Without an Explicit Index[J]. Machine Intelligence Research, 2023, 20(2): 276-288.
APA: Yu-Jia Zhou. (2023). DynamicRetriever: A Pre-trained Model-based IR System Without an Explicit Index. Machine Intelligence Research, 20(2), 276-288.
MLA: Yu-Jia Zhou. "DynamicRetriever: A Pre-trained Model-based IR System Without an Explicit Index". Machine Intelligence Research 20.2 (2023): 276-288.
Files in this item
File name / size: MIR-2022-06-207.pdf (7814 KB); Document type: Journal article; Version: Published version; Access: Open access; License: CC BY-NC-SA
File name: MIR-2022-06-207.pdf
Format: Adobe PDF