A Distributed Framework for Large-scale Protein-protein Interaction Data Analysis and Prediction Using MapReduce
Lun Hu; Shicheng Yang; Xin Luo; Huaqiang Yuan; Khaled Sedraoui; MengChu Zhou
Journal: IEEE/CAA Journal of Automatica Sinica
ISSN: 2329-9266
Year: 2022
Volume: 9, Issue: 1, Pages: 160-172
Abstract: Protein-protein interactions are of great significance for understanding the functional mechanisms of proteins. With the rapid development of high-throughput genomic technologies, massive protein-protein interaction (PPI) data have been generated, making them very difficult to analyze efficiently. To address this problem, this paper presents a distributed framework that reimplements one of the state-of-the-art algorithms, CoFex, using MapReduce. To do so, an in-depth analysis of CoFex's limitations is conducted from the perspectives of efficiency and memory consumption when applying it to large-scale PPI data analysis and prediction. Respective solutions are then devised to overcome these limitations. In particular, we adopt a novel tree-based data structure to reduce the heavy memory consumption caused by the huge volume of protein sequence information. After that, the procedure of CoFex is modified to follow the MapReduce framework so that the prediction task is carried out in a distributed manner. A series of extensive experiments has been conducted to evaluate the performance of our framework in terms of both efficiency and accuracy. Experimental results demonstrate that the proposed framework improves computational efficiency by more than two orders of magnitude while retaining the same high accuracy.
Keywords: Distributed computing; large-scale prediction; machine learning; MapReduce; protein-protein interaction (PPI)
DOI: 10.1109/JAS.2021.1004198
Citation statistics
Times cited: 38 [WOS]
Document type: Journal article
Item identifier: http://ir.ia.ac.cn/handle/173211/45982
Collection: Academic Journals_IEEE/CAA Journal of Automatica Sinica
Recommended citation
GB/T 7714: Lun Hu, Shicheng Yang, Xin Luo, et al. A Distributed Framework for Large-scale Protein-protein Interaction Data Analysis and Prediction Using MapReduce[J]. IEEE/CAA Journal of Automatica Sinica, 2022, 9(1): 160-172.
APA: Lun Hu, Shicheng Yang, Xin Luo, Huaqiang Yuan, Khaled Sedraoui, & MengChu Zhou. (2022). A Distributed Framework for Large-scale Protein-protein Interaction Data Analysis and Prediction Using MapReduce. IEEE/CAA Journal of Automatica Sinica, 9(1), 160-172.
MLA: Lun Hu, et al. "A Distributed Framework for Large-scale Protein-protein Interaction Data Analysis and Prediction Using MapReduce". IEEE/CAA Journal of Automatica Sinica 9.1 (2022): 160-172.
Files in this item
File name/size: JAS-2021-0399.pdf (1993 KB)
Document type: Journal article
Version: Published version
Access: Open access
License: CC BY-NC-SA