Improving Ranking-Oriented Defect Prediction Using a Cost-Sensitive Ranking SVM
Yu, Xiao1,2; Liu, Jin1,3,4; Keung, Jacky Wai2; Li, Qing5; Bennin, Kwabena Ebo6; Xu, Zhou1; Wang, Junping7; Cui, Xiaohui8
发表期刊IEEE TRANSACTIONS ON RELIABILITY
ISSN0018-9529
2020-03-01
卷号69期号:1页码:139-153
摘要

Context: Ranking-oriented defect prediction (RODP) ranks software modules to allocate limited testing resources to each module according to the predicted number of defects. Most RODP methods overlook that ranking a module with more defects incorrectly makes it difficult to successfully find all of the defects in the module due to fewer testing resources being allocated to the module, which results in much higher costs than incorrectly ranking the modules with fewer defects, and the numbers of defects in software modules are highly imbalanced in defective software datasets. Cost-sensitive learning is an effective technique in handling the cost issue and data imbalance problem for software defect prediction. However, the effectiveness of cost-sensitive learning has not been investigated in RODP models. Aims: In this article, we propose a cost-sensitive ranking support vector machine (SVM) (CSRankSVM) algorithm to improve the performance of RODP models. Method: CSRankSVM modifies the loss function of the ranking SVM algorithm by adding two penalty parameters to address both the cost issue and the data imbalance problem. Additionally, the loss function of the CSRankSVM is optimized using a genetic algorithm. Results: The experimental results for 11 project datasets with 41 releases show that CSRankSVM achieves 1.12%-15.68% higher average fault percentile average (FPA) values than the five existing RODP methods (i.e., decision tree regression, linear regression, Bayesian ridge regression, ranking SVM, and learning-to-rank (LTR)) and 1.08%-15.74% higher average FPA values than the four data imbalance learning methods (i.e., random undersampling and a synthetic minority oversampling technique; two data resampling methods; RankBoost, an ensemble learning method; IRSVM, a CSRankSVM method for information retrieval). Conclusion: CSRankSVM is capable of handling the cost issue and data imbalance problem in RODP methods and achieves better performance. Therefore, CSRankSVM is recommended as an effective method for RODP.

关键词Support vector machines Software Prediction algorithms Predictive models Testing Software algorithms Computer science Cost-sensitive learning data imbalance ranking-oriented defect prediction (RODP)
DOI10.1109/TR.2019.2931559
关键词[WOS]SUPPORT VECTOR MACHINE ; GENETIC ALGORITHM ; FEATURE-SELECTION ; NEURAL-NETWORKS ; COUNT MODELS ; SOFTWARE ; CLASSIFICATION ; REGRESSION ; NUMBER ; FAULTS
收录类别SCI
语种英语
资助项目National Key R&D Program of China[2018YFC1604000] ; National Natural Science Foundation of China[61572374] ; National Natural Science Foundation of China[U163620068] ; National Natural Science Foundation of China[U1135005] ; National Natural Science Foundation of China[61572371] ; National Natural Science Foundation of China[61772525] ; Open Fund of Key Laboratory of Network Assessment Technology from CAS ; Guangxi Key Laboratory of Trusted Software[kx201607] ; Academic Team Building Plan for Young Scholars from Wuhan University[WHU2016012] ; General Research Fund of the Research Grants Council of Hong Kong[11208017] ; City University of Hong Kong[9678149] ; City University of Hong Kong[7005028] ; Intel[9220097] ; Hong Kong Polytechnic University[9B0V]
项目资助者National Key R&D Program of China ; National Natural Science Foundation of China ; Open Fund of Key Laboratory of Network Assessment Technology from CAS ; Guangxi Key Laboratory of Trusted Software ; Academic Team Building Plan for Young Scholars from Wuhan University ; General Research Fund of the Research Grants Council of Hong Kong ; City University of Hong Kong ; Intel ; Hong Kong Polytechnic University
WOS研究方向Computer Science ; Engineering
WOS类目Computer Science, Hardware & Architecture ; Computer Science, Software Engineering ; Engineering, Electrical & Electronic
WOS记录号WOS:000526289100010
出版者IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
是否为代表性论文
七大方向——子方向分类机器学习
国重实验室规划方向分类认知机理与类脑学习
是否有论文关联数据集需要存交
引用统计
被引频次:40[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://ir.ia.ac.cn/handle/173211/38857
专题多模态人工智能系统全国重点实验室_人工智能与机器学习(杨雪冰)-技术团队
通讯作者Liu, Jin; Keung, Jacky Wai
作者单位1.Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
2.City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
3.Chinese Acad Sci, Inst Informat Engn, Key Lab Network Technol, Beijing 100000, Peoples R China
4.Guilin Univ Elect Technol, Guangxi Key Lab Trusted Software, Guilin 541000, Peoples R China
5.Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
6.Blekinge Inst Technol, Dept Software Engn, S-37134 Karlskrona, Sweden
7.Chinese Acad Sci, Inst Automat, Lab Precis Sensing & Control Ctr, Beijing 100000, Peoples R China
8.Wuhan Univ, Sch Cyber Sci & Engn, Wuhan 430072, Peoples R China
推荐引用方式
GB/T 7714
Yu, Xiao,Liu, Jin,Keung, Jacky Wai,et al. Improving Ranking-Oriented Defect Prediction Using a Cost-Sensitive Ranking SVM[J]. IEEE TRANSACTIONS ON RELIABILITY,2020,69(1):139-153.
APA Yu, Xiao.,Liu, Jin.,Keung, Jacky Wai.,Li, Qing.,Bennin, Kwabena Ebo.,...&Cui, Xiaohui.(2020).Improving Ranking-Oriented Defect Prediction Using a Cost-Sensitive Ranking SVM.IEEE TRANSACTIONS ON RELIABILITY,69(1),139-153.
MLA Yu, Xiao,et al."Improving Ranking-Oriented Defect Prediction Using a Cost-Sensitive Ranking SVM".IEEE TRANSACTIONS ON RELIABILITY 69.1(2020):139-153.
条目包含的文件 下载所有文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
Improving_Ranking-Or(2373KB)期刊论文作者接受稿开放获取CC BY-NC-SA浏览 下载
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Yu, Xiao]的文章
[Liu, Jin]的文章
[Keung, Jacky Wai]的文章
百度学术
百度学术中相似的文章
[Yu, Xiao]的文章
[Liu, Jin]的文章
[Keung, Jacky Wai]的文章
必应学术
必应学术中相似的文章
[Yu, Xiao]的文章
[Liu, Jin]的文章
[Keung, Jacky Wai]的文章
相关权益政策
暂无数据
收藏/分享
文件名: Improving_Ranking-Oriented_Defect_Prediction_Using_a_Cost-Sensitive_Ranking_SVM (1).pdf
格式: Adobe PDF
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。