N. GayatriS. NickolasAnusuyah SubbaraoT KhoshgoftaarL BullardK GaoS LessmannB BaesensC MuesS PietschM MeulenM RevillaD RodriguezR RuizJ Cuadrado-GallegoJ Aguilar-RuizK SunghunT ZimmermannE WhiteheadA ZellerS PfleegerC OoiH ChettyM TengSJohn KohaviR PflegerKG FormanC SerafiniM MerlerS JurmanGI GuyonA ElisseeffS DoraisamyS GolzariN NorowiN SulaimanN UdzirM A HallG HolmesG IlczukR MlynarskiW KargulWakulicz-DejaD RodriguezR RuizJ Cuadrado-GallegoJ Aguilar-RuizM GarreZ ChenT MenziesD PortB BoehmN PizziA DemkoW PedryczK JongE MarchioriM SebagVan Der VaartK GaoT KhoshgoftaarH WangN SeliyaXinwang LiuGuomin ZhangYubin ZhanEn ZhuH LiuL YuMarko Robnik-SikonjaIgor KononenkoI GuyonJ WestonS BarnhillV VapnikD AhaD KiblerM AlbertH JohnP LangleyP DomingosM PazzaniLe CessieS Van HouwelingenJY MaB CukicT KhoshgoftaarM GolawalaJ Van Hulse
Software defect prediction models are essential for understanding quality attributes relevant for software organization to deliver better software reliability. This paper focuses mainly based on the selection of attributes in the perspective of software quality estimation for incremental database. A new dimensionality reduction method Wilk’s Lambda Average Threshold (WLAT) is presented for selection of optimal features which are used for classifying modules as fault prone or not. This paper uses software metrics and defect data collected from benchmark data sets. The comparative results confirm that the statistical search algorithm (WLAT) outperforms the other relevant feature selection methods for most classifiers. The main advantage of the proposed WLAT method is: The selected features can be reused when there is increase or decrease in database size, without the need of extracting features afresh. In addition, performances of the defect prediction models either remains unchanged or improved even after eliminating 85% of the software metrics.
Xiao YuZiyi MaChuanxiang MaYi GuRuiqi LiuYan Zhang
Romi Satria WahonoNanna Suryana
Wenzhi ZhuZhiqiang LiHaiyang Liu