Revisiting the Class Imbalance Issue in Software Defect Prediction

Author Topic: Revisiting the Class Imbalance Issue in Software Defect Prediction  (Read 460 times)

Offline Monir Hossan

  • Sr. Member
  • ****
  • Posts: 343
  • Remain honest in every sphere of life!
    • View Profile
    • Daffodil International University
Revisiting the Class Imbalance Issue in Software Defect Prediction

Abstract:
Software defect prediction is related to the testing area of software industry. Several methods have been developed for the prediction of bugs in software source codes. The objective of this study is to find the inconsistency of performance between imbalances and balance data set and to find the distinction of performance between single classifier and aggregate classifier (voting). In this investigation, eight publicly available data sets have collected, also seven algorithms and hard voting are used for finding precision, recall and F-1 score to predict software defect. In these collected data, two sets are almost balanced. For this investigation, these balanced data sets have converted into imbalanced sets as average non-defective and defective ratio of the other 6 data sets. The experiment result shows that performance of the two balanced data sets is lower than other six sets. After conversion of two data sets, the performance has increased as like as other six data sets. Another observation is the performance metric that shows the results of precision, recall and F1-score for voting are 0.92, 0.84 and 0.87 respectively, which are better than other single classifier. This study has been able to shows that- imbalance of non-defective and defective classes have a big impact on software defect prediction and the voting is the best performer among the classifiers.

Authors: Md. Fahimuzzman Sohan ; Md Alamgir Kabir ; Md. Ismail Jabiullah ; Sheikh Shah Mohammad Motiur Rahman

Source: 2nd International Conference on Electrical, Computer and Communication Engineering, ECCE 2019
Link: https://ieeexplore.ieee.org/abstract/document/8679382
Mohammad Monir Hossan
Senior Assistant Director (Division of Research)
E-mail: monirhossain@daffodilvarsity.edu.bd

Offline drrana

  • Sr. Member
  • ****
  • Posts: 325
  • Test
    • View Profile
Re: Revisiting the Class Imbalance Issue in Software Defect Prediction
« Reply #1 on: February 20, 2020, 12:11:01 PM »
 :o 8)