ENSEMBLE META CLASSIFIER WITH SAMPLING AND FEATURE SELECTION FOR DATA WITH IMBALANCE MULTICLASS PROBLEM

  • Mohd Shamrie Sainin Faculty of Computing and Informatics, Universiti Malaysia Sabah, Malaysia
  • Rayner Alfred Faculty of Computing and Informatics, Universiti Malaysia Sabah, Malaysia
  • Faudziah Ahmad School of Computing, Universiti Utara Malaysia, Malaysia

Abstract

Ensemble learning by combining several single classifiers or another ensemble classifier is one of the procedures to solve the imbalance problem in multiclass data. However, this approach still faces the question of how the ensemble methods obtain their higher performance. In this paper, an investigation was carried out on the design of the meta classifier ensemble with sampling and feature selection for multiclass imbalanced data. The specific objectives were: 1) to improve the ensemble classifier through data-level approach (sampling and feature selection); 2) to perform experiments on sampling, feature selection, and ensemble classifier model; and 3 ) to evaluate t he performance of the ensemble classifier. To fulfil the objectives, a preliminary data collection of Malaysian plants’ leaf images was prepared and experimented, and the results were compared. The ensemble design was also tested with three other high imbalance ratio benchmark data. It was found that the design using sampling, feature selection, and ensemble classifier method via AdaboostM1 with random forest (also an ensemble classifier) provided improved performance throughout the investigation. The result of this study is important to the on-going problem of multiclass imbalance where specific structure and its performance can be improved in terms of processing time and accuracy.

Author Biography

Mohd Shamrie Sainin, Faculty of Computing and Informatics, Universiti Malaysia Sabah, Malaysia

Senior Lecturer
Faculty Of Computing and Informatics,
Universiti Malaysia Sabah, Jalan UMS,
88400, Kota Kinabalu, Sabah,

Published
2021-02-23
How to Cite
SAININ, Mohd Shamrie; ALFRED, Rayner; AHMAD, Faudziah. ENSEMBLE META CLASSIFIER WITH SAMPLING AND FEATURE SELECTION FOR DATA WITH IMBALANCE MULTICLASS PROBLEM. Journal of Information and Communication Technology, [S.l.], v. 20, n. 2, p. 103-133, feb. 2021. ISSN 2180-3862. Available at: <http://e-journal.uum.edu.my/index.php/jict/article/view/jict2021.20.2.1>. Date accessed: 15 apr. 2021. doi: https://doi.org/10.32890/jict2021.20.2.1.