Pattern generation through feature values modification and decision tree ensemble construction

An ensemble method produces diverse classifiers and combines their decisions for ensemble’s decision. A number of methods have been investigated for constructing ensemble in which some of them train classifiers with the generated patterns. This study investigates a new technique of training patter...

Full description

Saved in:
Bibliographic Details
Main Authors: Akhand, M. A. H, Rahman, M.M. Hafizur, Murase, K.
Format: Article
Language:English
Published: IACSIT Press 2013
Subjects:
Online Access:http://irep.iium.edu.my/31742/4/IJMLC_2013.pdf
http://irep.iium.edu.my/31742/
http://www.ijmlc.org/index.php?m=content&c=index&a=show&catid=39&id=362
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:An ensemble method produces diverse classifiers and combines their decisions for ensemble’s decision. A number of methods have been investigated for constructing ensemble in which some of them train classifiers with the generated patterns. This study investigates a new technique of training pattern generation that is easy and effective for ensemble construction. The method modifies feature values of some patterns with the values of other patterns to generate different patterns for different classifiers. The ensemble of decision trees based on the proposed technique was evaluated using a suite of 30 benchmark classification problems, and was found to achieve performance better than or competitive with related conventional methods. Furthermore, two different hybrid ensemble methods have been investigated incorporating the proposed technique of pattern generation with two popular ensemble methods bagging and random subspace method (RSM). It is found that the performance of bagging and RSM algorithms can be improved by incorporating feature values modification with their training processes. Experimental investigation of different types of modification techniques finds that feature values modification with pattern values in the same class is better for generalization.