Optimizing twins decision tree classification, using genetic algorithms

Farid Seifi; Mohammad Reza Kangavari; Hamed Ahmadi; Ehsan Lotfi; Sanaz Imaniyan; Somayeh Lagzian

doi:10.1109/ukricis.2008.4798957

ScienceGate Book Chapters

JOURNAL ARTICLE

Optimizing twins decision tree classification, using genetic algorithms

Farid Seifi Mohammad Reza Kangavari Hamed Ahmadi Ehsan Lotfi Sanaz Imaniyan Somayeh Lagzian

Year: 2008 Pages: 1-6

DOI: 10.1109/ukricis.2008.4798957

Get Full-Text PDF Get Analytical Report

Abstract

Decision tree classification is one of the most practical and effective methods which is used in inductive learning. Many different approaches, which are usually used for decision making and prediction, have been invented to construct decision tree classifiers. These approaches try to optimize parameters such as accuracy, speed of classification, size of constructed trees, learning speed, and the amount of used memory. There is a trade off between these parameters. That is to say that optimization of one may cause obstruction in the other, hence all existing approaches try to establish equilibrium. In this study, considering the effect of the whole data set on class assigning of any data, we propose a new approach to construct not perfectly accurate, but less complex trees in a short time, using small amount of memory. To achieve this purpose, a multi-step process has been used. We trace the training data set twice in any step, from the beginning to the end and vice versa, to extract the class pattern for attribute selection. Using the selected attribute, we make new branches in the tree. After making branches, the selected attribute and some records of training data set are deleted at the end of any step. This process continues alternatively in several steps for remaining data and attributes until the tree is completely constructed. In order to have an optimized tree the parameters which we use in this algorithm are optimized using genetic algorithms. In order to compare this new approach with previous ones we used some known data sets which have been used in different researches. This approach has been compared with others based on the classification accuracy and also the decision tree size. Experimental results show that it is efficient to use this approach particularly in cases of massive data sets, memory restrictions or short learning time.

Keywords:

Computer science Decision tree ID3 algorithm Construct (python library) Incremental decision tree Tree (set theory) Data mining Decision tree learning Machine learning Set (abstract data type) Artificial intelligence Class (philosophy) Process (computing) Algorithm TRACE (psycholinguistics) Genetic algorithm Interval tree Tree structure Binary tree Mathematics

Metrics

Cited By

0.79

FWCI (Field Weighted Citation Impact)

Refs

0.83

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Data Mining Algorithms and Applications

Physical Sciences → Computer Science → Information Systems

Machine Learning and Data Classification

Physical Sciences → Computer Science → Artificial Intelligence

Imbalanced Data Classification Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Optimizing twins decision tree classification, using genetic algorithms

Abstract

Metrics

Citation History

Topics

Related Documents

Pruning Decision Tree Using Genetic Algorithms

Optimizing Musical Genre Classification Using Genetic Algorithms

Breast Cancer Classification using Decision Tree Algorithms

Optimizing land use classification using decision tree approaches

Optimizing Fuzzy Clinical Decision Support Rules Using Genetic Algorithms