Abstract
In this chapter, we consider two questions related to decision trees: (i) how to construct decision trees with a reasonable number of nodes and a reasonable number of misclassifications when they are used for knowledge representation, and (ii) how to improve the prediction accuracy of decision trees when they are used as classifiers. We developed the so-called multi-pruning approach, based on dynamic programming algorithms for bi-criteria optimization of CART-like decision trees relative to the number of nodes and the number of misclassifications. This approach allows us to construct the set of all Pareto optimal points and to derive, for each such point, decision trees with parameters corresponding to that point. Experiments with decision tables from the UCI ML Repository show that, very often, we can find a suitable Pareto optimal point and derive a decision tree with a small number of nodes at the expense of a small increase in the number of misclassifications. The multi-pruning approach includes a procedure that constructs decision trees which, as classifiers, often outperform decision trees constructed by CART. We also considered a modification of the multi-pruning approach (restricted multi-pruning) that requires less memory and time but usually preserves the quality of the constructed trees, both as classifiers and as a means of knowledge representation. Based on the uncertainty measure abs, which is applicable to decision tables with both single-valued and many-valued decisions, we extended the considered approaches to the case of decision tables with many-valued decisions.
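To illustrate what a Pareto optimal point means for the two criteria named in the abstract, the following is a minimal sketch that filters a list of (number of nodes, number of misclassifications) pairs down to the non-dominated ones. It is not the chapter's dynamic programming algorithm, which builds this set directly from a decision table. The function name `pareto_front` and the sample pairs are hypothetical and were chosen only for illustration.

```python
from typing import List, Tuple

# Hypothetical representation of one candidate tree:
# (number of nodes, number of misclassifications). Both are minimized.
Point = Tuple[int, int]


def pareto_front(points: List[Point]) -> List[Point]:
    """Return the points that no other point dominates.

    A point dominates another if it is no worse in both criteria
    and strictly better in at least one.
    """
    front: List[Point] = []
    best_errors = None
    # After sorting by nodes (then by errors), a point is non-dominated
    # exactly when it has fewer errors than every point seen before it.
    for nodes, errors in sorted(set(points)):
        if best_errors is None or errors < best_errors:
            front.append((nodes, errors))
            best_errors = errors
    return front


# Made-up example data, not results from the chapter.
candidates = [(1, 40), (3, 25), (5, 25), (7, 10), (9, 12), (11, 10)]
print(pareto_front(candidates))
```

With these made-up pairs, the surviving points are (1, 40), (3, 25), and (7, 10). Choosing among them is the trade-off the abstract describes: accepting a few more misclassifications in exchange for a smaller tree.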
Original language | English (US) |
---|---|
Title of host publication | Intelligent Systems Reference Library |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 153-174 |
Number of pages | 22 |
State | Published - 2020 |
Publication series
Name | Intelligent Systems Reference Library |
---|---|
Volume | 156 |
ISSN (Print) | 1868-4394 |
ISSN (Electronic) | 1868-4408 |
Bibliographical note
Publisher Copyright: © 2020, Springer Nature Switzerland AG.
ASJC Scopus subject areas
- General Computer Science
- Information Systems and Management
- Library and Information Sciences