Online adaptive decision trees based on concentration inequalities.
Loading...
Identifiers
Publication date
Reading date
Collaborators
Advisors
Tutors
Editors
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier
Share
Center
Department/Institute
Abstract
Classification trees are a powerful tool for mining non-stationary data streams. In these situations, mas sive data are constantly generated at high speed and the underlying target function can change over time.
The iadem family of algorithms is based on Hoeffding’s and Chernoff’s bounds and induces online deci sion trees from data streams, but is not able to handle concept drift. This study extends this family to
deal with time-changing data streams. The new online algorithm, named iadem-3, performs two main
actions in response to a concept drift. Firstly, it resets the variables affected by the change and main tains unbroken the structure of the tree, which allows for changes in which consecutive target functions
are very similar. Secondly, it creates alternative models that replace parts of the main tree when they
significantly improve the accuracy of the model, thereby rebuilding the main tree if needed. An online
change detector and a non-parametric statistical test based on Hoeffding’s bounds are used to guaran tee this significance. A new pruning method is also incorporated in iadem-3, making sure that all split
tests previously installed in decision nodes are useful. The learning model is also viewed as an ensem ble of classifiers, and predictions of the main and alternative models are combined to classify unlabeled
examples. iadem-3 is empirically compared with various well-known decision tree induction algorithms
for concept drift detection. We empirically show that our new algorithm often reaches higher levels of
accuracy with smaller decision tree models, maintaining the processing time bounded, irrespective of the
number of instances processed.
Description
Bibliographic citation
Frías-Blanco, I., Campo-Ávila, J. del, Ramos-Jiménez, G., Carvalho, A. C. P. L. F., Ortiz-Díaz, A., & Morales-Bueno, R. (2016). Online adaptive decision trees based on concentration inequalities. Knowledge-Based Systems, 104, 179–194.










