Online adaptive decision trees based on concentration inequalities.

dc.contributor.authorFrías-Blanco, Isvani
dc.contributor.authorDel-Campo-Ávila, José
dc.contributor.authorRamos-Jiménez, Gonzalo Pascual
dc.contributor.authorCarvalho, Andre C.P.L.F.
dc.contributor.authorOrtiz-Díaz, Agustín
dc.contributor.authorMorales-Bueno, Rafael
dc.date.accessioned2024-02-09T07:28:40Z
dc.date.available2024-02-09T07:28:40Z
dc.date.created2024
dc.date.issued2016-04-21
dc.departamentoLenguajes y Ciencias de la Computación
dc.description.abstractClassification trees are a powerful tool for mining non-stationary data streams. In these situations, mas sive data are constantly generated at high speed and the underlying target function can change over time. The iadem family of algorithms is based on Hoeffding’s and Chernoff’s bounds and induces online deci sion trees from data streams, but is not able to handle concept drift. This study extends this family to deal with time-changing data streams. The new online algorithm, named iadem-3, performs two main actions in response to a concept drift. Firstly, it resets the variables affected by the change and main tains unbroken the structure of the tree, which allows for changes in which consecutive target functions are very similar. Secondly, it creates alternative models that replace parts of the main tree when they significantly improve the accuracy of the model, thereby rebuilding the main tree if needed. An online change detector and a non-parametric statistical test based on Hoeffding’s bounds are used to guaran tee this significance. A new pruning method is also incorporated in iadem-3, making sure that all split tests previously installed in decision nodes are useful. The learning model is also viewed as an ensem ble of classifiers, and predictions of the main and alternative models are combined to classify unlabeled examples. iadem-3 is empirically compared with various well-known decision tree induction algorithms for concept drift detection. We empirically show that our new algorithm often reaches higher levels of accuracy with smaller decision tree models, maintaining the processing time bounded, irrespective of the number of instances processed.es_ES
dc.identifier.citationFrías-Blanco, I., Campo-Ávila, J. del, Ramos-Jiménez, G., Carvalho, A. C. P. L. F., Ortiz-Díaz, A., & Morales-Bueno, R. (2016). Online adaptive decision trees based on concentration inequalities. Knowledge-Based Systems, 104, 179–194.es_ES
dc.identifier.doi10.1016/j.knosys.2016.04.019
dc.identifier.urihttps://hdl.handle.net/10630/30225
dc.language.isoenges_ES
dc.publisherElsevieres_ES
dc.rights.accessRightsopen accesses_ES
dc.subjectAprendizaje automático (Inteligencia artificial)es_ES
dc.subjectDatos masivoses_ES
dc.subject.otherMachine Learninges_ES
dc.subject.otherDecision Treeses_ES
dc.subject.otherConcentration Inequalitieses_ES
dc.subject.otherArtificial Intelligencees_ES
dc.subject.otherData Mininges_ES
dc.titleOnline adaptive decision trees based on concentration inequalities.es_ES
dc.typejournal articlees_ES
dc.type.hasVersionSMURes_ES
dspace.entity.typePublication
relation.isAuthorOfPublication94274f5d-d8b4-488c-a1de-2e0744acaf5b
relation.isAuthorOfPublicationb955a101-7349-453f-a9f4-41805f6c1052
relation.isAuthorOfPublication6785abd4-5d88-4012-ad1f-cd3854d0ceff
relation.isAuthorOfPublication.latestForDiscovery94274f5d-d8b4-488c-a1de-2e0744acaf5b

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Knowl-Based Syst - Preprint - 2016 - Gonzalo Ramos Jiménez.pdf
Size:
753.95 KB
Format:
Adobe Portable Document Format
Description:

Collections