Piecewise Constant and Linear Regression Trees

van den Bos, Mim; van der Linden, Jacobus G.M.; Demirovic, E.

Piecewise Constant and Linear Regression Trees

An Optimal Dynamic Programming Approach

Journal article (2024)

Authors

Mim van den Bos Student

Jacobus G.M. van der Linden Student

E. Demirovic Algorithmics

Faculty

Electrical Engineering, Mathematics and Computer Science

To reference this document use:

http://resolver.tudelft.nl/uuid:a33970cf-e928-433f-97c2-cafc9c97bf38

More Info

expand_more

Published Date

2024

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Abstract

Regression trees are a human-comprehensible machine-learning model that can represent complex relationships. They are typically trained using greedy heuristics because computing optimal regression trees is NP-hard. Contrary to this standard practice, we consider optimal methods and improve the scalability of optimal methods by developing three new dynamic programming approaches. First, we improve the performance of a piecewise constant regression tree method using a special algorithm for trees of depth two. Second, we provide the first optimal dynamic programming method for piecewise multiple linear regression. Third, we develop the first optimal method for piecewise simple linear regression, for which we also provide a special algorithm for trees of depth two. The experimental results show that our methods improve scalability by one or more orders of magnitude over the state-of-the-art optimal methods while performing similarly or better in out-of-sample performance.

Files

Van-den-bos24a.pdf

(pdf | 0.704 Mb)

- Embargo expired in 03-02-2025

Unknown license