Learning Learning Curves

Turan, O.T.; Tax, D.M.J.; Viering, T.J.; Loog, Marco

Learning Learning Curves

Journal article (2025)

Authors

O.T. Turan Pattern Recognition and Bioinformatics -

D.M.J. Tax Pattern Recognition and Bioinformatics -

T.J. Viering Pattern Recognition and Bioinformatics -

Marco Loog Pattern Recognition and Bioinformatics - , Radboud Universiteit Nijmegen

Research Group

Pattern Recognition and Bioinformatics () (TU Delft)

Https://repository.tudelft.nl/Thing_15c64a64-14c4-4768-add2-2af3fc139b14 Extrapolation Learning curves Data-driven modeling Kernel methods Meta-learning Functional data analysis

To reference this document use:

http://resolver.tudelft.nl/uuid:15c64a64-14c4-4768-add2-2af3fc139b14

More Info

expand_more

Published Date

2025

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Department

Intelligent Systems

Research Group

Pattern Recognition and Bioinformatics

Abstract

Learning curves depict how a model’s expected performance changes with varying training set sizes, unlike training curves, showing a gradient-based model’s performance with respect to training epochs. Extrapolating learning curves can be useful for determining the performance gain with additional data. Parametric functions, that assume monotone behaviour of the curves, are a prevalent methodology to model and extrapolate learning curves. However, learning curves do not necessarily follow a specific parametric shape: they can have peaks, dips, and zigzag patterns. These unconventional shapes can hinder the extrapolation performance of commonly used parametric curve-fitting models. In addition, the objective functions for fitting such parametric models are non-convex, making them initialization-dependent and brittle. In response to these challenges, we propose a convex, data-driven approach that extracts information from available learning curves to guide the extrapolation of another targeted learning curve. Our method achieves this through using a learning curve database. Using the initial segment of the observed curve, we determine a group of similar curves from the database and reduce the dimensionality via Functional Principle Component Analysis FPCA. These principal components are used in a semi-parametric kernel ridge regression (SPKR) model to extrapolate targeted curves. The solution of the SPKR can be obtained analytically and does not suffer from initialization issues. To evaluate our method, we create a new database of diverse learning curves that do not always adhere to typical parametric shapes. Our method performs better than parametric non-parametric learning curve-fitting methods on this database for the learning curve extrapolation task.

Files

S10044-024-01394-6.pdf

(pdf | 4.37 Mb)

Unknown license

File under embargo until 07-07-2025