Minimizers of the empirical risk and risk monotonicity

Loog, M; Viering, Tom J.; Mey, A.

Minimizers of the empirical risk and risk monotonicity

Conference paper (2019)

Authors

M Loog Pattern Recognition and Bioinformatics -

Tom J. Viering Computer Science & Engineering-Teaching Team -

A. Mey Interactive Intelligence -

Research Group

Pattern Recognition and Bioinformatics () (TU Delft)

To reference this document use:

http://resolver.tudelft.nl/uuid:ccd5f21c-b354-41d2-85f7-68f10fe20db8

More Info

expand_more

Published Date

2019

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Department

Intelligent Systems

Research Group

Pattern Recognition and Bioinformatics

Abstract

Plotting a learner’s average performance against the number of training samples results in a learning curve. Studying such curves on one or more data sets is a way to get to a better understanding of the generalization properties of this learner. The behavior of learning curves is, however, not very well understood and can display (for most researchers) quite unexpected behavior. Our work introduces the formal notion of risk monotonicity, which asks the risk to not deteriorate with increasing training set sizes in expectation over the training samples. We then present the surprising result that various standard learners, specifically those that minimize the empirical risk, can act nonmonotonically irrespective of the training sample size. We provide a theoretical underpinning for specific instantiations from classification, regression, and density estimation. Altogether, the proposed monotonicity notion opens up a whole new direction of research.

Files

NeurIPS_2019_minimizers_of_the... (pdf)

(pdf | 0.404 Mb)

Unknown license