Authored

Manifold regularization is a commonly used technique in semi-supervised learning. It enforces the classification rule to be smooth with respect to the data-manifold. Here, we derive sample complexity bounds based on pseudo-dimension for models that add a convex data dependent reg ...
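
For reference, the classical manifold-regularization objective (Belkin et al., 2006), to which such convex data-dependent regularizers belong, is, for l labeled and u unlabeled points and up to normalization constants,

    f^* = \arg\min_{f \in \mathcal{H}_K} \; \frac{1}{l}\sum_{i=1}^{l} V\big(f(x_i), y_i\big) \;+\; \gamma_A \|f\|_K^2 \;+\; \gamma_I\, \mathbf{f}^\top L\, \mathbf{f},

where V is a loss function, L is the graph Laplacian built from all l + u points and \mathbf{f} = (f(x_1), \dots, f(x_{l+u})). The exact regularizers covered by the bounds are not spelled out in this truncated abstract.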

Learning performance can show non-monotonic behavior. That is, more data does not necessarily lead to better models, even on average. We propose three algorithms that take a supervised learning model and make it perform more monotone. We prove consistency and monotonicity with ...
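
As a rough illustration of what making a learner "more monotone" can mean (the three algorithms themselves are not spelled out in this truncated abstract, so the wrapper below is only an assumption-laden sketch in scikit-learn conventions): retrain as more data arrives, but only adopt the new model if it does at least as well as the incumbent on held-out data.

    from sklearn.base import clone

    def monotone_update(incumbent, incumbent_score, estimator, X_train, y_train, X_val, y_val):
        """Retrain on the enlarged training set, but keep the new model only if it
        scores at least as well as the current one on a held-out validation set."""
        candidate = clone(estimator).fit(X_train, y_train)
        score = candidate.score(X_val, y_val)
        if incumbent is None or score >= incumbent_score:
            return candidate, score
        return incumbent, incumbent_score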

Large text corpora used for creating word embeddings (vectors which represent word meanings) often contain stereotypical gender biases. As a result, such unwanted biases will typically also be present in word embeddings derived from such corpora and in downstream applications in the ...

Plotting a learner’s average performance against the number of training samples results in a learning curve. Studying such curves on one or more data sets is a way to get a better understanding of the generalization properties of this learner. The behavior of learning curves i ...
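
A minimal sketch of how such a curve can be estimated in practice, using scikit-learn's learning_curve; the estimator and data set below are arbitrary placeholders, not the ones studied here.

    import numpy as np
    from sklearn.datasets import load_digits
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import learning_curve

    X, y = load_digits(return_X_y=True)
    sizes, train_scores, test_scores = learning_curve(
        LogisticRegression(max_iter=1000), X, y,
        train_sizes=np.linspace(0.1, 1.0, 5), cv=5)
    # Average test score per training-set size: the empirical learning curve.
    print(list(zip(sizes, test_scores.mean(axis=1))))
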
Active learning algorithms propose what data should be labeled given a pool of unlabeled data. Instead of randomly selecting what data to annotate, active learning strategies aim to select data so as to get a good predictive model with as few labeled samples as possible. Singl ...
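
One common single-criterion strategy is uncertainty sampling; the sketch below illustrates the idea only and is not necessarily among the strategies the paper combines.

    import numpy as np

    def select_most_uncertain(model, X_pool, n_queries=10):
        """Pick the pool samples the current model is least confident about."""
        proba = model.predict_proba(X_pool)        # class probabilities per pooled sample
        confidence = proba.max(axis=1)             # confidence = highest predicted probability
        return np.argsort(confidence)[:n_queries]  # indices to send to the annotator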

Contributed

Rhyming words are one of the most important features in poems. They add rhythm to a poem, and poets use this literary device to convey emotion and meaning to their readers. Thus, detecting rhyming words can aid in adding emotion and enhancing readability when generating poems. ...
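
The detection method itself is cut off in this abstract; purely as an illustration of the task, two words can be treated as rhyming when their phonemes match from the last stressed vowel onward, for example via the CMU pronouncing dictionary (the pronouncing package).

    import pronouncing

    def do_rhyme(word_a, word_b):
        """True if the words share their phonemes from the last stressed vowel to the end."""
        part_a = pronouncing.rhyming_part(pronouncing.phones_for_word(word_a)[0])
        part_b = pronouncing.rhyming_part(pronouncing.phones_for_word(word_b)[0])
        return part_a == part_b

    print(do_rhyme("night", "light"))   # True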

It sounds like Greek to me

Performance of phonetic representations for language identification

This paper compares the performance of two phonetic notations, IPA and ASJPcode, with the alphabetical notation for word-level language identification. Two machine learning models, a Multilayer Perceptron and a Logistic Regression model, are used to classify words using each o ...
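
A hedged sketch of the word-level setup being compared; the character n-gram features and toy words below are assumptions, not necessarily the paper's exact pipeline.

    from sklearn.feature_extraction.text import CountVectorizer
    from sklearn.linear_model import LogisticRegression
    from sklearn.neural_network import MLPClassifier
    from sklearn.pipeline import make_pipeline

    words  = ["nero", "wasser", "agua"]     # hypothetical transcribed words in one notation
    labels = ["ell", "deu", "spa"]          # the language of each word

    for clf in (LogisticRegression(max_iter=1000), MLPClassifier(max_iter=500)):
        model = make_pipeline(CountVectorizer(analyzer="char", ngram_range=(1, 3)), clf)
        model.fit(words, labels)            # repeat per notation: IPA, ASJPcode, alphabetical
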
This research provides an overview of how training Convolutional Neural Networks (CNNs) on imbalanced datasets affects the performance of the CNNs. Datasets can be imbalanced for several reasons; for example, there are naturally fewer samples of rare diseases. Since the ...
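
One frequently used remedy is re-weighting the loss by inverse class frequency; whether it is among the approaches examined here is not stated in the truncated abstract.

    import numpy as np
    from sklearn.utils.class_weight import compute_class_weight

    y_train = np.array([0] * 950 + [1] * 50)    # hypothetical 95/5 class imbalance
    weights = compute_class_weight("balanced", classes=np.unique(y_train), y=y_train)
    class_weight = dict(enumerate(weights))     # {0: ~0.53, 1: 10.0}
    # model.fit(X_train, y_train, class_weight=class_weight)   # e.g. Keras-style usage
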
With an expected 8.3 trillion photos stored in 2021 [1], convolutional neural networks (CNNs) are becoming preeminent in the field of image recognition. However, with these deep neural networks (DNNs) still being seen as black boxes, it is hard to fully employ their capabi ...

Is Wikipedia succeeding in reducing gender bias?

Assessing the development of gender bias in word embeddings from Wikipedia

Large text corpora used for creating word embeddings (vectors which represent word meanings) often contain a stereotypical gender bias. This unwanted bias is then also present in the word embeddings and in downstream applications in the field of natural language processing. To pr ...
Word embeddings are useful for various applications, such as sentiment classification (Tang et al., 2014), word translation (Xing, Wang, Liu, & Lin, 2015) and résumé parsing (Nasser, Sreejith, & Irshad, 2018). Previous research has determined that word embeddings contain ...
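
As an illustration of one widely used probe for such bias (the word lists and metric used in this work are not given in the truncated abstract): project words onto a he/she direction and compare cosine similarities.

    import numpy as np

    def gender_projection(emb, word):
        """Cosine similarity of `word` with the he/she direction in embedding dict `emb`."""
        direction = emb["he"] - emb["she"]
        v = emb[word]
        return float(v @ direction / (np.linalg.norm(v) * np.linalg.norm(direction)))

    # Positive values lean towards "he", negative towards "she"; compare, for example,
    # gender_projection(emb, "nurse") with gender_projection(emb, "engineer").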

Extracting location context from transcripts

A comparison of ELMo and TF-IDF

Using transcripts of the TV-series FRIENDS, this paper explores the problem of predicting the location in which a sentence was said. The research focuses on using feature extraction on the sentences, and training a logistic regression model on those features. Specifically looking ...
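
A sketch of the TF-IDF half of that comparison (the ELMo half needs a pretrained contextual encoder and is omitted); the lines and location labels below are hypothetical.

    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.linear_model import LogisticRegression
    from sklearn.pipeline import make_pipeline

    lines     = ["We were on a break!", "Could I BE wearing any more clothes?"]
    locations = ["Central Perk", "Monica's apartment"]    # hypothetical labels

    model = make_pipeline(TfidfVectorizer(), LogisticRegression(max_iter=1000))
    model.fit(lines, locations)
    print(model.predict(["Pivot! Pivot!"]))
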
Text classification has a wide range of uses, such as extracting the sentiment from a product review, analyzing the topic of a document and spam detection. In this research, the text classification task is to predict which TV show a given line is from. The skip-gram model, orig ...
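
For context, skip-gram embeddings can be trained with gensim as below; the corpus and hyperparameters are placeholders, not the paper's setup.

    from gensim.models import Word2Vec

    sentences = [["how", "you", "doin"], ["bazinga"]]   # hypothetical tokenized lines
    model = Word2Vec(sentences, vector_size=100, window=5, min_count=1, sg=1)  # sg=1: skip-gram
    vector = model.wv["bazinga"]                        # embedding used as a feature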

In recent years, many new text generation models have been developed, while the evaluation of text generation remains a considerable challenge. Currently, the only metric that is able to fully capture the quality of a generated text is human evaluation, which is e ...

Authorship identification is often applied to large documents, but less so to short, everyday sentences. The ability to identify who said a short line could help chatbots or personal assistants. This research compares the performance of TF-IDF and fastText when identify ...
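
For the fastText side of that comparison, a supervised classifier can be trained roughly as follows; the tiny training file written here is a placeholder in fastText's label format, not the paper's data.

    import fasttext

    # Each row: "__label__<speaker> <text>" (fastText's supervised format).
    with open("train.txt", "w") as f:
        f.write("__label__chandler Could I BE wearing any more clothes?\n")
        f.write("__label__ross We were on a break!\n")

    model = fasttext.train_supervised(input="train.txt")
    print(model.predict("I am not great at the advice"))
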
Artificial Intelligence (AI) is increasingly affecting people’s lives. AI is even employed in fields where human lives depend on the AI’s decisions. However, these algorithms lack transparency, i.e. it is unclear how they determine the outcome. If, for instance, the AI’s purpose ...

StyleGAN is a neural network architecture that is able to generate photo-realistic images. The diversity of the generated images is ensured by latent vectors. These latent vectors encode important features of the generated images. They provide insightful information about properti ...
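
A framework-free sketch of what "latent vectors encode image features" buys you: interpolating between two latent codes gives a smooth sequence of intermediate images once fed to the generator. The 512-dimensional z matches StyleGAN's usual latent size, but no particular implementation is assumed.

    import numpy as np

    def interpolate(z_a, z_b, steps=8):
        """Linear interpolation between two latent vectors."""
        alphas = np.linspace(0.0, 1.0, steps)[:, None]
        return (1 - alphas) * z_a + alphas * z_b    # shape: (steps, latent_dim)

    z_a, z_b = np.random.randn(512), np.random.randn(512)
    frames = interpolate(z_a, z_b)                  # feed each row to the generator
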
Recently, deep generative models have been shown to achieve state-of-the-art performance on semi-supervised learning tasks. In particular, variational autoencoders have been adopted to use labeled data, which allowed the development of SSL models using deep neural net ...
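
For reference, the variational autoencoder is trained by maximizing the evidence lower bound,

    \log p_\theta(x) \;\ge\; \mathbb{E}_{q_\phi(z \mid x)}\big[\log p_\theta(x \mid z)\big] \;-\; \mathrm{KL}\big(q_\phi(z \mid x)\,\|\,p(z)\big),

which semi-supervised variants extend with a classification term on the labeled data; the exact model used here is not given in the truncated abstract.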