
Many methods for model-based reinforcement learning (MBRL) in Markov decision processes (MDPs) provide guarantees on both the accuracy of the model they deliver and on learning efficiency. At the same time, state abstraction techniques allow for a reduction of the size of a ...

Percolate

An Exponential Family JIVE Model to Design DNA-Based Predictors of Drug Response

Motivation: Anti-cancer drugs may elicit resistance or sensitivity through mechanisms which involve several genomic layers. Nevertheless, we have demonstrated that gene expression contains most of the predictive capacity compared to the remaining omic data types. Unfortunately, t ...

Social Processes

Self-supervised Meta-learning Over Conversational Groups for Forecasting Nonverbal Social Cues

Free-standing social conversations constitute an as yet underexplored setting for human behavior forecasting. While the task of predicting pedestrian trajectories has received much recent attention, an intrinsic difference between these settings is how groups form and disband. Eviden ...

LCDB 1.0

An Extensive Learning Curves Database for Classification Tasks

The use of learning curves for decision making in supervised machine learning is standard practice, yet understanding of their behavior is rather limited. To facilitate a deepening of our knowledge, we introduce the Learning Curve Database (LCDB), which contains empirical learnin ...
Learning curves provide insight into the dependence of a learner's generalization performance on the training set size. This important tool can be used for model selection, to predict the effect of more training data, and to reduce the computational complexity of model training a ...
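(Illustrative sketch, not part of LCDB or the papers above: the learning-curve entries describe test performance as a function of training set size. One minimal way to compute such a curve is scikit-learn's learning_curve; the dataset and learner below are arbitrary choices made for this example.)

```python
# Minimal sketch of an empirical learning curve: mean cross-validated test
# accuracy as a function of training set size. Dataset and learner are
# arbitrary placeholders, not choices made by the papers listed here.
import numpy as np
from sklearn.datasets import load_digits
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import learning_curve

X, y = load_digits(return_X_y=True)

train_sizes, train_scores, test_scores = learning_curve(
    LogisticRegression(max_iter=2000),
    X, y,
    train_sizes=np.linspace(0.1, 1.0, 8),  # anchors along the curve
    cv=5,
)

for n, acc in zip(train_sizes, test_scores.mean(axis=1)):
    print(f"n = {n:4d}  mean CV accuracy = {acc:.3f}")
```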
Estimating uncertainty of machine learning models is essential to assess the quality of the predictions that these models provide. However, there are several factors that influence the quality of uncertainty estimates, one of which is the amount of model misspecification. Model m ...

Also for k-means

More data does not imply better performance

Arguably, a desirable feature of a learner is that its performance gets better with an increasing amount of training data, at least in expectation. This issue has received renewed attention in recent years and some curious and surprising findings have been reported on. In essence ...
Though much effort has been spent on designing new active learning algorithms, little attention has been paid to the initialization problem of active learning, i.e., how to find a set of labeled samples which contains at least one instance per category. This work identifies the i ...
Semi-supervised learning is the learning setting in which we have both labeled and unlabeled data at our disposal. This survey covers theoretical results for this setting and maps out the benefits of unlabeled data in classification and regression tasks. Most methods that use unl ...
We illustrate the detrimental effects, such as overconfident decisions, that exponential behavior can have in methods like classical LDA and logistic regression. We then show how polynomiality can remedy the situation. This, among others, leads purposefully to random-level perform ...
Model-based reinforcement learning methods are promising since they can increase sample efficiency while simultaneously improving generalizability. Learning can also be made more efficient through state abstraction, which delivers more compact models. Model-based reinforcement le ...
Domain adaptation has become a prominent problem setting in machine learning and related fields. This review asks the question: How can a classifier learn from a source domain and generalize to a target domain? We present a categorization of approaches, divided into what we refer ...

Seismic inversion with deep learning

A proposal for litho-type classification

This article investigates bypassing the inversion steps involved in a standard litho-type classification pipeline and performing the litho-type classification directly from imaged seismic data. We consider a set of deep learning methods that map the seismic data directly into lit ...
Resolution in deep convolutional neural networks (CNNs) is typically bounded by the receptive field size through filter sizes, and subsampling layers or strided convolutions on feature maps. The optimal resolution may vary significantly depending on the dataset. Modern CNNs hard- ...
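(Aside on the receptive-field point in the abstract above, as an illustration only: for a plain sequential stack of convolution or pooling layers, the receptive field grows according to a standard recurrence in the kernel sizes and strides. The layer configuration below is a made-up example, not one taken from the paper.)

```python
# Sketch of the standard receptive-field recurrence for a sequential stack of
# convolution/pooling layers: r <- r + (k - 1) * j and j <- j * s, where k is
# the kernel size, s the stride, and j the cumulative stride ("jump") of the
# layer's input relative to the network input.
def receptive_field(layers):
    """layers: iterable of (kernel_size, stride) pairs, in order of application."""
    r, j = 1, 1
    for k, s in layers:
        r += (k - 1) * j
        j *= s
    return r

# Hypothetical example: three 3x3 convolutions, each with stride 2.
print(receptive_field([(3, 2), (3, 2), (3, 2)]))  # -> 15 input pixels
```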
In practice, the data distribution at test time often differs, to a smaller or larger extent, from that of the original training data. Consequentially, the so-called source classifier, trained on the available labelled data, deteriorates on the test, or target, data. Domain adapt ...
We investigate to which extent one can recover class probabilities within the empirical risk minimization (ERM) paradigm. We extend existing results and emphasize the tight relations between empirical risk minimization and class probability estimation. Following previous literatu ...
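(A hedged illustration of the ERM / class-probability connection mentioned above, not the paper's own analysis: with a proper loss such as the log loss, the minimizer's scores map to probability estimates through the sigmoid link. The scikit-learn setup and synthetic data below are purely an example.)

```python
# Sketch: empirical risk minimization with the logistic (log) loss yields scores
# whose sigmoid can be read as class-probability estimates. Synthetic data and
# scikit-learn's LogisticRegression are arbitrary choices for illustration.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=500, n_features=10, random_state=0)

clf = LogisticRegression(max_iter=1000).fit(X, y)  # ERM with (regularized) log loss

scores = clf.decision_function(X)                  # learned linear scores
probs_from_scores = 1.0 / (1.0 + np.exp(-scores))  # sigmoid link

# Matches the estimator's own probability estimates for the positive class.
assert np.allclose(probs_from_scores, clf.predict_proba(X)[:, 1])
```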

ReproducedPapers.org

Openly Teaching and Structuring Machine Learning Reproducibility

We present ReproducedPapers.org: an open online repository for teaching and structuring machine learning reproducibility. We evaluate doing a reproduction project among students and the added value of an online reproduction repository among AI researchers. We use anonymous self- ...
Consider a domain-adaptive supervised learning setting, where a classifier learns from labeled data in a source domain and unlabeled data in a target domain to predict the corresponding target labels. If the classifier’s assumption on the relationship between domains (e.g. covari ...
Preclinical models have been the workhorse of cancer research, producing massive amounts of drug response data. Unfortunately, translating response biomarkers derived from these datasets to human tumors has proven to be particularly challenging. To address this challenge, we deve ...
Large text corpora used for creating word embeddings (vectors which represent word meanings) often contain stereotypical gender biases. As a result, such unwanted biases will typically also be present in word embeddings derived from such corpora and downstream applications in the ...