A. Lengyel | TU Delft Repository

On Color and Symmetries for Data Efficient Deep Learning

Doctoral thesis (2024) - A. Lengyel (author), Attila Lengyel (author)

Computer vision algorithms are getting more advanced by the day and slowly approach human-like capabilities, such as detecting objects in cluttered scenes and recognizing facial expressions. Yet, computers learn to perform these tasks very differently from humans. Where humans ca ...

Computer vision algorithms are getting more advanced by the day and slowly approach human-like capabilities, such as detecting objects in cluttered scenes and recognizing facial expressions. Yet, computers learn to perform these tasks very differently from humans. Where humans can generalize between different lighting conditions or geometric orientations with ease, computers require vast amounts of training data to adapt from day to night images, or even to recognize a cat hanging upside-down. This requires additional data, annotations and compute power, increasing the development costs of useful computer vision models. This thesis is therefore concerned with reducing the data and compute hunger of computer vision algorithms by incorporating prior knowledge into the model architecture. Knowledge that is built in no longer needs to be learned from data. This thesis considers various knowledge priors. To improve the robustness of deep learning models to changes in illumination, we make use of color invariant representations derived from physics-based reflection models. We find that a color invariant input layer effectively normalizes the feature map activations throughout the entire network, thereby reducing the distribution shift that normally occurs between day and night images. Equivariance has proven to be a useful network property for improving data efficiency. We introduce the color equivariant convolution, where spatial features are explicitly shared between different colors. This improves generalization to out-of-distribution colors, and therefore reduces the amount of required training data. We subsequently investigate Group Equivariant Convolutions (GConvs). First, we discover that GConv filters learn redundant symmetries, which can be hard-coded using separable convolutions. This preserves equivariance to rotation and mirroring, and improves data and compute efficiency. We also explore the notion of approximate equivariance in GConvs. Subsampling is known to introduce equivariance errors in regular convolutional layers, and we find that it similarly breaks exact equivariance for rotation and mirroring. This turns out to be a double-edged sword: while it improves performance on in-distribution data, at the same time it negatively affects out-of-distribution generalization. Finally, we show that exact equivariance can be restored by choosing an appropriate input size. This thesis aims to provide a step forward in the adoption of invariant and equivariant architectures to improve data and compute efficiency in deep learning.@en

Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models

In temporal action localization, given an input video, the goal is to predict which actions it contains, where they begin, and where they end. Training and testing current state-of- the-art deep learning models requires access to large amounts of data and computational power. How ...

Color Equivariant Convolutional Networks

Color is a crucial visual cue readily exploited by Convolutional Neural Networks (CNNs) for object recognition. However, CNNs struggle if there is data imbalance between color variations introduced by accidental recording conditions. Color invariance addresses this issue but does ...

Using and Abusing Equivariance

Conference paper (2023) - T.F. Edixhoven (author), A. Lengyel (author), Attila Lengyel (author), Jan van Gemert (author), Jan Gemert (author), Jan C. Gemert (author), J.C. Van Gemert (author), Jan Van Gemert (author), J.C. Gemert (author), Jan van van Gemert (author), Jan C. van Gemert (author), Jan C. Van Gemert (author), Jan van Van Gemert (author), Jan van Gemert (author), J.C. van Gemert (author)

In this paper we show how Group Equivariant Convolutional Neural Networks use subsampling to learn to break equivariance to the rotation and reflection symmetries. We focus on the 2D rotations and reflections and investigate the impact of the broken equivariance on network perfor ...

Copy-Pasting Coherent Depth Regions Improves Contrastive Learning for Urban-Scene Segmentation

Conference paper (2022) - L. Zeng (author), Attila Lengyel (author), A. Lengyel (author), N. Tömen (author), Nergis Tömen (author), J.C. Gemert (author), Jan Gemert (author), J.C. Van Gemert (author), Jan van van Gemert (author), Jan Van Gemert (author), Jan van Gemert (author), Jan C. Gemert (author), Jan C. Van Gemert (author), Jan van Van Gemert (author), J.C. van Gemert (author), Jan van Gemert (author), Jan C. van Gemert (author)

In this work, we leverage estimated depth to boost self-supervised contrastive learning for segmentation of urban scenes, where unlabeled videos are readily available for training self-supervised depth estimation. We argue that the semantics of a coherent group of pixels in 3D sp ...

Zero-Shot Day-Night Domain Adaptation with a Physics Prior

Conference paper (2021) - A. Lengyel (author), Attila Lengyel (author), Sourav Garg (author), Michael J. Milford (author), Michael Milford (author), Jan Gemert (author), Jan van Gemert (author), Jan van van Gemert (author), Jan C. Gemert (author), Jan C. van Gemert (author), Jan Van Gemert (author), Jan C. Van Gemert (author), Jan van Van Gemert (author), J.C. Gemert (author), J.C. van Gemert (author), J.C. Van Gemert (author), Jan van Gemert (author)

We explore the zero-shot setting for day-night domain adaptation. The traditional domain adaptation setting is to train on one domain and adapt to the target domain by exploiting unlabeled data samples from the test set. As gathering relevant test data is expensive and sometimes ...

Exploiting Learned Symmetries in Group Equivariant Convolutions

Conference paper (2021) - Attila Lengyel (author), A. Lengyel (author), J.C. Van Gemert (author), Jan van van Gemert (author), Jan C. Van Gemert (author), Jan Van Gemert (author), Jan Gemert (author), Jan van Gemert (author), Jan van Gemert (author), J.C. Gemert (author), Jan van Van Gemert (author), Jan C. van Gemert (author), J.C. van Gemert (author), Jan C. Gemert (author)

Group Equivariant Convolutions (GConvs) enable convolutional neural networks to be equivariant to various transformation groups, but at an additional parameter and compute cost. We investigate the filter parameters learned by GConvs and find certain conditions under which they be ...