J.C. van Gemert | TU Delft Repository

MSD

A Benchmark Dataset for Floor Plan Generation of Building Complexes

Conference paper (2025) - Casper van Engelenburg (author) , Fatemeh Mostafavi (author) , Emanuel Kuhn (author) , Yuntae Jeon (author) , Michael Franzen (author) , Matthias Standfest (author) , Jan van van Gemert (author) , Seyran Khademi (author)

Diverse and realistic floor plan data are essential for the development of useful computer-aided methods in architectural design. Today’s large-scale floor plan datasets predominantly feature simple floor plan layouts, typically representing single-apartment dwellings only. To co ...

Contrast-Agnostic Groupwise Registration by Robust PCA for Quantitative Cardiac MRI

Conference paper (2024) - Xinqi Li (author) , Yi Zhang (author) , Yidong Zhao (author) , Jan Van Gemert (author) , Qian Tao (author)

Quantitative cardiac magnetic resonance imaging (MRI) is an increasingly important diagnostic tool for cardiovascular diseases. Yet, co-registration of all baseline images within the quantitative MRI sequence is essential for the accuracy and precision of quantitative maps. Howev ...

Learn & drop

Fast learning of cnns based on layer dropping

Journal article (2024) - Giorgio Cruciata (author) , Luca Cruciata (author) , Liliana Lo Presti (author) , J.C. van Gemert (author) , Marco La Cascia (author)

This paper proposes a new method to improve the training efficiency of deep convolutional neural networks. During training, the method evaluates scores to measure how much each layer’s parameters change and whether the layer will continue learning or not. Based on these scores, t ...

Using and Abusing Equivariance

Conference paper (2023) - T.F. Edixhoven (author) , A. Lengyel (author) , Jan van Gemert (author)

In this paper we show how Group Equivariant Convolutional Neural Networks use subsampling to learn to break equivariance to the rotation and reflection symmetries. We focus on the 2D rotations and reflections and investigate the impact of the broken equivariance on network perfor ...

What Affects Learned Equivariance in Deep Image Recognition Models?

Conference paper (2023) - R. Bruintjes (author) , Tomasz Motyka (author) , Jan van Gemert (author)

Equivariance w.r.t. geometric transformations in neural networks improves data efficiency, parameter efficiency and robustness to out-of-domain perspective shifts. When equivariance is not designed into a neural network, the network can still learn equivariant functions from the ...

Assessing facial weakness in myasthenia gravis with facial recognition software and deep learning

Journal article (2023) - Annabel M. Ruiter (author) , Ziqi Wang (author) , Zhao Yin (author) , Willemijn C. Naber (author) , Jerrel Simons (author) , Jurre T. Blom (author) , Jan van van Gemert (author) , Jan J.G.M. Verschuuren (author) , Martijn R. Tannemaat (author)

Objective: Myasthenia gravis (MG) is an autoimmune disease leading to fatigable muscle weakness. Extra-ocular and bulbar muscles are most commonly affected. We aimed to investigate whether facial weakness can be quantified automatically and used for diagnosis and disease monitori ...

Differentiable Transportation Pruning

Conference paper (2023) - Yunqiang Li (author) , J.C. Gemert (author) , Torsten Hoefler (author) , Bert Moons (author) , Evangelos Eleftheriou (author) , Bram-Ernst Verhoef (author)

Deep learning algorithms are increasingly employed at the edge. However, edge devices are resource constrained and thus require efficient deployment of deep neural networks. Pruning methods are a key tool for edge deployment as they can improve storage, compute, memory bandwidth, ...

Are current long-term video understanding datasets long-term?

Conference paper (2023) - Ombretta Strafforello (author) , Klamer Schutte (author) , J.C. Van Gemert (author)

Many real-world applications, from sport analysis to surveillance, benefit from automatic long-term action recognition. In the current deep learning paradigm for automatic action recognition, it is imperative that models are trained and tested on datasets and tasks that evaluate ...

LAB

Learnable Activation Binarizer for Binary Neural Networks

Conference paper (2023) - Sieger Falkena (author) , H Jamali Rad (author) , Jan C. Gemert (author)

Binary Neural Networks (BNNs) are receiving an up-surge of attention for bringing power-hungry deep learning towards edge devices. The traditional wisdom in this space is to employ sign(.) for binarizing feature maps. We argue and illustrate that sign(.) is a uniqueness bottlenec ...

Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models

Conference paper (2023) - J. Warchocki (author) , T. Oprescu (author) , Y. Wang (author) , A. Dămăcuș (author) , P.M. Misterka (author) , Robert Jan Bruintjes (author) , Attila Lengyel (author) , Ombretta Strafforello (author) , Jan van van Gemert (author)

In temporal action localization, given an input video, the goal is to predict which actions it contains, where they begin, and where they end. Training and testing current state-of- the-art deep learning models requires access to large amounts of data and computational power. How ...

Color Equivariant Convolutional Networks

Conference paper (2023) - A. Lengyel (author) , O. Strafforello (author) , Robert Jan Bruintjes (author) , A.S. Gielisse (author) , Jan Van Gemert (author)

Color is a crucial visual cue readily exploited by Convolutional Neural Networks (CNNs) for object recognition. However, CNNs struggle if there is data imbalance between color variations introduced by accidental recording conditions. Color invariance addresses this issue but does ...

Computer vision and architectural history at eye level

Mixed methods for linking research in the humanities and in information technology (ArchiMediaL)

Book chapter (2023) - Tino Mager (author) , S. Khademi (author) , R.M. Siebes (author) , Jan C. Van Gemert (author) , Victor De Boer (author) , Beate Löffler (author) , Carola Hein (author)

Information on the history of architecture is embedded in our daily surroundings, in vernacular and heritage buildings and in physical objects, photographs and plans. Historians study these tangible and intangible artefacts and the communities that built and used them. Thus valua ...

Understanding weight-magnitude hyperparameters in training binary networks

Conference paper (2023) - Joris Quist (author) , Yunqiang Li (author) , J.C. Van Gemert (author)

Objects do not disappear

Video object detection by single-frame object location anticipation

Conference paper (2023) - X. Liu (author) , Jan Van Gemert (author) , Fatemeh Karimi Nejadasl (author) , O. Booij (author) , Silvia L. Pintea (author)

Objects in videos are typically characterized by continuous smooth motion. We exploit continuous smooth motion in three ways. 1) Improved accuracy by using object motion as an additional source of supervision, which we obtain by anticipating object locations from a static keyfram ...

A step towards understanding why classification helps regression

Conference paper (2023) - Silvia Pintea (author) , Y. Lin (author) , Jouke Dijkstra (author) , J.C. Van Gemert (author)

A number of computer vision deep regression approaches report improved results when adding a classification loss to the regression loss. Here, we explore why this is useful in practice and when it is beneficial. To do so, we start from precisely controlled dataset variations and ...

Analyzing Components of a Transformer under Different Dataset Scales in 3D Prostate CT Segmentation

Conference paper (2023) - Yicong Tan (author) , P. Mody (author) , Viktor van der Valk (author) , Marius Staring (author) , Jan Van Gemert (author)

Literature on medical imaging segmentation claims that hybrid UNet models containing both Transformer and convolutional blocks perform better than purely convolutional UNet models. This recently touted success of hybrid Transformers warrants an investigation into which of its com ...

Literature on medical imaging segmentation claims that hybrid UNet models containing both Transformer and convolutional blocks perform better than purely convolutional UNet models. This recently touted success of hybrid Transformers warrants an investigation into which of its components contribute to its performance. Also, previous work has a limitation of analysis only at fixed dataset scales as well as unfair comparisons with other models where parameter counts are not equivalent. Here, we investigate the performance of a hybrid Transformer network i.e. the nnFormer for organ segmentation in prostate CT scans. We do this in context of replacing its various components and by constructing learning curves by plotting model performance at different dataset scales. To compare with literature, the first experiment replaces all the shifted-window(swin) Transformer blocks of the nnFormer with convolutions. Results show that the convolution prevails as the data scale increases. In the second experiment, to reduce complexity, the self-attention mechanism within the swin-Transformer block is replaced with an similar albeit simpler spatial mixing operation i.e. max-pooling. We observe improved performance for max-pooling in smaller dataset scales, indicating that the window-based Transformer may not be the best choice in both small and larger dataset scales. Finally, since convolution has an inherent local inductive bias of positional information, we conduct a third experiment to imbibe such a property to the Transformer by exploring two kinds of positional encodings. The results show that there are insignificant improvements after adding positional encoding, indicating the hybrid swin-Transformers deficiency in capturing positional information given our dataset at its various scales. Through this work, we hope to motivate the community to use learning curves under fair experimental settings to evaluate the efficacy of newer architectures like Transformers for their medical imaging tasks. Code is available on https://github.com/prerakmody/ window-transformer-prostate-segmentation.

@en

Non-Destructive Infield Quality Estimation of Strawberries using Deep Architectures

Conference paper (2023) - Cees Jol (author) , Junhan Wen (author) , J.C. Van Gemert (author)

Strawberries are profitable fruits, yet they have a short shelf life. Therefore, it is crucial to anticipate their quality and harvest them at the best time, which is vital not only for finding the appropriate market but also for minimizing food and economic waste. To this end, n ...

SSIG

A Visually-Guided Graph Edit Distance for Floor Plan Similarity

Conference paper (2023) - C.C.J. Engelenburg (author) , S. Khademi (author) , Jan Van Gemert (author)

We propose a simple yet effective metric that measures structural similarity between visual instances of architectural floor plans, without the need for learning. Qualitatively, our experiments show that the retrieval results are similar to deeply learned methods. Effectively com ...

Video BagNet

Short temporal receptive fields increase robustness in long-term action recognition

Conference paper (2023) - O. Strafforello (author) , Xin Liu (author) , Klamer Schutte (author) , J.C. Gemert (author)

Previous work on long-term video action recognition relies on deep 3D-convolutional models that have a large temporal receptive field (RF). We argue that these models are not always the best choice for temporal modeling in videos. A large temporal receptive field allows the model ...

Is there progress in activity progress prediction?

Conference paper (2023) - Frans de Boer (author) , Jan C. Gemert (author) , Jouke Dijkstra (author) , Silvia Pintea (author)

Activity progress prediction aims to estimate what percentage of an activity has been completed. Currently this is done with machine learning approaches, trained and evaluated on complicated and realistic video datasets. The videos in these datasets vary drastically in length and ...