YC

Authored

PULP

Achieving privacy and utility trade-off in user mobility data

Leveraging location information in location-based services improves service utility through geo-contextualization. However, this raises privacy concerns, as new knowledge can be inferred from location records, such as a user's home and work places, or personal habits. Al ...

To ensure the scalability of big data analytics, approximate MapReduce platforms have emerged that explicitly trade off accuracy for latency. A key step in determining optimal approximation levels is to capture the latency of big data jobs, which has long been deemed challenging due to the co ...

Power of redundancy

Designing partial replication for multi-tier applications

Replicating redundant requests has been shown to be an effective mechanism to defend application performance against high capacity variability - a common pitfall in the cloud. While the prior art centers on single-tier systems, it remains an open question how to design re ...

sPARE

Partial Replication for Multi-tier Applications in the Cloud

Offering consistent low latency remains a key challenge for distributed applications, especially when deployed in the cloud, where virtual machines (VMs) suffer from capacity variability caused by colocated tenants. Replicating redundant requests has been shown to be an effective m ...
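
Both partial-replication abstracts above revolve around the same basic mechanism: clone a request to more than one replica and keep the fastest response, so that a single slow (contended) VM does not inflate latency. The Scala sketch below is only a generic illustration of that idea under assumed names (queryReplica is hypothetical, and the simulated delay stands in for capacity variability); it is not sPARE's actual replication design.

```scala
import scala.concurrent.{Await, Future}
import scala.concurrent.ExecutionContext.Implicits.global
import scala.concurrent.duration._

object RedundantRequest {
  // Hypothetical replica call: in a real system this would be an RPC to a backend tier.
  def queryReplica(replica: String, request: String): Future[String] = Future {
    Thread.sleep(scala.util.Random.nextInt(200)) // simulated, variable service time
    s"$replica answered '$request'"
  }

  def main(args: Array[String]): Unit = {
    val request = "GET /item/42"
    // Issue the same request to two replicas and keep whichever responds first,
    // trading extra load for lower tail latency.
    val winner = Future.firstCompletedOf(Seq(
      queryReplica("replica-A", request),
      queryReplica("replica-B", request)
    ))
    println(Await.result(winner, 1.second))
  }
}
```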

AkkaProf

A profiler for Akka actors in parallel and distributed applications

Nowadays, numerous programming languages and frameworks offer concurrency based on the actor model. Among the actor libraries for the Java Virtual Machine, Akka is the most widely used, as it is employed in various parallel and distributed applications and frameworks. Unfortunate ...
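
For context on what such a profiler instruments, here is a minimal Akka classic actor in Scala; the Counter actor and the surrounding setup are made up for illustration, but the akka.actor calls are the standard classic API.

```scala
import akka.actor.{Actor, ActorSystem, Props}

// A minimal Akka classic actor: AkkaProf-style tools instrument the processing
// of messages like these (e.g., counting messages and measuring time per message).
class Counter extends Actor {
  private var count = 0
  def receive: Receive = {
    case "inc"  => count += 1
    case "read" => sender() ! count
  }
}

object Demo extends App {
  val system  = ActorSystem("demo")
  val counter = system.actorOf(Props[Counter](), "counter")
  (1 to 3).foreach(_ => counter ! "inc")
  counter ! "read" // in this toy setup the reply goes to the dead-letter sender
  system.terminate()
}
```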

Concurrent execution of multiple applications leads to varying partial utilization of shared resources. Understanding system behavior under these conditions is essential for making concurrent execution efficient. Unfortunately, anticipating the behavior of shared resources at partial ut ...

Robust server consolidation

Coping with peak demand underestimation

Energy consumption in data centres accounts for a significant proportion of national energy usage in many countries. One approach to reducing energy consumption is to improve server usage efficiency via workload consolidation. However, there are two primary reasons why th ...

Congestion control becomes indispensable in highly utilized consolidated networks running demanding applications. In this paper, proactive congestion management schemes for Clos networks are described and evaluated. The key idea is to move the congestion avoidance burden from the ...

Catching failures of failures at big-data clusters

A two-level neural network approach

Big-data applications are becoming the core of today's business operations, featuring complex data structures and high task fan-out. According to the publicly available Google trace, more than 40% of big-data jobs do not reach successful completion. Interestingly, a significan ...

Contention detection by throttling

A black-box on-line approach

Virtualization technology powers the cloud computing paradigm and inevitably raises concerns about performance isolation of collocated virtual machines (VMs). It is imperative for public cloud providers to guarantee performance targets for tenants' VMs while respecting strict ...

Real-time processing of big data is becoming one of the core operations in various areas, such as social networks and anomaly detection. Thanks to the rich information in the data, multiple queries can be executed to analyse the data and discover a variety of business values. ...

Message exchange is a central activity in distributed computing frameworks. Nevertheless, past research has paid little attention to profiling techniques and tools for endpoint communication. In this paper, we fill this gap by introducing a new fine-grained profiler for endpoi ...

Dslash

Managing Data in Overloaded Batch Streaming Systems

Peak loads are a challenge for streaming systems. Here we present Dslash, a latency-driven controller that keeps and processes as much data as the system resources allow in order to meet a target latency.
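
As a rough sketch of the latency-driven control loop described above (assumed parameter names and a fixed step size; not Dslash's actual controller), the following Scala snippet adjusts the fraction of incoming data that is kept, based on whether the observed latency is above or below the target.

```scala
// A toy latency-driven controller: each cycle it compares observed batch
// latency with the target and nudges the keep ratio up or down accordingly.
object LatencyController {
  def nextKeepRatio(keepRatio: Double,
                    observedLatencyMs: Double,
                    targetLatencyMs: Double,
                    step: Double = 0.05): Double = {
    val adjusted =
      if (observedLatencyMs > targetLatencyMs) keepRatio - step // shed more data
      else keepRatio + step                                     // admit more data
    math.min(1.0, math.max(0.0, adjusted))                      // clamp to [0, 1]
  }

  def main(args: Array[String]): Unit = {
    var ratio = 1.0
    val observed = Seq(1200.0, 1100.0, 950.0, 900.0) // hypothetical batch latencies (ms)
    observed.foreach { latency =>
      ratio = nextKeepRatio(ratio, latency, targetLatencyMs = 1000.0)
      println(f"observed=${latency}%.0f ms -> keep ratio=${ratio}%.2f")
    }
  }
}
```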

Virtualization has become a mainstream technology that allows efficient and safe resource sharing in data centers. In this paper, we present a large-scale workload characterization study of 90K virtual machines hosted on 8K physical servers, across several geographically distr ...

Workload redundancy has emerged as an effective method to guarantee quality-of-service (QoS) targets, especially tail latency, in environments with strong capacity variability such as clouds. Nevertheless, mostly single-tier replication strategies have been studied, while multi-tie ...

As modern service systems are pressured to offer competitive prices through cost-effective capacity planning, especially in the cloud computing paradigm, service level agreements (SLAs) become ever more sophisticated, i.e., fulfilling targets for different percentiles ...
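
The percentile targets mentioned here are empirical latency percentiles such as p95 or p99. As a generic illustration (nearest-rank method over sample latencies, not tied to any particular paper's SLA model):

```scala
// Empirical latency percentiles of the kind such SLAs specify (e.g., p95, p99).
object Percentiles {
  def percentile(samples: Seq[Double], p: Double): Double = {
    require(samples.nonEmpty && p >= 0 && p <= 100)
    val sorted = samples.sorted
    val rank = math.ceil(p / 100.0 * sorted.size).toInt
    sorted(math.max(0, rank - 1)) // nearest-rank method
  }

  def main(args: Array[String]): Unit = {
    val latenciesMs = Seq(12.0, 15.0, 11.0, 80.0, 14.0, 13.0, 200.0, 16.0, 18.0, 12.0)
    println(s"p50 = ${percentile(latenciesMs, 50)} ms")
    println(s"p95 = ${percentile(latenciesMs, 95)} ms")
  }
}
```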

To operate systems cost-effectively, cloud providers not only multiplex applications on shared infrastructure but also dynamically allocate available resources, such as power and cores. Data-intensive applications based on the MapReduce paradigm are rapidly growing in popularity ...

The energy-performance optimization of datacenters becomes ever more challenging due to heterogeneous workloads featuring different performance constraints. In addition to conventional web services, MapReduce presents another important workload class, whose performance highly depends ...

Recouping energy costs from cloud tenants

Tenant demand response aware pricing design

As energy costs become an increasingly large contributor to a cloud provider's overall costs, it is important for the provider to recoup these costs from its tenants via appropriate pricing design in order to remain profitable. The poor predictability of real-world tenants' demand and ...