The Impact of Task Runtime Estimate Accuracy on Scheduling Workloads of Workflows

Ilyushkin, A.S.; Epema, D.H.J.

doi:10.1109/CCGRID.2018.00048

Conference paper (2018)

Authors

A.S. Ilyushkin (Data-Intensive Systems)

D.H.J. Epema (Data-Intensive Systems)

Research Group

Data-Intensive Systems (TU Delft)

DOI: https://doi.org/10.1109/CCGRID.2018.00048

Scheduling, Dynamic, Workflow, Fairness, Dynamic scheduling, Workload, Plan, Runtime estimates, DAG

To reference this document use:

http://resolver.tudelft.nl/uuid:bf5bf685-bf94-4619-9f83-b872b21d3659

Published Date

2018

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Department

Software Technology

Research Group

Data-Intensive Systems

Abstract

Workflow schedulers often rely on task runtime estimates when making scheduling decisions, and they usually target the scheduling of a single workflow or batches of workflows. In contrast, in this paper, we evaluate the impact of the absence or limited accuracy of task runtime estimates on slowdown when scheduling complete workloads of workflows that arrive over time. We study a total of seven scheduling policies: four of these are popular existing policies for (batches of) workloads from the literature, including a simple backfilling policy which is not aware of task runtime estimates, two are novel workload-oriented policies, including one which targets fairness, and one is the well-known HEFT policy for a single workflow adapted to the online workload scenario. We simulate homogeneous and heterogeneous distributed systems to evaluate the performance of these policies under varying accuracy of task runtime estimates. Our results show that for high utilizations, the order in which workflows are processed is more important than the knowledge of correct task runtime estimates. Under low utilizations, all policies considered show good results, even a policy which does not use task runtime estimates. We also show that our Fair Workflow Prioritization (FWP) policy effectively decreases the variance of workflow slowdown and thus achieves fairness, and that the plan-based scheduling policy derived from HEFT does not show much performance improvement while bringing extra complexity to the scheduling process.

Files

Workflows_ccgrid18.pdf

(pdf | 1.38 MB)

Unknown license