A dynamic data-throttling approach to minimize workflow imbalance
Financiación H2020 / H2020 Funds
Resumen: Scientific workflows enable scientists to undertake analysis on large datasets and perform complex scientific simulations. These workflows are often mapped onto distributed and parallel computational infrastructures to speed up their executions. Prior to its execution, a workflow structure may suffer transformations to accommodate the computing infrastructures, normally involving task clustering and partitioning. However, these transformations may cause workflow imbalance because of the difference between execution task times (runtime imbalance) or because of unconsidered data dependencies that lead to data locality issues (data imbalance). In this article, to mitigate these imbalances, we enhance the workflow lifecycle process in use by introducing a workflow imbalance phase that quantifies workflow imbalance after the transformations. Our technique is based on structural analysis of Petri nets, obtained by model transformation of a data-intensive workflow, and Linear Programming techniques. Our analysis can be used to assist workflow practitioners in finding more efficient ways of transforming and scheduling their workflows. Moreover, based on our analysis, we also propose a technique to mitigate workflow imbalance by data throttling. Our approach is based on autonomic computing principles that determine how data transmission must be throttled throughout workflow jobs. Our autonomic data-throttling approach mainly monitors the execution of the workflow and recompute data-throttling values when certain watchpoints are reached and time derivation is observed. We validate our approach by a formal proof and by simulations along with the Montage workflow. Our findings show that a dynamic data-throttling approach is feasible, does not introduce a significant overhead, and minimizes the usage of input buffers and network bandwidth.
Idioma: Inglés
DOI: 10.1145/3278720
Año: 2019
Publicado en: ACM Transactions on Internet Technology 19, 3 (2019), 32 [21 pp.]
ISSN: 1533-5399

Factor impacto JCR: 1.598 (2019)
Categ. JCR: COMPUTER SCIENCE, SOFTWARE ENGINEERING rank: 54 / 107 = 0.505 (2019) - Q3 - T2
Categ. JCR: COMPUTER SCIENCE, INFORMATION SYSTEMS rank: 112 / 155 = 0.723 (2019) - Q3 - T3

Factor impacto SCIMAGO: 0.675 - Computer Networks and Communications (Q1)

Financiación: info:eu-repo/grantAgreement/ES/DGA-FSE/T35-17D
Financiación: info:eu-repo/grantAgreement/ES/DGA/T21-17R-DISCO
Financiación: info:eu-repo/grantAgreement/EC/H2020/644869/EU/Developing Data-Intensive Cloud Applications with Iterative Quality Enhancements/DICE
Financiación: info:eu-repo/grantAgreement/ES/MICINN/TIN2014-58457-R
Tipo y forma: Article (PostPrint)
Área (Departamento): Área Lenguajes y Sistemas Inf. (Dpto. Informát.Ingenie.Sistms.)

Rights Reserved All rights reserved by journal editor


Exportado de SIDERAL (2024-02-08-10:06:05)


Visitas y descargas

Este artículo se encuentra en las siguientes colecciones:
Articles > Artículos por área > Lenguajes y Sistemas Informáticos



 Record created 2024-02-08, last modified 2024-02-08


Postprint:
 PDF
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)