Loading…

Keeping track of user steering actions in dynamic workflows

In long-lasting scientific workflow executions in HPC machines, computational scientists (the users in this work) often need to fine-tune several workflow parameters. These tunings are done through user steering actions that may significantly improve performance (e.g, reduce execution time) or impro...

Full description

Saved in:
Bibliographic Details
Published in:Future generation computer systems 2019-10, Vol.99, p.624-643
Main Authors: Souza, Renan, Silva, Vítor, Camata, Jose J., Coutinho, Alvaro L.G.A., Valduriez, Patrick, Mattoso, Marta
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In long-lasting scientific workflow executions in HPC machines, computational scientists (the users in this work) often need to fine-tune several workflow parameters. These tunings are done through user steering actions that may significantly improve performance (e.g, reduce execution time) or improve the overall results. However, in executions that last for weeks, users can lose track of what has been adapted if the tunings are not properly registered. In this work, we build on provenance data management to address the problem of tracking online parameter fine-tuning in dynamic workflows steered by users. We propose a lightweight solution to capture and manage provenance of the steering actions online with negligible overhead. The resulting provenance database relates tuning data with data for domain, dataflow provenance, execution, and performance, and is available for analysis at runtime. We show how users may get a detailed view of the execution, providing insights to determine when and how to tune. We discuss the applicability of our solution in different domains and validate its ability to allow for online capture and analyses of parameter fine-tunings in a real workflow in the Oil and Gas industry. In this experiment, the user could determine which tuned parameters influenced simulation accuracy and performance. The observed overhead for keeping track of user steering actions at runtime is less than 1% of total execution time. •Computational Steering in Scientific Workflows and Provenance Data Management.•Human-in-the-loop: Keeping track of user steering actions in dynamic workflows.•Online data analysis enriched with user steering data for online data-driven decisions.•Formal definition for the track of user steering actions in dataflows.•Experimental validation using a real scientific workflow in the oil and gas industry on an HPC machine.
ISSN:0167-739X
1872-7115
DOI:10.1016/j.future.2019.05.011