Loading…

Maximizing Workflow Throughput for Streaming Applications in Distributed Environments

Large-scale computation-intensive applications in various science fields feature complex DAG-structured workflows comprised of distributed computing modules with intricate intermodule dependencies. Mapping such workflows in heterogeneous network environments and maximizing their throughput are cruci...

Full description

Saved in:
Bibliographic Details
Main Authors: Yi Gu, Qishi Wu
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 6
container_issue
container_start_page 1
container_title
container_volume
creator Yi Gu
Qishi Wu
description Large-scale computation-intensive applications in various science fields feature complex DAG-structured workflows comprised of distributed computing modules with intricate intermodule dependencies. Mapping such workflows in heterogeneous network environments and maximizing their throughput are crucial to the success of large-scale scientific applications that process streaming datasets. We construct analytical cost models and formulate workflow mapping as an optimization problem for maximum frame rate. The difficulty of this problem essentially arises from the topological matching nature in the spatial domain, which is further compounded by the resource sharing complicacy in the temporal dimension if multiple modules are deployed on the same node. We conduct a rigorous workflow stability analysis and design a workflow mapping scheme based on a topological layer-oriented dynamic programming solution to identify and minimize the global bottleneck. The performance superiority of the proposed mapping scheme is illustrated by extensive simulation-based comparisons with existing algorithms.
doi_str_mv 10.1109/ICCCN.2010.5560146
format conference_proceeding
fullrecord <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_5560146</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5560146</ieee_id><sourcerecordid>5560146</sourcerecordid><originalsourceid>FETCH-LOGICAL-i90t-a8bf6690f68e5903a3061ee41e7a50afcba3f12376dfe3ab039739f31c1941d23</originalsourceid><addsrcrecordid>eNpVkMtOAjEYhestkSAvoJu-wODf_r3QJRlBSVAXYlySDrRQZS7pdLw9vRjZeDYnJ-fkWxxCLhkMGQNzPcvz_GHIYZ-lVMCEOiIDo0dMcCE0Y1Ifkx5XqDMjEE7-dQJOSW8PkRkHKc_JoG1fYS8huUbeI8_39jOU4TtUG_pSxze_qz_oYhvrbrNtukR9HelTis6Wv4tx0-zCyqZQVy0NFb0JbYqh6JJb00n1HmJdla5K7QU583bXusHB-2QxnSzyu2z-eDvLx_MsGEiZHRVeKQNejZw0gBZBMecEc9pKsH5VWPSMo1Zr79AWgEaj8chWzAi25tgnV3_Y4JxbNjGUNn4tDx_hD0NuV34</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Maximizing Workflow Throughput for Streaming Applications in Distributed Environments</title><source>IEEE Xplore All Conference Series</source><creator>Yi Gu ; Qishi Wu</creator><creatorcontrib>Yi Gu ; Qishi Wu</creatorcontrib><description>Large-scale computation-intensive applications in various science fields feature complex DAG-structured workflows comprised of distributed computing modules with intricate intermodule dependencies. Mapping such workflows in heterogeneous network environments and maximizing their throughput are crucial to the success of large-scale scientific applications that process streaming datasets. We construct analytical cost models and formulate workflow mapping as an optimization problem for maximum frame rate. The difficulty of this problem essentially arises from the topological matching nature in the spatial domain, which is further compounded by the resource sharing complicacy in the temporal dimension if multiple modules are deployed on the same node. We conduct a rigorous workflow stability analysis and design a workflow mapping scheme based on a topological layer-oriented dynamic programming solution to identify and minimize the global bottleneck. The performance superiority of the proposed mapping scheme is illustrated by extensive simulation-based comparisons with existing algorithms.</description><identifier>ISSN: 1095-2055</identifier><identifier>ISBN: 9781424471140</identifier><identifier>ISBN: 1424471141</identifier><identifier>EISSN: 2637-9430</identifier><identifier>EISBN: 9781424471157</identifier><identifier>EISBN: 1424471168</identifier><identifier>EISBN: 9781424471164</identifier><identifier>EISBN: 142447115X</identifier><identifier>DOI: 10.1109/ICCCN.2010.5560146</identifier><language>eng</language><publisher>IEEE</publisher><subject>Algorithm design and analysis ; Complexity theory ; Computational modeling ; Computer networks ; Network topology ; Stability analysis ; Steady-state</subject><ispartof>2010 Proceedings of 19th International Conference on Computer Communications and Networks, 2010, p.1-6</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5560146$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,27925,54555,54920,54932</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5560146$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Yi Gu</creatorcontrib><creatorcontrib>Qishi Wu</creatorcontrib><title>Maximizing Workflow Throughput for Streaming Applications in Distributed Environments</title><title>2010 Proceedings of 19th International Conference on Computer Communications and Networks</title><addtitle>ICCCN</addtitle><description>Large-scale computation-intensive applications in various science fields feature complex DAG-structured workflows comprised of distributed computing modules with intricate intermodule dependencies. Mapping such workflows in heterogeneous network environments and maximizing their throughput are crucial to the success of large-scale scientific applications that process streaming datasets. We construct analytical cost models and formulate workflow mapping as an optimization problem for maximum frame rate. The difficulty of this problem essentially arises from the topological matching nature in the spatial domain, which is further compounded by the resource sharing complicacy in the temporal dimension if multiple modules are deployed on the same node. We conduct a rigorous workflow stability analysis and design a workflow mapping scheme based on a topological layer-oriented dynamic programming solution to identify and minimize the global bottleneck. The performance superiority of the proposed mapping scheme is illustrated by extensive simulation-based comparisons with existing algorithms.</description><subject>Algorithm design and analysis</subject><subject>Complexity theory</subject><subject>Computational modeling</subject><subject>Computer networks</subject><subject>Network topology</subject><subject>Stability analysis</subject><subject>Steady-state</subject><issn>1095-2055</issn><issn>2637-9430</issn><isbn>9781424471140</isbn><isbn>1424471141</isbn><isbn>9781424471157</isbn><isbn>1424471168</isbn><isbn>9781424471164</isbn><isbn>142447115X</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2010</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNpVkMtOAjEYhestkSAvoJu-wODf_r3QJRlBSVAXYlySDrRQZS7pdLw9vRjZeDYnJ-fkWxxCLhkMGQNzPcvz_GHIYZ-lVMCEOiIDo0dMcCE0Y1Ifkx5XqDMjEE7-dQJOSW8PkRkHKc_JoG1fYS8huUbeI8_39jOU4TtUG_pSxze_qz_oYhvrbrNtukR9HelTis6Wv4tx0-zCyqZQVy0NFb0JbYqh6JJb00n1HmJdla5K7QU583bXusHB-2QxnSzyu2z-eDvLx_MsGEiZHRVeKQNejZw0gBZBMecEc9pKsH5VWPSMo1Zr79AWgEaj8chWzAi25tgnV3_Y4JxbNjGUNn4tDx_hD0NuV34</recordid><startdate>201008</startdate><enddate>201008</enddate><creator>Yi Gu</creator><creator>Qishi Wu</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>201008</creationdate><title>Maximizing Workflow Throughput for Streaming Applications in Distributed Environments</title><author>Yi Gu ; Qishi Wu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i90t-a8bf6690f68e5903a3061ee41e7a50afcba3f12376dfe3ab039739f31c1941d23</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Algorithm design and analysis</topic><topic>Complexity theory</topic><topic>Computational modeling</topic><topic>Computer networks</topic><topic>Network topology</topic><topic>Stability analysis</topic><topic>Steady-state</topic><toplevel>online_resources</toplevel><creatorcontrib>Yi Gu</creatorcontrib><creatorcontrib>Qishi Wu</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Xplore Digital Library</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Yi Gu</au><au>Qishi Wu</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Maximizing Workflow Throughput for Streaming Applications in Distributed Environments</atitle><btitle>2010 Proceedings of 19th International Conference on Computer Communications and Networks</btitle><stitle>ICCCN</stitle><date>2010-08</date><risdate>2010</risdate><spage>1</spage><epage>6</epage><pages>1-6</pages><issn>1095-2055</issn><eissn>2637-9430</eissn><isbn>9781424471140</isbn><isbn>1424471141</isbn><eisbn>9781424471157</eisbn><eisbn>1424471168</eisbn><eisbn>9781424471164</eisbn><eisbn>142447115X</eisbn><abstract>Large-scale computation-intensive applications in various science fields feature complex DAG-structured workflows comprised of distributed computing modules with intricate intermodule dependencies. Mapping such workflows in heterogeneous network environments and maximizing their throughput are crucial to the success of large-scale scientific applications that process streaming datasets. We construct analytical cost models and formulate workflow mapping as an optimization problem for maximum frame rate. The difficulty of this problem essentially arises from the topological matching nature in the spatial domain, which is further compounded by the resource sharing complicacy in the temporal dimension if multiple modules are deployed on the same node. We conduct a rigorous workflow stability analysis and design a workflow mapping scheme based on a topological layer-oriented dynamic programming solution to identify and minimize the global bottleneck. The performance superiority of the proposed mapping scheme is illustrated by extensive simulation-based comparisons with existing algorithms.</abstract><pub>IEEE</pub><doi>10.1109/ICCCN.2010.5560146</doi><tpages>6</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1095-2055
ispartof 2010 Proceedings of 19th International Conference on Computer Communications and Networks, 2010, p.1-6
issn 1095-2055
2637-9430
language eng
recordid cdi_ieee_primary_5560146
source IEEE Xplore All Conference Series
subjects Algorithm design and analysis
Complexity theory
Computational modeling
Computer networks
Network topology
Stability analysis
Steady-state
title Maximizing Workflow Throughput for Streaming Applications in Distributed Environments
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T23%3A51%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Maximizing%20Workflow%20Throughput%20for%20Streaming%20Applications%20in%20Distributed%20Environments&rft.btitle=2010%20Proceedings%20of%2019th%20International%20Conference%20on%20Computer%20Communications%20and%20Networks&rft.au=Yi%20Gu&rft.date=2010-08&rft.spage=1&rft.epage=6&rft.pages=1-6&rft.issn=1095-2055&rft.eissn=2637-9430&rft.isbn=9781424471140&rft.isbn_list=1424471141&rft_id=info:doi/10.1109/ICCCN.2010.5560146&rft.eisbn=9781424471157&rft.eisbn_list=1424471168&rft.eisbn_list=9781424471164&rft.eisbn_list=142447115X&rft_dat=%3Cieee_CHZPO%3E5560146%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i90t-a8bf6690f68e5903a3061ee41e7a50afcba3f12376dfe3ab039739f31c1941d23%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5560146&rfr_iscdi=true