Loading…

Simulation of the response time distribution of fault-tolerant multi-tier cloud services

We are considering the problem of obtaining the response time distribution of fault-tolerant multi-tier services. In the provision of software-as-a-service applications, the service provider is obliged to ensure a certain quality of service. Herein, we regard upper bounds on the response time. The s...

Full description

Saved in:
Bibliographic Details
Published in:Journal of simulation : JOS 2017-11, Vol.11 (4), p.369-379
Main Authors: Gullhav, Anders N, Nygreen, Bjørn, Heegaard, Poul E
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c412t-134184a2de110dc6d0db02b8a97a83b4907eca927f7b6296080cc91d2e45cdc23
cites cdi_FETCH-LOGICAL-c412t-134184a2de110dc6d0db02b8a97a83b4907eca927f7b6296080cc91d2e45cdc23
container_end_page 379
container_issue 4
container_start_page 369
container_title Journal of simulation : JOS
container_volume 11
creator Gullhav, Anders N
Nygreen, Bjørn
Heegaard, Poul E
description We are considering the problem of obtaining the response time distribution of fault-tolerant multi-tier services. In the provision of software-as-a-service applications, the service provider is obliged to ensure a certain quality of service. Herein, we regard upper bounds on the response time. The services consist of multiple components with different functionality, which are prone to failures, and fail according to a certain failure time distribution. However, due to redundancy, a failure will not necessarily bring the service down, but rather increase the response time. A fundamental difficulty with estimating the response time distribution while considering failures is related to the disparity in the time scales of the time between failures and service times. To overcome this issue, we propose an approach based on a decomposition, which combines an analytic model of the failure process and a discrete event simulation model to sample the response time distribution. In an experimental study, we compare this simulation-based approach with an analytic approach, and illustrate how this approach can be utilised by service providers as decision support. We also show that in certain cases, the analytic approach might provide a safe bound on the response time.
doi_str_mv 10.1057/s41273-016-0042-9
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1057_s41273_016_0042_9</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1946338086</sourcerecordid><originalsourceid>FETCH-LOGICAL-c412t-134184a2de110dc6d0db02b8a97a83b4907eca927f7b6296080cc91d2e45cdc23</originalsourceid><addsrcrecordid>eNp9kE9LBCEYhyUK2rY-QDehs6WOqw50iaV_sNChgm7iqFMuM-OmTrHfPmMqutTpfQ-_5_e-PAAcE3xK8EKcJUaoqBAmHGHMKKp3wIwIJpAQku_-7ELug4OU1hhzTKicgad734-dzj4MMLQwvzgYXdqEITmYfe-g9SlH34zfiVaPXUY5dC7qIcMCZ4-ydxGaLowWJhffvHHpEOy1ukvu6GvOwePV5cPyBq3urm-XFytkyscZkYoRyTS1jhBsDbfYNpg2UtdCy6phNRbO6JqKVjSc1hxLbExNLHVsYayh1RycTL2bGF5Hl7JahzEO5aQiNeNVJbHkJUWmlIkhpehatYm-13GrCFafAtUkUBWB6lOgqgtDJyaV7PDs4q_mf6DzCfJDG2Kv30PsrMp624XYFmPGJ1X9jX8AWyqH3Q</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1946338086</pqid></control><display><type>article</type><title>Simulation of the response time distribution of fault-tolerant multi-tier cloud services</title><source>ABI/INFORM Collection</source><source>Taylor and Francis Science and Technology Collection</source><creator>Gullhav, Anders N ; Nygreen, Bjørn ; Heegaard, Poul E</creator><creatorcontrib>Gullhav, Anders N ; Nygreen, Bjørn ; Heegaard, Poul E</creatorcontrib><description>We are considering the problem of obtaining the response time distribution of fault-tolerant multi-tier services. In the provision of software-as-a-service applications, the service provider is obliged to ensure a certain quality of service. Herein, we regard upper bounds on the response time. The services consist of multiple components with different functionality, which are prone to failures, and fail according to a certain failure time distribution. However, due to redundancy, a failure will not necessarily bring the service down, but rather increase the response time. A fundamental difficulty with estimating the response time distribution while considering failures is related to the disparity in the time scales of the time between failures and service times. To overcome this issue, we propose an approach based on a decomposition, which combines an analytic model of the failure process and a discrete event simulation model to sample the response time distribution. In an experimental study, we compare this simulation-based approach with an analytic approach, and illustrate how this approach can be utilised by service providers as decision support. We also show that in certain cases, the analytic approach might provide a safe bound on the response time.</description><identifier>ISSN: 1747-7778</identifier><identifier>EISSN: 1747-7786</identifier><identifier>DOI: 10.1057/s41273-016-0042-9</identifier><language>eng</language><publisher>London: Taylor &amp; Francis</publisher><subject>Applications programs ; Business and Management ; cloud computing ; Computer simulation ; Decision analysis ; Discrete event systems ; Failure ; Fault tolerance ; Information Systems and Communication Service ; Markov analysis ; Markov processes ; Mathematical analysis ; Operations Research/Decision Theory ; queueing ; Queuing theory ; Redundancy ; replication ; Response time ; Simulation ; Simulation and Modeling ; Software-as-a-service ; Upper bounds</subject><ispartof>Journal of simulation : JOS, 2017-11, Vol.11 (4), p.369-379</ispartof><rights>Copyright © 2016, The Operational Research Society 2016</rights><rights>The Operational Research Society 2016</rights><rights>Journal of Simulation is a copyright of Springer, 2017.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c412t-134184a2de110dc6d0db02b8a97a83b4907eca927f7b6296080cc91d2e45cdc23</citedby><cites>FETCH-LOGICAL-c412t-134184a2de110dc6d0db02b8a97a83b4907eca927f7b6296080cc91d2e45cdc23</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/1946338086/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/1946338086?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,780,784,11688,27924,27925,36060,44363,74767</link.rule.ids></links><search><creatorcontrib>Gullhav, Anders N</creatorcontrib><creatorcontrib>Nygreen, Bjørn</creatorcontrib><creatorcontrib>Heegaard, Poul E</creatorcontrib><title>Simulation of the response time distribution of fault-tolerant multi-tier cloud services</title><title>Journal of simulation : JOS</title><addtitle>J Simulation</addtitle><description>We are considering the problem of obtaining the response time distribution of fault-tolerant multi-tier services. In the provision of software-as-a-service applications, the service provider is obliged to ensure a certain quality of service. Herein, we regard upper bounds on the response time. The services consist of multiple components with different functionality, which are prone to failures, and fail according to a certain failure time distribution. However, due to redundancy, a failure will not necessarily bring the service down, but rather increase the response time. A fundamental difficulty with estimating the response time distribution while considering failures is related to the disparity in the time scales of the time between failures and service times. To overcome this issue, we propose an approach based on a decomposition, which combines an analytic model of the failure process and a discrete event simulation model to sample the response time distribution. In an experimental study, we compare this simulation-based approach with an analytic approach, and illustrate how this approach can be utilised by service providers as decision support. We also show that in certain cases, the analytic approach might provide a safe bound on the response time.</description><subject>Applications programs</subject><subject>Business and Management</subject><subject>cloud computing</subject><subject>Computer simulation</subject><subject>Decision analysis</subject><subject>Discrete event systems</subject><subject>Failure</subject><subject>Fault tolerance</subject><subject>Information Systems and Communication Service</subject><subject>Markov analysis</subject><subject>Markov processes</subject><subject>Mathematical analysis</subject><subject>Operations Research/Decision Theory</subject><subject>queueing</subject><subject>Queuing theory</subject><subject>Redundancy</subject><subject>replication</subject><subject>Response time</subject><subject>Simulation</subject><subject>Simulation and Modeling</subject><subject>Software-as-a-service</subject><subject>Upper bounds</subject><issn>1747-7778</issn><issn>1747-7786</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>M0C</sourceid><recordid>eNp9kE9LBCEYhyUK2rY-QDehs6WOqw50iaV_sNChgm7iqFMuM-OmTrHfPmMqutTpfQ-_5_e-PAAcE3xK8EKcJUaoqBAmHGHMKKp3wIwIJpAQku_-7ELug4OU1hhzTKicgad734-dzj4MMLQwvzgYXdqEITmYfe-g9SlH34zfiVaPXUY5dC7qIcMCZ4-ydxGaLowWJhffvHHpEOy1ukvu6GvOwePV5cPyBq3urm-XFytkyscZkYoRyTS1jhBsDbfYNpg2UtdCy6phNRbO6JqKVjSc1hxLbExNLHVsYayh1RycTL2bGF5Hl7JahzEO5aQiNeNVJbHkJUWmlIkhpehatYm-13GrCFafAtUkUBWB6lOgqgtDJyaV7PDs4q_mf6DzCfJDG2Kv30PsrMp624XYFmPGJ1X9jX8AWyqH3Q</recordid><startdate>20171101</startdate><enddate>20171101</enddate><creator>Gullhav, Anders N</creator><creator>Nygreen, Bjørn</creator><creator>Heegaard, Poul E</creator><general>Taylor &amp; Francis</general><general>Palgrave Macmillan UK</general><general>Taylor &amp; Francis Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L6V</scope><scope>M0C</scope><scope>M0N</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>PYYUZ</scope><scope>Q9U</scope></search><sort><creationdate>20171101</creationdate><title>Simulation of the response time distribution of fault-tolerant multi-tier cloud services</title><author>Gullhav, Anders N ; Nygreen, Bjørn ; Heegaard, Poul E</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c412t-134184a2de110dc6d0db02b8a97a83b4907eca927f7b6296080cc91d2e45cdc23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Applications programs</topic><topic>Business and Management</topic><topic>cloud computing</topic><topic>Computer simulation</topic><topic>Decision analysis</topic><topic>Discrete event systems</topic><topic>Failure</topic><topic>Fault tolerance</topic><topic>Information Systems and Communication Service</topic><topic>Markov analysis</topic><topic>Markov processes</topic><topic>Mathematical analysis</topic><topic>Operations Research/Decision Theory</topic><topic>queueing</topic><topic>Queuing theory</topic><topic>Redundancy</topic><topic>replication</topic><topic>Response time</topic><topic>Simulation</topic><topic>Simulation and Modeling</topic><topic>Software-as-a-service</topic><topic>Upper bounds</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Gullhav, Anders N</creatorcontrib><creatorcontrib>Nygreen, Bjørn</creatorcontrib><creatorcontrib>Heegaard, Poul E</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>ProQuest Engineering Collection</collection><collection>ABI/INFORM Collection</collection><collection>Computing Database</collection><collection>Engineering Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>One Business (ProQuest)</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection><collection>ABI/INFORM Collection China</collection><collection>ProQuest Central Basic</collection><jtitle>Journal of simulation : JOS</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gullhav, Anders N</au><au>Nygreen, Bjørn</au><au>Heegaard, Poul E</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Simulation of the response time distribution of fault-tolerant multi-tier cloud services</atitle><jtitle>Journal of simulation : JOS</jtitle><stitle>J Simulation</stitle><date>2017-11-01</date><risdate>2017</risdate><volume>11</volume><issue>4</issue><spage>369</spage><epage>379</epage><pages>369-379</pages><issn>1747-7778</issn><eissn>1747-7786</eissn><abstract>We are considering the problem of obtaining the response time distribution of fault-tolerant multi-tier services. In the provision of software-as-a-service applications, the service provider is obliged to ensure a certain quality of service. Herein, we regard upper bounds on the response time. The services consist of multiple components with different functionality, which are prone to failures, and fail according to a certain failure time distribution. However, due to redundancy, a failure will not necessarily bring the service down, but rather increase the response time. A fundamental difficulty with estimating the response time distribution while considering failures is related to the disparity in the time scales of the time between failures and service times. To overcome this issue, we propose an approach based on a decomposition, which combines an analytic model of the failure process and a discrete event simulation model to sample the response time distribution. In an experimental study, we compare this simulation-based approach with an analytic approach, and illustrate how this approach can be utilised by service providers as decision support. We also show that in certain cases, the analytic approach might provide a safe bound on the response time.</abstract><cop>London</cop><pub>Taylor &amp; Francis</pub><doi>10.1057/s41273-016-0042-9</doi><tpages>11</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1747-7778
ispartof Journal of simulation : JOS, 2017-11, Vol.11 (4), p.369-379
issn 1747-7778
1747-7786
language eng
recordid cdi_crossref_primary_10_1057_s41273_016_0042_9
source ABI/INFORM Collection; Taylor and Francis Science and Technology Collection
subjects Applications programs
Business and Management
cloud computing
Computer simulation
Decision analysis
Discrete event systems
Failure
Fault tolerance
Information Systems and Communication Service
Markov analysis
Markov processes
Mathematical analysis
Operations Research/Decision Theory
queueing
Queuing theory
Redundancy
replication
Response time
Simulation
Simulation and Modeling
Software-as-a-service
Upper bounds
title Simulation of the response time distribution of fault-tolerant multi-tier cloud services
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T23%3A12%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Simulation%20of%20the%20response%20time%20distribution%20of%20fault-tolerant%20multi-tier%20cloud%20services&rft.jtitle=Journal%20of%20simulation%20:%20JOS&rft.au=Gullhav,%20Anders%20N&rft.date=2017-11-01&rft.volume=11&rft.issue=4&rft.spage=369&rft.epage=379&rft.pages=369-379&rft.issn=1747-7778&rft.eissn=1747-7786&rft_id=info:doi/10.1057/s41273-016-0042-9&rft_dat=%3Cproquest_cross%3E1946338086%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c412t-134184a2de110dc6d0db02b8a97a83b4907eca927f7b6296080cc91d2e45cdc23%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1946338086&rft_id=info:pmid/&rfr_iscdi=true