Loading…

TCP BBR in Cloud Networks: Challenges, Analysis, and Solutions

Google introduced BBR representing a new model-based TCP class in 2016, which improves throughput and latency of Google's backbone and services and is now the second most popular TCP on the Internet. As BBR is designed as a general-purpose congestion control to replace current widely deployed c...

Full description

Saved in:
Bibliographic Details
Main Authors: Ha, Phuong, Vu, Minh, Le, Tuan-Anh, Xu, Lisong
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 953
container_issue
container_start_page 943
container_title
container_volume
creator Ha, Phuong
Vu, Minh
Le, Tuan-Anh
Xu, Lisong
description Google introduced BBR representing a new model-based TCP class in 2016, which improves throughput and latency of Google's backbone and services and is now the second most popular TCP on the Internet. As BBR is designed as a general-purpose congestion control to replace current widely deployed congestion control such as Reno and CUBIC, this raises the importance of studying its performance in different types of networks. In this paper, we study BBR's performance in cloud networks, which have grown rapidly but have not been studied in the existing BBR works. For the first time, we show both analytically and experimentally that due to the virtual machine (VM) scheduling in cloud networks, BBR underestimates the pacing rate, delivery rate, and estimated bandwidth, which are three key elements of its control loop. This underestimation can exacerbate iteratively and exponentially over time, and can cause BBR's throughput to reduce to almost zero. We propose a BBR patch that captures the VM scheduling impact on BBR's model and improves its throughput in cloud networks. Our evaluation of the modified BBR on the testbed and EC2 shows a significant improvement in the throughput and bandwidth estimation accuracy over the original BBR in cloud networks with heavy VM scheduling.
doi_str_mv 10.1109/ICDCS51616.2021.00094
format conference_proceeding
fullrecord <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_9546441</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9546441</ieee_id><sourcerecordid>9546441</sourcerecordid><originalsourceid>FETCH-LOGICAL-i269t-b089b1b9e4222c395108f1e595f0b5ea97d39a9a85030cd8cf97adeaaead401b3</originalsourceid><addsrcrecordid>eNotjN1KwzAYQKMguE2fQIQ8gK3fl78mXghb_BsMFTevR7qkGo2tNB2yt3egV-dcHA4h5wglIpjLub2xS4kKVcmAYQkARhyQMSolhZDI4ZCMmKxkoQXiMRnn_LFvpFZ8RK5X9pnOZi80ttSmbuvpYxh-uv4zX1H77lIK7VvIF3TaurTLcW-u9XTZpe0QuzafkKPGpRxO_zkhr3e3K_tQLJ7u53a6KCJTZihq0KbG2gTBGNtwIxF0g0Ea2UAtgzOV58YZpyVw2Hi9aUzlfHAuOC8Aaz4hZ3_fGEJYf_fxy_W7tZFCCYH8F0ADR9s</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>TCP BBR in Cloud Networks: Challenges, Analysis, and Solutions</title><source>IEEE Xplore All Conference Series</source><creator>Ha, Phuong ; Vu, Minh ; Le, Tuan-Anh ; Xu, Lisong</creator><creatorcontrib>Ha, Phuong ; Vu, Minh ; Le, Tuan-Anh ; Xu, Lisong</creatorcontrib><description>Google introduced BBR representing a new model-based TCP class in 2016, which improves throughput and latency of Google's backbone and services and is now the second most popular TCP on the Internet. As BBR is designed as a general-purpose congestion control to replace current widely deployed congestion control such as Reno and CUBIC, this raises the importance of studying its performance in different types of networks. In this paper, we study BBR's performance in cloud networks, which have grown rapidly but have not been studied in the existing BBR works. For the first time, we show both analytically and experimentally that due to the virtual machine (VM) scheduling in cloud networks, BBR underestimates the pacing rate, delivery rate, and estimated bandwidth, which are three key elements of its control loop. This underestimation can exacerbate iteratively and exponentially over time, and can cause BBR's throughput to reduce to almost zero. We propose a BBR patch that captures the VM scheduling impact on BBR's model and improves its throughput in cloud networks. Our evaluation of the modified BBR on the testbed and EC2 shows a significant improvement in the throughput and bandwidth estimation accuracy over the original BBR in cloud networks with heavy VM scheduling.</description><identifier>EISSN: 2575-8411</identifier><identifier>EISBN: 1665445130</identifier><identifier>EISBN: 9781665445139</identifier><identifier>DOI: 10.1109/ICDCS51616.2021.00094</identifier><identifier>CODEN: IEEPAD</identifier><language>eng</language><publisher>IEEE</publisher><subject>Analytical models ; Bandwidth ; BBR ; Cloud computing ; Clouds ; Computational modeling ; Conferences ; Estimation ; Processor scheduling ; TCP ; VM Scheduling</subject><ispartof>2021 IEEE 41st International Conference on Distributed Computing Systems (ICDCS), 2021, p.943-953</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9546441$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,23910,23911,25119,27904,54534,54911</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/9546441$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Ha, Phuong</creatorcontrib><creatorcontrib>Vu, Minh</creatorcontrib><creatorcontrib>Le, Tuan-Anh</creatorcontrib><creatorcontrib>Xu, Lisong</creatorcontrib><title>TCP BBR in Cloud Networks: Challenges, Analysis, and Solutions</title><title>2021 IEEE 41st International Conference on Distributed Computing Systems (ICDCS)</title><addtitle>ICDCS</addtitle><description>Google introduced BBR representing a new model-based TCP class in 2016, which improves throughput and latency of Google's backbone and services and is now the second most popular TCP on the Internet. As BBR is designed as a general-purpose congestion control to replace current widely deployed congestion control such as Reno and CUBIC, this raises the importance of studying its performance in different types of networks. In this paper, we study BBR's performance in cloud networks, which have grown rapidly but have not been studied in the existing BBR works. For the first time, we show both analytically and experimentally that due to the virtual machine (VM) scheduling in cloud networks, BBR underestimates the pacing rate, delivery rate, and estimated bandwidth, which are three key elements of its control loop. This underestimation can exacerbate iteratively and exponentially over time, and can cause BBR's throughput to reduce to almost zero. We propose a BBR patch that captures the VM scheduling impact on BBR's model and improves its throughput in cloud networks. Our evaluation of the modified BBR on the testbed and EC2 shows a significant improvement in the throughput and bandwidth estimation accuracy over the original BBR in cloud networks with heavy VM scheduling.</description><subject>Analytical models</subject><subject>Bandwidth</subject><subject>BBR</subject><subject>Cloud computing</subject><subject>Clouds</subject><subject>Computational modeling</subject><subject>Conferences</subject><subject>Estimation</subject><subject>Processor scheduling</subject><subject>TCP</subject><subject>VM Scheduling</subject><issn>2575-8411</issn><isbn>1665445130</isbn><isbn>9781665445139</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2021</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNotjN1KwzAYQKMguE2fQIQ8gK3fl78mXghb_BsMFTevR7qkGo2tNB2yt3egV-dcHA4h5wglIpjLub2xS4kKVcmAYQkARhyQMSolhZDI4ZCMmKxkoQXiMRnn_LFvpFZ8RK5X9pnOZi80ttSmbuvpYxh-uv4zX1H77lIK7VvIF3TaurTLcW-u9XTZpe0QuzafkKPGpRxO_zkhr3e3K_tQLJ7u53a6KCJTZihq0KbG2gTBGNtwIxF0g0Ea2UAtgzOV58YZpyVw2Hi9aUzlfHAuOC8Aaz4hZ3_fGEJYf_fxy_W7tZFCCYH8F0ADR9s</recordid><startdate>20210701</startdate><enddate>20210701</enddate><creator>Ha, Phuong</creator><creator>Vu, Minh</creator><creator>Le, Tuan-Anh</creator><creator>Xu, Lisong</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>20210701</creationdate><title>TCP BBR in Cloud Networks: Challenges, Analysis, and Solutions</title><author>Ha, Phuong ; Vu, Minh ; Le, Tuan-Anh ; Xu, Lisong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i269t-b089b1b9e4222c395108f1e595f0b5ea97d39a9a85030cd8cf97adeaaead401b3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Analytical models</topic><topic>Bandwidth</topic><topic>BBR</topic><topic>Cloud computing</topic><topic>Clouds</topic><topic>Computational modeling</topic><topic>Conferences</topic><topic>Estimation</topic><topic>Processor scheduling</topic><topic>TCP</topic><topic>VM Scheduling</topic><toplevel>online_resources</toplevel><creatorcontrib>Ha, Phuong</creatorcontrib><creatorcontrib>Vu, Minh</creatorcontrib><creatorcontrib>Le, Tuan-Anh</creatorcontrib><creatorcontrib>Xu, Lisong</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ha, Phuong</au><au>Vu, Minh</au><au>Le, Tuan-Anh</au><au>Xu, Lisong</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>TCP BBR in Cloud Networks: Challenges, Analysis, and Solutions</atitle><btitle>2021 IEEE 41st International Conference on Distributed Computing Systems (ICDCS)</btitle><stitle>ICDCS</stitle><date>2021-07-01</date><risdate>2021</risdate><spage>943</spage><epage>953</epage><pages>943-953</pages><eissn>2575-8411</eissn><eisbn>1665445130</eisbn><eisbn>9781665445139</eisbn><coden>IEEPAD</coden><abstract>Google introduced BBR representing a new model-based TCP class in 2016, which improves throughput and latency of Google's backbone and services and is now the second most popular TCP on the Internet. As BBR is designed as a general-purpose congestion control to replace current widely deployed congestion control such as Reno and CUBIC, this raises the importance of studying its performance in different types of networks. In this paper, we study BBR's performance in cloud networks, which have grown rapidly but have not been studied in the existing BBR works. For the first time, we show both analytically and experimentally that due to the virtual machine (VM) scheduling in cloud networks, BBR underestimates the pacing rate, delivery rate, and estimated bandwidth, which are three key elements of its control loop. This underestimation can exacerbate iteratively and exponentially over time, and can cause BBR's throughput to reduce to almost zero. We propose a BBR patch that captures the VM scheduling impact on BBR's model and improves its throughput in cloud networks. Our evaluation of the modified BBR on the testbed and EC2 shows a significant improvement in the throughput and bandwidth estimation accuracy over the original BBR in cloud networks with heavy VM scheduling.</abstract><pub>IEEE</pub><doi>10.1109/ICDCS51616.2021.00094</doi><tpages>11</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier EISSN: 2575-8411
ispartof 2021 IEEE 41st International Conference on Distributed Computing Systems (ICDCS), 2021, p.943-953
issn 2575-8411
language eng
recordid cdi_ieee_primary_9546441
source IEEE Xplore All Conference Series
subjects Analytical models
Bandwidth
BBR
Cloud computing
Clouds
Computational modeling
Conferences
Estimation
Processor scheduling
TCP
VM Scheduling
title TCP BBR in Cloud Networks: Challenges, Analysis, and Solutions
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-22T14%3A16%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=TCP%20BBR%20in%20Cloud%20Networks:%20Challenges,%20Analysis,%20and%20Solutions&rft.btitle=2021%20IEEE%2041st%20International%20Conference%20on%20Distributed%20Computing%20Systems%20(ICDCS)&rft.au=Ha,%20Phuong&rft.date=2021-07-01&rft.spage=943&rft.epage=953&rft.pages=943-953&rft.eissn=2575-8411&rft.coden=IEEPAD&rft_id=info:doi/10.1109/ICDCS51616.2021.00094&rft.eisbn=1665445130&rft.eisbn_list=9781665445139&rft_dat=%3Cieee_CHZPO%3E9546441%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i269t-b089b1b9e4222c395108f1e595f0b5ea97d39a9a85030cd8cf97adeaaead401b3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=9546441&rfr_iscdi=true