Loading…

A Restless Bandit Model for Resource Allocation, Competition, and Reservation

In “A Restless Bandit Model for Resource Allocation, Competition and Reservation,” J. Fu, B. Moran, and P. G. Taylor study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. This problem is modeled as a set of heterogeneous restles...

Full description

Saved in:
Bibliographic Details
Published in:Operations research 2022-01, Vol.70 (1), p.416-431
Main Authors: Fu, Jing, Moran, Bill, Taylor, Peter G.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c395t-447c599ae844eaf4d84f58113b01ad894a3daa49051d7b840ad81728fd94bf873
cites cdi_FETCH-LOGICAL-c395t-447c599ae844eaf4d84f58113b01ad894a3daa49051d7b840ad81728fd94bf873
container_end_page 431
container_issue 1
container_start_page 416
container_title Operations research
container_volume 70
creator Fu, Jing
Moran, Bill
Taylor, Peter G.
description In “A Restless Bandit Model for Resource Allocation, Competition and Reservation,” J. Fu, B. Moran, and P. G. Taylor study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. This problem is modeled as a set of heterogeneous restless multi-armed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s idea of relaxing the constraints and Weber and Weiss’s proof of asymptotic optimality, the authors propose an index policy and establish conditions for it to be asymptotically optimal in a regime where both arrival rates and capacities increase. In particular, they provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. Via numerical experiments, they demonstrate the effectiveness of these results even in the pre-limit case. We study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. It is modeled as a set of heterogeneous restless multiarmed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s relaxation idea and Weber and Weiss’ asymptotic optimality proof, we propose a simple policy and prove it to be asymptotically optimal in a regime where both arrival rates and capacities increase. We provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. The effectiveness of these results is demonstrated by numerical experiments. To the best of our knowledge, this is the first work providing asymptotic optimality results for such a resource allocation problem and such a combination of multiple RMABPs.
doi_str_mv 10.1287/opre.2020.2066
format article
fullrecord <record><control><sourceid>proquest_infor</sourceid><recordid>TN_cdi_informs_primary_10_1287_opre_2020_2066</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2632992009</sourcerecordid><originalsourceid>FETCH-LOGICAL-c395t-447c599ae844eaf4d84f58113b01ad894a3daa49051d7b840ad81728fd94bf873</originalsourceid><addsrcrecordid>eNqFUE1LAzEQDaJgrV49L3h16-Rzk2MtVoUWQRS8hXSTwJbtZk22gv_erOvdywwz897MvIfQNYYFJrK6C310CwIEchDiBM0wJ6LkTNBTNAOgUFLBPs7RRUp7AFBc8BnaLotXl4bWpVTcm842Q7EN1rWFD3GchGOsXbFs21CboQndbbEKh94NzVRkxohy8et3eonOvGmTu_rLc_S-fnhbPZWbl8fn1XJT1lTxoWSsqrlSxknGnPHMSua5xJjuABsrFTPUGsMUcGyrnWSQm7gi0lvFdl5WdI5upr19DJ_H_L_e50e7fFITQYlSJOvLqMWEqmNIKTqv-9gcTPzWGPRomR4t06NlerQsE8qJ0HRZ_iH9h_8BbTttkA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2632992009</pqid></control><display><type>article</type><title>A Restless Bandit Model for Resource Allocation, Competition, and Reservation</title><source>Informs</source><creator>Fu, Jing ; Moran, Bill ; Taylor, Peter G.</creator><creatorcontrib>Fu, Jing ; Moran, Bill ; Taylor, Peter G.</creatorcontrib><description>In “A Restless Bandit Model for Resource Allocation, Competition and Reservation,” J. Fu, B. Moran, and P. G. Taylor study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. This problem is modeled as a set of heterogeneous restless multi-armed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s idea of relaxing the constraints and Weber and Weiss’s proof of asymptotic optimality, the authors propose an index policy and establish conditions for it to be asymptotically optimal in a regime where both arrival rates and capacities increase. In particular, they provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. Via numerical experiments, they demonstrate the effectiveness of these results even in the pre-limit case. We study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. It is modeled as a set of heterogeneous restless multiarmed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s relaxation idea and Weber and Weiss’ asymptotic optimality proof, we propose a simple policy and prove it to be asymptotically optimal in a regime where both arrival rates and capacities increase. We provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. The effectiveness of these results is demonstrated by numerical experiments. To the best of our knowledge, this is the first work providing asymptotic optimality results for such a resource allocation problem and such a combination of multiple RMABPs.</description><identifier>ISSN: 0030-364X</identifier><identifier>EISSN: 1526-5463</identifier><identifier>DOI: 10.1287/opre.2020.2066</identifier><language>eng</language><publisher>Linthicum: INFORMS</publisher><subject>Asymptotic methods ; Asymptotic properties ; Estimating techniques ; Markov decision process ; Numerical analysis ; Operations research ; Optimization ; Resource allocation ; resource sharing ; restless bandits ; Stochastic Models</subject><ispartof>Operations research, 2022-01, Vol.70 (1), p.416-431</ispartof><rights>Copyright Institute for Operations Research and the Management Sciences Jan/Feb 2022</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c395t-447c599ae844eaf4d84f58113b01ad894a3daa49051d7b840ad81728fd94bf873</citedby><cites>FETCH-LOGICAL-c395t-447c599ae844eaf4d84f58113b01ad894a3daa49051d7b840ad81728fd94bf873</cites><orcidid>0000-0003-4615-8391</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://pubsonline.informs.org/doi/full/10.1287/opre.2020.2066$$EHTML$$P50$$Ginforms$$H</linktohtml><link.rule.ids>314,780,784,3692,27924,27925,62616</link.rule.ids></links><search><creatorcontrib>Fu, Jing</creatorcontrib><creatorcontrib>Moran, Bill</creatorcontrib><creatorcontrib>Taylor, Peter G.</creatorcontrib><title>A Restless Bandit Model for Resource Allocation, Competition, and Reservation</title><title>Operations research</title><description>In “A Restless Bandit Model for Resource Allocation, Competition and Reservation,” J. Fu, B. Moran, and P. G. Taylor study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. This problem is modeled as a set of heterogeneous restless multi-armed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s idea of relaxing the constraints and Weber and Weiss’s proof of asymptotic optimality, the authors propose an index policy and establish conditions for it to be asymptotically optimal in a regime where both arrival rates and capacities increase. In particular, they provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. Via numerical experiments, they demonstrate the effectiveness of these results even in the pre-limit case. We study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. It is modeled as a set of heterogeneous restless multiarmed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s relaxation idea and Weber and Weiss’ asymptotic optimality proof, we propose a simple policy and prove it to be asymptotically optimal in a regime where both arrival rates and capacities increase. We provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. The effectiveness of these results is demonstrated by numerical experiments. To the best of our knowledge, this is the first work providing asymptotic optimality results for such a resource allocation problem and such a combination of multiple RMABPs.</description><subject>Asymptotic methods</subject><subject>Asymptotic properties</subject><subject>Estimating techniques</subject><subject>Markov decision process</subject><subject>Numerical analysis</subject><subject>Operations research</subject><subject>Optimization</subject><subject>Resource allocation</subject><subject>resource sharing</subject><subject>restless bandits</subject><subject>Stochastic Models</subject><issn>0030-364X</issn><issn>1526-5463</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNqFUE1LAzEQDaJgrV49L3h16-Rzk2MtVoUWQRS8hXSTwJbtZk22gv_erOvdywwz897MvIfQNYYFJrK6C310CwIEchDiBM0wJ6LkTNBTNAOgUFLBPs7RRUp7AFBc8BnaLotXl4bWpVTcm842Q7EN1rWFD3GchGOsXbFs21CboQndbbEKh94NzVRkxohy8et3eonOvGmTu_rLc_S-fnhbPZWbl8fn1XJT1lTxoWSsqrlSxknGnPHMSua5xJjuABsrFTPUGsMUcGyrnWSQm7gi0lvFdl5WdI5upr19DJ_H_L_e50e7fFITQYlSJOvLqMWEqmNIKTqv-9gcTPzWGPRomR4t06NlerQsE8qJ0HRZ_iH9h_8BbTttkA</recordid><startdate>202201</startdate><enddate>202201</enddate><creator>Fu, Jing</creator><creator>Moran, Bill</creator><creator>Taylor, Peter G.</creator><general>INFORMS</general><general>Institute for Operations Research and the Management Sciences</general><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope><scope>K9.</scope><orcidid>https://orcid.org/0000-0003-4615-8391</orcidid></search><sort><creationdate>202201</creationdate><title>A Restless Bandit Model for Resource Allocation, Competition, and Reservation</title><author>Fu, Jing ; Moran, Bill ; Taylor, Peter G.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c395t-447c599ae844eaf4d84f58113b01ad894a3daa49051d7b840ad81728fd94bf873</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Asymptotic methods</topic><topic>Asymptotic properties</topic><topic>Estimating techniques</topic><topic>Markov decision process</topic><topic>Numerical analysis</topic><topic>Operations research</topic><topic>Optimization</topic><topic>Resource allocation</topic><topic>resource sharing</topic><topic>restless bandits</topic><topic>Stochastic Models</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fu, Jing</creatorcontrib><creatorcontrib>Moran, Bill</creatorcontrib><creatorcontrib>Taylor, Peter G.</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><jtitle>Operations research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fu, Jing</au><au>Moran, Bill</au><au>Taylor, Peter G.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Restless Bandit Model for Resource Allocation, Competition, and Reservation</atitle><jtitle>Operations research</jtitle><date>2022-01</date><risdate>2022</risdate><volume>70</volume><issue>1</issue><spage>416</spage><epage>431</epage><pages>416-431</pages><issn>0030-364X</issn><eissn>1526-5463</eissn><abstract>In “A Restless Bandit Model for Resource Allocation, Competition and Reservation,” J. Fu, B. Moran, and P. G. Taylor study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. This problem is modeled as a set of heterogeneous restless multi-armed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s idea of relaxing the constraints and Weber and Weiss’s proof of asymptotic optimality, the authors propose an index policy and establish conditions for it to be asymptotically optimal in a regime where both arrival rates and capacities increase. In particular, they provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. Via numerical experiments, they demonstrate the effectiveness of these results even in the pre-limit case. We study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. It is modeled as a set of heterogeneous restless multiarmed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s relaxation idea and Weber and Weiss’ asymptotic optimality proof, we propose a simple policy and prove it to be asymptotically optimal in a regime where both arrival rates and capacities increase. We provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. The effectiveness of these results is demonstrated by numerical experiments. To the best of our knowledge, this is the first work providing asymptotic optimality results for such a resource allocation problem and such a combination of multiple RMABPs.</abstract><cop>Linthicum</cop><pub>INFORMS</pub><doi>10.1287/opre.2020.2066</doi><tpages>16</tpages><orcidid>https://orcid.org/0000-0003-4615-8391</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0030-364X
ispartof Operations research, 2022-01, Vol.70 (1), p.416-431
issn 0030-364X
1526-5463
language eng
recordid cdi_informs_primary_10_1287_opre_2020_2066
source Informs
subjects Asymptotic methods
Asymptotic properties
Estimating techniques
Markov decision process
Numerical analysis
Operations research
Optimization
Resource allocation
resource sharing
restless bandits
Stochastic Models
title A Restless Bandit Model for Resource Allocation, Competition, and Reservation
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T02%3A36%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_infor&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Restless%20Bandit%20Model%20for%20Resource%20Allocation,%20Competition,%20and%20Reservation&rft.jtitle=Operations%20research&rft.au=Fu,%20Jing&rft.date=2022-01&rft.volume=70&rft.issue=1&rft.spage=416&rft.epage=431&rft.pages=416-431&rft.issn=0030-364X&rft.eissn=1526-5463&rft_id=info:doi/10.1287/opre.2020.2066&rft_dat=%3Cproquest_infor%3E2632992009%3C/proquest_infor%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c395t-447c599ae844eaf4d84f58113b01ad894a3daa49051d7b840ad81728fd94bf873%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2632992009&rft_id=info:pmid/&rfr_iscdi=true