Loading…

A Restless Bandit Model for Resource Allocation, Competition, and Reservation

In “A Restless Bandit Model for Resource Allocation, Competition and Reservation,” J. Fu, B. Moran, and P. G. Taylor study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. This problem is modeled as a set of heterogeneous restles...

Full description

Saved in:

Bibliographic Details
Published in:	Operations research 2022-01, Vol.70 (1), p.416-431
Main Authors:	Fu, Jing, Moran, Bill, Taylor, Peter G.
Format:	Article
Language:	English
Subjects:	Asymptotic methods Asymptotic properties Estimating techniques Markov decision process Numerical analysis Operations research Optimization Resource allocation resource sharing restless bandits Stochastic Models
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by	cdi_FETCH-LOGICAL-c395t-447c599ae844eaf4d84f58113b01ad894a3daa49051d7b840ad81728fd94bf873
cites	cdi_FETCH-LOGICAL-c395t-447c599ae844eaf4d84f58113b01ad894a3daa49051d7b840ad81728fd94bf873
container_end_page	431
container_issue	1
container_start_page	416
container_title	Operations research
container_volume	70
creator	Fu, Jing Moran, Bill Taylor, Peter G.
description	In “A Restless Bandit Model for Resource Allocation, Competition and Reservation,” J. Fu, B. Moran, and P. G. Taylor study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. This problem is modeled as a set of heterogeneous restless multi-armed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s idea of relaxing the constraints and Weber and Weiss’s proof of asymptotic optimality, the authors propose an index policy and establish conditions for it to be asymptotically optimal in a regime where both arrival rates and capacities increase. In particular, they provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. Via numerical experiments, they demonstrate the effectiveness of these results even in the pre-limit case. We study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. It is modeled as a set of heterogeneous restless multiarmed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s relaxation idea and Weber and Weiss’ asymptotic optimality proof, we propose a simple policy and prove it to be asymptotically optimal in a regime where both arrival rates and capacities increase. We provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. The effectiveness of these results is demonstrated by numerical experiments. To the best of our knowledge, this is the first work providing asymptotic optimality results for such a resource allocation problem and such a combination of multiple RMABPs.
doi_str_mv	10.1287/opre.2020.2066
format	article
fullrecord	<record><control><sourceid>proquest_infor</sourceid><recordid>TN_cdi_informs_primary_10_1287_opre_2020_2066</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2632992009</sourcerecordid><originalsourceid>FETCH-LOGICAL-c395t-447c599ae844eaf4d84f58113b01ad894a3daa49051d7b840ad81728fd94bf873</originalsourceid><addsrcrecordid>eNqFUE1LAzEQDaJgrV49L3h16-Rzk2MtVoUWQRS8hXSTwJbtZk22gv_erOvdywwz897MvIfQNYYFJrK6C310CwIEchDiBM0wJ6LkTNBTNAOgUFLBPs7RRUp7AFBc8BnaLotXl4bWpVTcm842Q7EN1rWFD3GchGOsXbFs21CboQndbbEKh94NzVRkxohy8et3eonOvGmTu_rLc_S-fnhbPZWbl8fn1XJT1lTxoWSsqrlSxknGnPHMSua5xJjuABsrFTPUGsMUcGyrnWSQm7gi0lvFdl5WdI5upr19DJ_H_L_e50e7fFITQYlSJOvLqMWEqmNIKTqv-9gcTPzWGPRomR4t06NlerQsE8qJ0HRZ_iH9h_8BbTttkA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2632992009</pqid></control><display><type>article</type><title>A Restless Bandit Model for Resource Allocation, Competition, and Reservation</title><source>Informs</source><creator>Fu, Jing ; Moran, Bill ; Taylor, Peter G.</creator><creatorcontrib>Fu, Jing ; Moran, Bill ; Taylor, Peter G.</creatorcontrib><description>In “A Restless Bandit Model for Resource Allocation, Competition and Reservation,” J. Fu, B. Moran, and P. G. Taylor study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. This problem is modeled as a set of heterogeneous restless multi-armed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s idea of relaxing the constraints and Weber and Weiss’s proof of asymptotic optimality, the authors propose an index policy and establish conditions for it to be asymptotically optimal in a regime where both arrival rates and capacities increase. In particular, they provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. Via numerical experiments, they demonstrate the effectiveness of these results even in the pre-limit case. We study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. It is modeled as a set of heterogeneous restless multiarmed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s relaxation idea and Weber and Weiss’ asymptotic optimality proof, we propose a simple policy and prove it to be asymptotically optimal in a regime where both arrival rates and capacities increase. We provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. The effectiveness of these results is demonstrated by numerical experiments. To the best of our knowledge, this is the first work providing asymptotic optimality results for such a resource allocation problem and such a combination of multiple RMABPs.</description><identifier>ISSN: 0030-364X</identifier><identifier>EISSN: 1526-5463</identifier><identifier>DOI: 10.1287/opre.2020.2066</identifier><language>eng</language><publisher>Linthicum: INFORMS</publisher><subject>Asymptotic methods ; Asymptotic properties ; Estimating techniques ; Markov decision process ; Numerical analysis ; Operations research ; Optimization ; Resource allocation ; resource sharing ; restless bandits ; Stochastic Models</subject><ispartof>Operations research, 2022-01, Vol.70 (1), p.416-431</ispartof><rights>Copyright Institute for Operations Research and the Management Sciences Jan/Feb 2022</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c395t-447c599ae844eaf4d84f58113b01ad894a3daa49051d7b840ad81728fd94bf873</citedby><cites>FETCH-LOGICAL-c395t-447c599ae844eaf4d84f58113b01ad894a3daa49051d7b840ad81728fd94bf873</cites><orcidid>0000-0003-4615-8391</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://pubsonline.informs.org/doi/full/10.1287/opre.2020.2066$$EHTML$$P50$$Ginforms$$H</linktohtml><link.rule.ids>314,780,784,3692,27924,27925,62616</link.rule.ids></links><search><creatorcontrib>Fu, Jing</creatorcontrib><creatorcontrib>Moran, Bill</creatorcontrib><creatorcontrib>Taylor, Peter G.</creatorcontrib><title>A Restless Bandit Model for Resource Allocation, Competition, and Reservation</title><title>Operations research</title><description>In “A Restless Bandit Model for Resource Allocation, Competition and Reservation,” J. Fu, B. Moran, and P. G. Taylor study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. This problem is modeled as a set of heterogeneous restless multi-armed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s idea of relaxing the constraints and Weber and Weiss’s proof of asymptotic optimality, the authors propose an index policy and establish conditions for it to be asymptotically optimal in a regime where both arrival rates and capacities increase. In particular, they provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. Via numerical experiments, they demonstrate the effectiveness of these results even in the pre-limit case. We study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. It is modeled as a set of heterogeneous restless multiarmed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s relaxation idea and Weber and Weiss’ asymptotic optimality proof, we propose a simple policy and prove it to be asymptotically optimal in a regime where both arrival rates and capacities increase. We provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. The effectiveness of these results is demonstrated by numerical experiments. To the best of our knowledge, this is the first work providing asymptotic optimality results for such a resource allocation problem and such a combination of multiple RMABPs.</description><subject>Asymptotic methods</subject><subject>Asymptotic properties</subject><subject>Estimating techniques</subject><subject>Markov decision process</subject><subject>Numerical analysis</subject><subject>Operations research</subject><subject>Optimization</subject><subject>Resource allocation</subject><subject>resource sharing</subject><subject>restless bandits</subject><subject>Stochastic Models</subject><issn>0030-364X</issn><issn>1526-5463</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNqFUE1LAzEQDaJgrV49L3h16-Rzk2MtVoUWQRS8hXSTwJbtZk22gv_erOvdywwz897MvIfQNYYFJrK6C310CwIEchDiBM0wJ6LkTNBTNAOgUFLBPs7RRUp7AFBc8BnaLotXl4bWpVTcm842Q7EN1rWFD3GchGOsXbFs21CboQndbbEKh94NzVRkxohy8et3eonOvGmTu_rLc_S-fnhbPZWbl8fn1XJT1lTxoWSsqrlSxknGnPHMSua5xJjuABsrFTPUGsMUcGyrnWSQm7gi0lvFdl5WdI5upr19DJ_H_L_e50e7fFITQYlSJOvLqMWEqmNIKTqv-9gcTPzWGPRomR4t06NlerQsE8qJ0HRZ_iH9h_8BbTttkA</recordid><startdate>202201</startdate><enddate>202201</enddate><creator>Fu, Jing</creator><creator>Moran, Bill</creator><creator>Taylor, Peter G.</creator><general>INFORMS</general><general>Institute for Operations Research and the Management Sciences</general><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope><scope>K9.</scope><orcidid>https://orcid.org/0000-0003-4615-8391</orcidid></search><sort><creationdate>202201</creationdate><title>A Restless Bandit Model for Resource Allocation, Competition, and Reservation</title><author>Fu, Jing ; Moran, Bill ; Taylor, Peter G.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c395t-447c599ae844eaf4d84f58113b01ad894a3daa49051d7b840ad81728fd94bf873</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Asymptotic methods</topic><topic>Asymptotic properties</topic><topic>Estimating techniques</topic><topic>Markov decision process</topic><topic>Numerical analysis</topic><topic>Operations research</topic><topic>Optimization</topic><topic>Resource allocation</topic><topic>resource sharing</topic><topic>restless bandits</topic><topic>Stochastic Models</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fu, Jing</creatorcontrib><creatorcontrib>Moran, Bill</creatorcontrib><creatorcontrib>Taylor, Peter G.</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><jtitle>Operations research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fu, Jing</au><au>Moran, Bill</au><au>Taylor, Peter G.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Restless Bandit Model for Resource Allocation, Competition, and Reservation</atitle><jtitle>Operations research</jtitle><date>2022-01</date><risdate>2022</risdate><volume>70</volume><issue>1</issue><spage>416</spage><epage>431</epage><pages>416-431</pages><issn>0030-364X</issn><eissn>1526-5463</eissn><abstract>In “A Restless Bandit Model for Resource Allocation, Competition and Reservation,” J. Fu, B. Moran, and P. G. Taylor study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. This problem is modeled as a set of heterogeneous restless multi-armed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s idea of relaxing the constraints and Weber and Weiss’s proof of asymptotic optimality, the authors propose an index policy and establish conditions for it to be asymptotically optimal in a regime where both arrival rates and capacities increase. In particular, they provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. Via numerical experiments, they demonstrate the effectiveness of these results even in the pre-limit case. We study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. It is modeled as a set of heterogeneous restless multiarmed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s relaxation idea and Weber and Weiss’ asymptotic optimality proof, we propose a simple policy and prove it to be asymptotically optimal in a regime where both arrival rates and capacities increase. We provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. The effectiveness of these results is demonstrated by numerical experiments. To the best of our knowledge, this is the first work providing asymptotic optimality results for such a resource allocation problem and such a combination of multiple RMABPs.</abstract><cop>Linthicum</cop><pub>INFORMS</pub><doi>10.1287/opre.2020.2066</doi><tpages>16</tpages><orcidid>https://orcid.org/0000-0003-4615-8391</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 0030-364X
ispartof	Operations research, 2022-01, Vol.70 (1), p.416-431
issn	0030-364X 1526-5463
language	eng
recordid	cdi_informs_primary_10_1287_opre_2020_2066
source	Informs
subjects	Asymptotic methods Asymptotic properties Estimating techniques Markov decision process Numerical analysis Operations research Optimization Resource allocation resource sharing restless bandits Stochastic Models
title	A Restless Bandit Model for Resource Allocation, Competition, and Reservation
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T02%3A36%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_infor&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Restless%20Bandit%20Model%20for%20Resource%20Allocation,%20Competition,%20and%20Reservation&rft.jtitle=Operations%20research&rft.au=Fu,%20Jing&rft.date=2022-01&rft.volume=70&rft.issue=1&rft.spage=416&rft.epage=431&rft.pages=416-431&rft.issn=0030-364X&rft.eissn=1526-5463&rft_id=info:doi/10.1287/opre.2020.2066&rft_dat=%3Cproquest_infor%3E2632992009%3C/proquest_infor%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c395t-447c599ae844eaf4d84f58113b01ad894a3daa49051d7b840ad81728fd94bf873%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2632992009&rft_id=info:pmid/&rfr_iscdi=true