Loading…

Constrained Reinforcement Learning for Dynamic Material Handling

As one of the core parts of flexible manufacturing systems, material handling involves storage and transportation of materials between workstations with automated vehicles. The improvement in material handling can impulse the overall efficiency of the manufacturing system. However, the occurrence of...

Full description

Saved in:
Bibliographic Details
Main Authors: Hu, Chengpeng, Wang, Ziming, Liu, Jialin, Wen, Junyi, Mao, Bifei, Yao, Xin
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 9
container_issue
container_start_page 1
container_title
container_volume
creator Hu, Chengpeng
Wang, Ziming
Liu, Jialin
Wen, Junyi
Mao, Bifei
Yao, Xin
description As one of the core parts of flexible manufacturing systems, material handling involves storage and transportation of materials between workstations with automated vehicles. The improvement in material handling can impulse the overall efficiency of the manufacturing system. However, the occurrence of dynamic events during the optimisation of task arrangements poses a challenge that requires adaptability and effectiveness. In this paper, we aim at the scheduling of automated guided vehicles for dynamic material handling. Motivated by some real-world scenarios, unknown new tasks and unexpected vehicle breakdowns are regarded as dynamic events in our problem. We formulate the problem as a constrained Markov decision process which takes into account tardiness and available vehicles as cumulative and instantaneous constraints, respectively. An adaptive constrained reinforcement learning algorithm that combines Lagrangian relaxation and invalid action masking, named RCPOM, is proposed to address the problem with two hybrid constraints. Moreover, a gym-like dynamic material handling simulator, named DMH-GYM, is developed and equipped with diverse problem instances, which can be used as benchmarks for dynamic material handling. Experimental results on the problem instances demonstrate the outstanding performance of our proposed approach compared with eight state-of-the-art constrained and non-constrained reinforcement learning algorithms, and widely used dispatching rules for material handling.
doi_str_mv 10.1109/IJCNN54540.2023.10191999
format conference_proceeding
fullrecord <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_10191999</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10191999</ieee_id><sourcerecordid>10191999</sourcerecordid><originalsourceid>FETCH-LOGICAL-i204t-3a9e9a67873c412df3f8b73adb0ff5f667a3d8aa4c39363538501b9f35e9f3cc3</originalsourceid><addsrcrecordid>eNo1j0FLxDAUhKMguLv6DzzkD3RN-pI0uSlVd1fqCqLn5TV9kUiblbSX_fcW1MsMzDcMDGNcirWUwt3unuv9XiutxLoUJaylkE46587YUhqjlbWmEudsUUojC6VEdcmW4_gl5q5zsGB39TGNU8aYqONvFFM4Zk8DpYk3hDnF9MnniD-cEg7R8xecKEfs-RZT18_0il0E7Ee6_vMV-3h6fK-3RfO62dX3TRFLoaYC0JFDU9kKvJJlFyDYtgLsWhGCDsZUCJ1FVB4cGNBgtZCtC6BpFu9hxW5-dyMRHb5zHDCfDv934QdGjUru</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Constrained Reinforcement Learning for Dynamic Material Handling</title><source>IEEE Xplore All Conference Series</source><creator>Hu, Chengpeng ; Wang, Ziming ; Liu, Jialin ; Wen, Junyi ; Mao, Bifei ; Yao, Xin</creator><creatorcontrib>Hu, Chengpeng ; Wang, Ziming ; Liu, Jialin ; Wen, Junyi ; Mao, Bifei ; Yao, Xin</creatorcontrib><description>As one of the core parts of flexible manufacturing systems, material handling involves storage and transportation of materials between workstations with automated vehicles. The improvement in material handling can impulse the overall efficiency of the manufacturing system. However, the occurrence of dynamic events during the optimisation of task arrangements poses a challenge that requires adaptability and effectiveness. In this paper, we aim at the scheduling of automated guided vehicles for dynamic material handling. Motivated by some real-world scenarios, unknown new tasks and unexpected vehicle breakdowns are regarded as dynamic events in our problem. We formulate the problem as a constrained Markov decision process which takes into account tardiness and available vehicles as cumulative and instantaneous constraints, respectively. An adaptive constrained reinforcement learning algorithm that combines Lagrangian relaxation and invalid action masking, named RCPOM, is proposed to address the problem with two hybrid constraints. Moreover, a gym-like dynamic material handling simulator, named DMH-GYM, is developed and equipped with diverse problem instances, which can be used as benchmarks for dynamic material handling. Experimental results on the problem instances demonstrate the outstanding performance of our proposed approach compared with eight state-of-the-art constrained and non-constrained reinforcement learning algorithms, and widely used dispatching rules for material handling.</description><identifier>EISSN: 2161-4407</identifier><identifier>EISBN: 1665488670</identifier><identifier>EISBN: 9781665488679</identifier><identifier>DOI: 10.1109/IJCNN54540.2023.10191999</identifier><language>eng</language><publisher>IEEE</publisher><subject>automated guided vehicle ; benchmark ; constrained reinforcement learning ; Dispatching ; Dynamic material handling ; Dynamic scheduling ; Electric breakdown ; Heuristic algorithms ; manufacturing system ; Markov processes ; Materials handling ; Reinforcement learning</subject><ispartof>2023 International Joint Conference on Neural Networks (IJCNN), 2023, p.1-9</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10191999$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,23930,23931,25140,27925,54555,54932</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10191999$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Hu, Chengpeng</creatorcontrib><creatorcontrib>Wang, Ziming</creatorcontrib><creatorcontrib>Liu, Jialin</creatorcontrib><creatorcontrib>Wen, Junyi</creatorcontrib><creatorcontrib>Mao, Bifei</creatorcontrib><creatorcontrib>Yao, Xin</creatorcontrib><title>Constrained Reinforcement Learning for Dynamic Material Handling</title><title>2023 International Joint Conference on Neural Networks (IJCNN)</title><addtitle>IJCNN</addtitle><description>As one of the core parts of flexible manufacturing systems, material handling involves storage and transportation of materials between workstations with automated vehicles. The improvement in material handling can impulse the overall efficiency of the manufacturing system. However, the occurrence of dynamic events during the optimisation of task arrangements poses a challenge that requires adaptability and effectiveness. In this paper, we aim at the scheduling of automated guided vehicles for dynamic material handling. Motivated by some real-world scenarios, unknown new tasks and unexpected vehicle breakdowns are regarded as dynamic events in our problem. We formulate the problem as a constrained Markov decision process which takes into account tardiness and available vehicles as cumulative and instantaneous constraints, respectively. An adaptive constrained reinforcement learning algorithm that combines Lagrangian relaxation and invalid action masking, named RCPOM, is proposed to address the problem with two hybrid constraints. Moreover, a gym-like dynamic material handling simulator, named DMH-GYM, is developed and equipped with diverse problem instances, which can be used as benchmarks for dynamic material handling. Experimental results on the problem instances demonstrate the outstanding performance of our proposed approach compared with eight state-of-the-art constrained and non-constrained reinforcement learning algorithms, and widely used dispatching rules for material handling.</description><subject>automated guided vehicle</subject><subject>benchmark</subject><subject>constrained reinforcement learning</subject><subject>Dispatching</subject><subject>Dynamic material handling</subject><subject>Dynamic scheduling</subject><subject>Electric breakdown</subject><subject>Heuristic algorithms</subject><subject>manufacturing system</subject><subject>Markov processes</subject><subject>Materials handling</subject><subject>Reinforcement learning</subject><issn>2161-4407</issn><isbn>1665488670</isbn><isbn>9781665488679</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2023</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNo1j0FLxDAUhKMguLv6DzzkD3RN-pI0uSlVd1fqCqLn5TV9kUiblbSX_fcW1MsMzDcMDGNcirWUwt3unuv9XiutxLoUJaylkE46587YUhqjlbWmEudsUUojC6VEdcmW4_gl5q5zsGB39TGNU8aYqONvFFM4Zk8DpYk3hDnF9MnniD-cEg7R8xecKEfs-RZT18_0il0E7Ee6_vMV-3h6fK-3RfO62dX3TRFLoaYC0JFDU9kKvJJlFyDYtgLsWhGCDsZUCJ1FVB4cGNBgtZCtC6BpFu9hxW5-dyMRHb5zHDCfDv934QdGjUru</recordid><startdate>20230618</startdate><enddate>20230618</enddate><creator>Hu, Chengpeng</creator><creator>Wang, Ziming</creator><creator>Liu, Jialin</creator><creator>Wen, Junyi</creator><creator>Mao, Bifei</creator><creator>Yao, Xin</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>20230618</creationdate><title>Constrained Reinforcement Learning for Dynamic Material Handling</title><author>Hu, Chengpeng ; Wang, Ziming ; Liu, Jialin ; Wen, Junyi ; Mao, Bifei ; Yao, Xin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i204t-3a9e9a67873c412df3f8b73adb0ff5f667a3d8aa4c39363538501b9f35e9f3cc3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2023</creationdate><topic>automated guided vehicle</topic><topic>benchmark</topic><topic>constrained reinforcement learning</topic><topic>Dispatching</topic><topic>Dynamic material handling</topic><topic>Dynamic scheduling</topic><topic>Electric breakdown</topic><topic>Heuristic algorithms</topic><topic>manufacturing system</topic><topic>Markov processes</topic><topic>Materials handling</topic><topic>Reinforcement learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Hu, Chengpeng</creatorcontrib><creatorcontrib>Wang, Ziming</creatorcontrib><creatorcontrib>Liu, Jialin</creatorcontrib><creatorcontrib>Wen, Junyi</creatorcontrib><creatorcontrib>Mao, Bifei</creatorcontrib><creatorcontrib>Yao, Xin</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE/IET Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Hu, Chengpeng</au><au>Wang, Ziming</au><au>Liu, Jialin</au><au>Wen, Junyi</au><au>Mao, Bifei</au><au>Yao, Xin</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Constrained Reinforcement Learning for Dynamic Material Handling</atitle><btitle>2023 International Joint Conference on Neural Networks (IJCNN)</btitle><stitle>IJCNN</stitle><date>2023-06-18</date><risdate>2023</risdate><spage>1</spage><epage>9</epage><pages>1-9</pages><eissn>2161-4407</eissn><eisbn>1665488670</eisbn><eisbn>9781665488679</eisbn><abstract>As one of the core parts of flexible manufacturing systems, material handling involves storage and transportation of materials between workstations with automated vehicles. The improvement in material handling can impulse the overall efficiency of the manufacturing system. However, the occurrence of dynamic events during the optimisation of task arrangements poses a challenge that requires adaptability and effectiveness. In this paper, we aim at the scheduling of automated guided vehicles for dynamic material handling. Motivated by some real-world scenarios, unknown new tasks and unexpected vehicle breakdowns are regarded as dynamic events in our problem. We formulate the problem as a constrained Markov decision process which takes into account tardiness and available vehicles as cumulative and instantaneous constraints, respectively. An adaptive constrained reinforcement learning algorithm that combines Lagrangian relaxation and invalid action masking, named RCPOM, is proposed to address the problem with two hybrid constraints. Moreover, a gym-like dynamic material handling simulator, named DMH-GYM, is developed and equipped with diverse problem instances, which can be used as benchmarks for dynamic material handling. Experimental results on the problem instances demonstrate the outstanding performance of our proposed approach compared with eight state-of-the-art constrained and non-constrained reinforcement learning algorithms, and widely used dispatching rules for material handling.</abstract><pub>IEEE</pub><doi>10.1109/IJCNN54540.2023.10191999</doi><tpages>9</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier EISSN: 2161-4407
ispartof 2023 International Joint Conference on Neural Networks (IJCNN), 2023, p.1-9
issn 2161-4407
language eng
recordid cdi_ieee_primary_10191999
source IEEE Xplore All Conference Series
subjects automated guided vehicle
benchmark
constrained reinforcement learning
Dispatching
Dynamic material handling
Dynamic scheduling
Electric breakdown
Heuristic algorithms
manufacturing system
Markov processes
Materials handling
Reinforcement learning
title Constrained Reinforcement Learning for Dynamic Material Handling
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T06%3A15%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Constrained%20Reinforcement%20Learning%20for%20Dynamic%20Material%20Handling&rft.btitle=2023%20International%20Joint%20Conference%20on%20Neural%20Networks%20(IJCNN)&rft.au=Hu,%20Chengpeng&rft.date=2023-06-18&rft.spage=1&rft.epage=9&rft.pages=1-9&rft.eissn=2161-4407&rft_id=info:doi/10.1109/IJCNN54540.2023.10191999&rft.eisbn=1665488670&rft.eisbn_list=9781665488679&rft_dat=%3Cieee_CHZPO%3E10191999%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i204t-3a9e9a67873c412df3f8b73adb0ff5f667a3d8aa4c39363538501b9f35e9f3cc3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10191999&rfr_iscdi=true