Loading…
Online optimization of replacement policies using learning automata
A global optimization algorithm operating online in a stochastic multi-teacher environment is suggested. An application example introduces a new perspective for solving some optimization problems dealing with reliability. First, a hybrid scheme combining reinforcement-based learning automata and con...
Saved in:
Published in: | International journal of systems science 2008-03, Vol.39 (3), p.237-249 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | A global optimization algorithm operating online in a stochastic multi-teacher environment is suggested. An application example introduces a new perspective for solving some optimization problems dealing with reliability. First, a hybrid scheme combining reinforcement-based learning automata and confidence probabilistics is developed for a single-teacher environment. The scheme is able to find the optimal solution with high confidence, yet providing a sequence of search actions that converge to the minimal loss. In addition, the suggested approach provides an on-line measure of the confidence to the current solution. Second, a multi-teacher environment is considered. A simple application of a database enables any single-teacher reinforcement algorithm to be used for updating the learning automaton action probability distribution. Two alternative approaches are suggested, where the former provides superior performance in terms of confidence and loss; the latter is able to deal with dependencies between the cost and the duration of the evaluation of the cost function. The performance of the learning schemes is studied in simulations on maintenance optimization, where an accumulated number of failures is optimized online for a deteriorating production system with preventive maintenance. The simulations indicate superior performance of the hybrid scheme. A significant speed-up is observed by taking advantage of information from processes running online in parallel, thus making the learning automata approach a much more feasible approach for solving engineering problems of practical interest. |
---|---|
ISSN: | 0020-7721 1464-5319 |
DOI: | 10.1080/00207720701750790 |