Abstract
In this paper, we present an algorithm based on reinforcement learning for scheduling the operations of an industrial plant which is modeled as a network of machines on a directed acyclic graph. The algorithm is assumed to have access to a high-fidelity simulator of the plant, but not a mathematical model. The algorithm is designed to optimize an objective function over a moving window, similar to receding horizon control, for a typical industrial plant which converts raw material into finished products. The delivery schedule for the incoming raw material is assumed to be known but subject to uncertainty. A novel feature of our technique is the use of schedule moments to train the algorithm to handle a large class of incoming delivery schedules.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 2020 Winter Simulation Conference, WSC 2020 |
| Editors | K.-H. Bae, B. Feng, S. Kim, S. Lazarova-Molnar, Z. Zheng, T. Roeder, R. Thiesing |
| Publisher | IEEE, Institute of Electrical and Electronics Engineers |
| Pages | 572-583 |
| Number of pages | 12 |
| ISBN (Electronic) | 9781728194998 |
| DOIs | |
| Publication status | Published - 2020 |
| Externally published | Yes |
| Event | Winter Simulation Conference 2020 - Orlando, United States of America Duration: 14 Dec 2020 → 18 Dec 2020 https://ieeexplore.ieee.org/xpl/conhome/9383852/proceeding (Proceedings) https://meetings.informs.org/wordpress/wsc2020/ (Website) |
Publication series
| Name | Proceedings - Winter Simulation Conference |
|---|---|
| Publisher | IEEE, Institute of Electrical and Electronics Engineers |
| Volume | 2020-December |
| ISSN (Print) | 0891-7736 |
Conference
| Conference | Winter Simulation Conference 2020 |
|---|---|
| Abbreviated title | WSC 2020 |
| Country/Territory | United States of America |
| City | Orlando |
| Period | 14/12/20 → 18/12/20 |
| Internet address |