Dynamically changing sequencing rules with reinforcement learning in a job shop system with stochastic influences

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Standard

Dynamically changing sequencing rules with reinforcement learning in a job shop system with stochastic influences. / Heger, Jens; Voß, Thomas.
Proceedings of the 2020 Winter Simulation Conference, WSC 2020. ed. / K.-H. Bae; B. Feng; S. Kim; S. Lazarova-Molnar; Z. Zheng; T. Roeder; R. Thiesing. IEEE - Institute of Electrical and Electronics Engineers Inc., 2020. p. 1608 - 1618 9383903 (Proceedings - Winter Simulation Conference; Vol. 2020-December).

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Harvard

Heger, J & Voß, T 2020, Dynamically changing sequencing rules with reinforcement learning in a job shop system with stochastic influences. in K-H Bae, B Feng, S Kim, S Lazarova-Molnar, Z Zheng, T Roeder & R Thiesing (eds), Proceedings of the 2020 Winter Simulation Conference, WSC 2020., 9383903, Proceedings - Winter Simulation Conference, vol. 2020-December, IEEE - Institute of Electrical and Electronics Engineers Inc., pp. 1608 - 1618, Winter Simulation Conference - WSC 2020, Orlando, United States, 14.12.20. https://doi.org/10.1109/WSC48552.2020.9383903

APA

Heger, J., & Voß, T. (2020). Dynamically changing sequencing rules with reinforcement learning in a job shop system with stochastic influences. In K.-H. Bae, B. Feng, S. Kim, S. Lazarova-Molnar, Z. Zheng, T. Roeder, & R. Thiesing (Eds.), Proceedings of the 2020 Winter Simulation Conference, WSC 2020 (pp. 1608 - 1618). Article 9383903 (Proceedings - Winter Simulation Conference; Vol. 2020-December). IEEE - Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/WSC48552.2020.9383903

Vancouver

Heger J, Voß T. Dynamically changing sequencing rules with reinforcement learning in a job shop system with stochastic influences. In Bae KH, Feng B, Kim S, Lazarova-Molnar S, Zheng Z, Roeder T, Thiesing R, editors, Proceedings of the 2020 Winter Simulation Conference, WSC 2020. IEEE - Institute of Electrical and Electronics Engineers Inc. 2020. p. 1608 - 1618. 9383903. (Proceedings - Winter Simulation Conference). doi: 10.1109/WSC48552.2020.9383903

Bibtex

@inbook{91c46c919068401cbc21216d68d1cf16,
title = "Dynamically changing sequencing rules with reinforcement learning in a job shop system with stochastic influences",
abstract = "Sequencing operations can be difficult, especially under uncertain conditions. Applying decentral sequencing rules has been a viable option; however, no rule exists that can outperform all other rules under varying system performance. For this reason, reinforcement learning (RL) is used as a hyper heuristic to select a sequencing rule based on the system status. Based on multiple training scenarios considering stochastic influences, such as varying inter arrival time or customers changing the product mix, the advantages of RL are presented. For evaluation, the trained agents are exploited in a generic manufacturing system. The best agent trained is able to dynamically adjust sequencing rules based on system performance, thereby matching and outperforming the presumed best static sequencing rules by ~ 3%. Using the trained policy in an unknown scenario, the RL heuristic is still able to change the sequencing rule according to the system status, thereby providing a robust performance.",
keywords = "Engineering",
author = "Jens Heger and Thomas Vo{\ss}",
year = "2020",
month = dec,
day = "14",
doi = "10.1109/WSC48552.2020.9383903",
language = "English",
series = "Proceedings - Winter Simulation Conference",
publisher = "IEEE - Institute of Electrical and Electronics Engineers Inc.",
pages = "1608 -- 1618",
editor = "K.-H. Bae and B. Feng and S. Kim and S. Lazarova-Molnar and Z. Zheng and T. Roeder and R. Thiesing",
booktitle = "Proceedings of the 2020 Winter Simulation Conference, WSC 2020",
address = "United States",
note = "Winter Simulation Conference - WSC 2020 : Simulation Drives Innovation, WSC2020 ; Conference date: 14-12-2020 Through 18-12-2020",
url = "http://meetings2.informs.org/wordpress/wsc2020/",

}

RIS

TY - CHAP

T1 - Dynamically changing sequencing rules with reinforcement learning in a job shop system with stochastic influences

AU - Heger, Jens

AU - Voß, Thomas

PY - 2020/12/14

Y1 - 2020/12/14

N2 - Sequencing operations can be difficult, especially under uncertain conditions. Applying decentral sequencing rules has been a viable option; however, no rule exists that can outperform all other rules under varying system performance. For this reason, reinforcement learning (RL) is used as a hyper heuristic to select a sequencing rule based on the system status. Based on multiple training scenarios considering stochastic influences, such as varying inter arrival time or customers changing the product mix, the advantages of RL are presented. For evaluation, the trained agents are exploited in a generic manufacturing system. The best agent trained is able to dynamically adjust sequencing rules based on system performance, thereby matching and outperforming the presumed best static sequencing rules by ~ 3%. Using the trained policy in an unknown scenario, the RL heuristic is still able to change the sequencing rule according to the system status, thereby providing a robust performance.

AB - Sequencing operations can be difficult, especially under uncertain conditions. Applying decentral sequencing rules has been a viable option; however, no rule exists that can outperform all other rules under varying system performance. For this reason, reinforcement learning (RL) is used as a hyper heuristic to select a sequencing rule based on the system status. Based on multiple training scenarios considering stochastic influences, such as varying inter arrival time or customers changing the product mix, the advantages of RL are presented. For evaluation, the trained agents are exploited in a generic manufacturing system. The best agent trained is able to dynamically adjust sequencing rules based on system performance, thereby matching and outperforming the presumed best static sequencing rules by ~ 3%. Using the trained policy in an unknown scenario, the RL heuristic is still able to change the sequencing rule according to the system status, thereby providing a robust performance.

KW - Engineering

UR - http://www.scopus.com/inward/record.url?scp=85103874223&partnerID=8YFLogxK

U2 - 10.1109/WSC48552.2020.9383903

DO - 10.1109/WSC48552.2020.9383903

M3 - Article in conference proceedings

AN - SCOPUS:85103874223

T3 - Proceedings - Winter Simulation Conference

SP - 1608

EP - 1618

BT - Proceedings of the 2020 Winter Simulation Conference, WSC 2020

A2 - Bae, K.-H.

A2 - Feng, B.

A2 - Kim, S.

A2 - Lazarova-Molnar, S.

A2 - Zheng, Z.

A2 - Roeder, T.

A2 - Thiesing, R.

PB - IEEE - Institute of Electrical and Electronics Engineers Inc.

T2 - Winter Simulation Conference - WSC 2020

Y2 - 14 December 2020 through 18 December 2020

ER -

Recently viewed

Publications

  1. A decoupled MPC using a geometric approach and feedforward action for motion control in robotino
  2. Model predictive control for switching gain adaptation in a sliding mode controller of a DC drive with nonlinear friction
  3. Finding Creativity in Predictability: Seizing Kairos in Chronos Through Temporal Work in Complex Innovation Processes
  4. An application of multiple behavior SIA for analyzing data from student exams
  5. Continuous and Discrete Concepts for Detecting Transport Barriers in the Planar Circular Restricted Three Body Problem
  6. Control of an Electromagnetic Linear Actuator Using Flatness Property and Systems Inversion
  7. Machine Learning and Knowledge Discovery in Databases
  8. Design of controllers applied to autonomous unmanned aerial vehicles using software in the loop
  9. A Wavelet Packet Algorithm for Online Detection of Pantograph Vibrations
  10. Integrating errors into the training process
  11. Formative Perspectives on the Relation Between CSR Communication and CSR Practices
  12. Sensitivity to complexity - an important prerequisite of problem solving mathematics teaching
  13. An extended analytical approach to evaluating monotonic functions of fuzzy numbers
  14. Comparison of Bio-Inspired Algorithms in a Case Study for Optimizing Capacitor Bank Allocation in Electrical Power Distribution
  15. Mining positional data streams
  16. HAWK - hybrid question answering using linked data
  17. Development and validation of the short form of the Later Life Workplace Index
  18. A Lyapunov based PI controller with an anti-windup scheme for a purification process of potable water
  19. Age effects on controlling tools with sensorimotor transformations
  20. Towards a Global Script?
  21. Gain Adaptation in Sliding Mode Control Using Model Predictive Control and Disturbance Compensation with Application to Actuators
  22. Overcoming Multi-legacy Application Challenges through Building Dynamic Capabilities for Low-Code Adoption
  23. Validation of an open source, remote web-based eye-tracking method (WebGazer) for research in early childhood
  24. A Cross-Classified CFA-MTMM Model for Structurally Different and Nonindependent Interchangeable Methods
  25. Using heuristic worked examples to promote solving of reality‑based tasks in mathematics in lower secondary school
  26. Interaction-Dominant Causation in Mind and Brain, and Its Implication for Questions of Generalization and Replication
  27. A simple control strategy for increasing the soft bending actuator performance by using a pressure boost
  28. Use of Machine-Learning Algorithms Based on Text, Audio and Video Data in the Prediction of Anxiety and Post-Traumatic Stress in General and Clinical Populations
  29. Mathematical relation between extended connectivity and eigenvector coefficients.
  30. Intraspecific trait variation patterns along a precipitation gradient in Mongolian rangelands
  31. Outperformed by a Computer? - Comparing Human Decisions to Reinforcement Learning Agents, Assigning Lot Sizes in a Learning Factory
  32. Artificial intelligence