Dynamically changing sequencing rules with reinforcement learning in a job shop system with stochastic influences

Research output: Contributions to collected editions/works > Article in conference proceedings > Research > peer-review

Standard

Dynamically changing sequencing rules with reinforcement learning in a job shop system with stochastic influences. / Heger, Jens; Voß, Thomas.
Proceedings of the 2020 Winter Simulation Conference, WSC 2020. ed. / K.-H. Bae; B. Feng; S. Kim; S. Lazarova-Molnar; Z. Zheng; T. Roeder; R. Thiesing. IEEE - Institute of Electrical and Electronics Engineers Inc., 2020. p. 1608 - 1618, Article 9383903 (Proceedings - Winter Simulation Conference; Vol. 2020-December).

Harvard

Heger, J & Voß, T 2020, Dynamically changing sequencing rules with reinforcement learning in a job shop system with stochastic influences. in K-H Bae, B Feng, S Kim, S Lazarova-Molnar, Z Zheng, T Roeder & R Thiesing (eds), Proceedings of the 2020 Winter Simulation Conference, WSC 2020, 9383903, Proceedings - Winter Simulation Conference, vol. 2020-December, IEEE - Institute of Electrical and Electronics Engineers Inc., pp. 1608 - 1618, Winter Simulation Conference - WSC 2020, Orlando, United States, 14.12.20. https://doi.org/10.1109/WSC48552.2020.9383903

APA

Heger, J., & Voß, T. (2020). Dynamically changing sequencing rules with reinforcement learning in a job shop system with stochastic influences. In K.-H. Bae, B. Feng, S. Kim, S. Lazarova-Molnar, Z. Zheng, T. Roeder, & R. Thiesing (Eds.), Proceedings of the 2020 Winter Simulation Conference, WSC 2020 (pp. 1608 - 1618). Article 9383903 (Proceedings - Winter Simulation Conference; Vol. 2020-December). IEEE - Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/WSC48552.2020.9383903

Vancouver

Heger J, Voß T. Dynamically changing sequencing rules with reinforcement learning in a job shop system with stochastic influences. In Bae KH, Feng B, Kim S, Lazarova-Molnar S, Zheng Z, Roeder T, Thiesing R, editors, Proceedings of the 2020 Winter Simulation Conference, WSC 2020. IEEE - Institute of Electrical and Electronics Engineers Inc. 2020. p. 1608 - 1618. 9383903. (Proceedings - Winter Simulation Conference). doi: 10.1109/WSC48552.2020.9383903

Bibtex

@inbook{91c46c919068401cbc21216d68d1cf16,
title = "Dynamically changing sequencing rules with reinforcement learning in a job shop system with stochastic influences",
abstract = "Sequencing operations can be difficult, especially under uncertain conditions. Applying decentral sequencing rules has been a viable option; however, no rule exists that can outperform all other rules under varying system performance. For this reason, reinforcement learning (RL) is used as a hyper heuristic to select a sequencing rule based on the system status. Based on multiple training scenarios considering stochastic influences, such as varying inter arrival time or customers changing the product mix, the advantages of RL are presented. For evaluation, the trained agents are exploited in a generic manufacturing system. The best agent trained is able to dynamically adjust sequencing rules based on system performance, thereby matching and outperforming the presumed best static sequencing rules by ~ 3%. Using the trained policy in an unknown scenario, the RL heuristic is still able to change the sequencing rule according to the system status, thereby providing a robust performance.",
keywords = "Engineering",
author = "Jens Heger and Thomas Vo{\ss}",
year = "2020",
month = dec,
day = "14",
doi = "10.1109/WSC48552.2020.9383903",
language = "English",
series = "Proceedings - Winter Simulation Conference",
publisher = "IEEE - Institute of Electrical and Electronics Engineers Inc.",
pages = "1608 -- 1618",
editor = "K.-H. Bae and B. Feng and S. Kim and S. Lazarova-Molnar and Z. Zheng and T. Roeder and R. Thiesing",
booktitle = "Proceedings of the 2020 Winter Simulation Conference, WSC 2020",
address = "United States",
note = "Winter Simulation Conference - WSC 2020 : Simulation Drives Innovation, WSC2020 ; Conference date: 14-12-2020 Through 18-12-2020",
url = "http://meetings2.informs.org/wordpress/wsc2020/",
}

RIS

TY - CHAP

T1 - Dynamically changing sequencing rules with reinforcement learning in a job shop system with stochastic influences

AU - Heger, Jens

AU - Voß, Thomas

PY - 2020/12/14

Y1 - 2020/12/14

N2 - Sequencing operations can be difficult, especially under uncertain conditions. Applying decentral sequencing rules has been a viable option; however, no rule exists that can outperform all other rules under varying system performance. For this reason, reinforcement learning (RL) is used as a hyper heuristic to select a sequencing rule based on the system status. Based on multiple training scenarios considering stochastic influences, such as varying inter arrival time or customers changing the product mix, the advantages of RL are presented. For evaluation, the trained agents are exploited in a generic manufacturing system. The best agent trained is able to dynamically adjust sequencing rules based on system performance, thereby matching and outperforming the presumed best static sequencing rules by ~ 3%. Using the trained policy in an unknown scenario, the RL heuristic is still able to change the sequencing rule according to the system status, thereby providing a robust performance.

AB - Sequencing operations can be difficult, especially under uncertain conditions. Applying decentral sequencing rules has been a viable option; however, no rule exists that can outperform all other rules under varying system performance. For this reason, reinforcement learning (RL) is used as a hyper heuristic to select a sequencing rule based on the system status. Based on multiple training scenarios considering stochastic influences, such as varying inter arrival time or customers changing the product mix, the advantages of RL are presented. For evaluation, the trained agents are exploited in a generic manufacturing system. The best agent trained is able to dynamically adjust sequencing rules based on system performance, thereby matching and outperforming the presumed best static sequencing rules by ~ 3%. Using the trained policy in an unknown scenario, the RL heuristic is still able to change the sequencing rule according to the system status, thereby providing a robust performance.

KW - Engineering

UR - http://www.scopus.com/inward/record.url?scp=85103874223&partnerID=8YFLogxK

U2 - 10.1109/WSC48552.2020.9383903

DO - 10.1109/WSC48552.2020.9383903

M3 - Article in conference proceedings

AN - SCOPUS:85103874223

T3 - Proceedings - Winter Simulation Conference

SP - 1608

EP - 1618

BT - Proceedings of the 2020 Winter Simulation Conference, WSC 2020

A2 - Bae, K.-H.

A2 - Feng, B.

A2 - Kim, S.

A2 - Lazarova-Molnar, S.

A2 - Zheng, Z.

A2 - Roeder, T.

A2 - Thiesing, R.

PB - IEEE - Institute of Electrical and Electronics Engineers Inc.

T2 - Winter Simulation Conference - WSC 2020

Y2 - 14 December 2020 through 18 December 2020

ER -
