Dynamically changing sequencing rules with reinforcement learning in a job shop system with stochastic influences

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Authors

Sequencing operations can be difficult, especially under uncertain conditions. Applying decentralized sequencing rules has been a viable option; however, no single rule outperforms all other rules under varying system performance. For this reason, reinforcement learning (RL) is used as a hyper-heuristic to select a sequencing rule based on the system status. The advantages of RL are presented using multiple training scenarios that consider stochastic influences, such as varying interarrival times or customers changing the product mix. For evaluation, the trained agents are exploited in a generic manufacturing system. The best trained agent is able to dynamically adjust sequencing rules based on system performance, thereby matching and outperforming the presumed best static sequencing rules by approximately 3%. When the trained policy is used in an unknown scenario, the RL heuristic is still able to change the sequencing rule according to the system status, thereby providing robust performance.
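The idea of using RL as a hyper-heuristic that picks a sequencing rule from the system status can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration and is not taken from the paper: the three candidate rules (FIFO, SPT, EDD), the job fields, the queue-length state in `observe`, and the tabular Q-learning update. The abstract does not state which RL algorithm, state features, or reward the authors used.

```python
import random
from collections import defaultdict

# Hypothetical candidate sequencing rules; each picks the next job from a queue.
# The job fields ("arrival", "proc_time", "due") are illustrative assumptions.
RULES = {
    "FIFO": lambda queue: min(queue, key=lambda j: j["arrival"]),
    "SPT": lambda queue: min(queue, key=lambda j: j["proc_time"]),
    "EDD": lambda queue: min(queue, key=lambda j: j["due"]),
}


class RuleSelector:
    """Tabular epsilon-greedy agent mapping a system state to a rule name."""

    def __init__(self, epsilon=0.1, alpha=0.5, gamma=0.9, seed=0):
        self.q = defaultdict(float)  # (state, rule) -> estimated value
        self.epsilon, self.alpha, self.gamma = epsilon, alpha, gamma
        self.rng = random.Random(seed)

    def select(self, state):
        # Explore with probability epsilon, otherwise pick the best-valued rule.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(list(RULES))
        return max(RULES, key=lambda r: self.q[(state, r)])

    def update(self, state, rule, reward, next_state):
        # One-step Q-learning update (an assumed, generic choice of algorithm).
        best_next = max(self.q[(next_state, r)] for r in RULES)
        target = reward + self.gamma * best_next
        self.q[(state, rule)] += self.alpha * (target - self.q[(state, rule)])


def observe(queue):
    """Assumed coarse system state: a queue-length bucket."""
    return "long" if len(queue) > 3 else "short"
```

In a simulation loop, the agent would call `observe` on a machine's queue, `select` a rule, apply `RULES[rule](queue)` to choose the next job, and later call `update` with a reward derived from a performance measure of the system.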
Translated title of the contribution: Dynamische Auswahl von Reihenfolgeregeln mit bestärkendem Lernen in einer Werkstattfertigung mit stochastischen Einflüssen
Original language: English
Title of host publication: Proceedings of the 2020 Winter Simulation Conference, WSC 2020
Editors: K.-H. Bae, B. Feng, S. Kim, S. Lazarova-Molnar, Z. Zheng, T. Roeder, R. Thiesing
Number of pages: 11
Publisher: IEEE - Institute of Electrical and Electronics Engineers Inc.
Publication date: 14.12.2020
Pages: 1608 - 1618
Article number: 9383903
ISBN (electronic): 978-1-7281-9499-8
DOIs
Publication status: Published - 14.12.2020
Event: Winter Simulation Conference - WSC 2020: Simulation Drives Innovation - Orlando, United States
Duration: 14.12.2020 - 18.12.2020
http://meetings2.informs.org/wordpress/wsc2020/
