Dynamically changing sequencing rules with reinforcement learning in a job shop system with stochastic influences

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

Sequencing operations can be difficult, especially under uncertain conditions. Applying decentral sequencing rules has been a viable option; however, no rule exists that can outperform all other rules under varying system performance. For this reason, reinforcement learning (RL) is used as a hyper heuristic to select a sequencing rule based on the system status. Based on multiple training scenarios considering stochastic influences, such as varying inter arrival time or customers changing the product mix, the advantages of RL are presented. For evaluation, the trained agents are exploited in a generic manufacturing system. The best agent trained is able to dynamically adjust sequencing rules based on system performance, thereby matching and outperforming the presumed best static sequencing rules by ~ 3%. Using the trained policy in an unknown scenario, the RL heuristic is still able to change the sequencing rule according to the system status, thereby providing a robust performance.
Titel in ÜbersetzungDynamische Auswahl von Reihenfolgeregeln mit bestärkendem Lernen in einer Werkstattfertigung mit stochastischen Einflüssen
OriginalspracheEnglisch
TitelProceedings of the 2020 Winter Simulation Conference, WSC 2020
HerausgeberK.-H. Bae, B. Feng, S. Kim, S. Lazarova-Molnar, Z. Zheng, T. Roeder, R. Thiesing
Anzahl der Seiten11
VerlagIEEE - Institute of Electrical and Electronics Engineers Inc.
Erscheinungsdatum14.12.2020
Seiten1608 - 1618
Aufsatznummer9383903
ISBN (elektronisch)978-1-7281-9499-8
DOIs
PublikationsstatusErschienen - 14.12.2020
VeranstaltungWinter Simulation Conference 2020: Simulation Drives Innovation - Orlando, USA / Vereinigte Staaten
Dauer: 14.12.202018.12.2020
http://meetings2.informs.org/wordpress/wsc2020/

Zugehörige Aktivitäten

DOI

Zuletzt angesehen

Publikationen

  1. Multilevel bridge governor by using model predictive control in wavelet packets for tracking trajectories
  2. Experiments on the Fehrer-Raab effect and the ‘Weather Station Model’ of visual backward masking
  3. Parking space management through deep learning – an approach for automated, low-cost and scalable real-time detection of parking space occupancy
  4. Lyapunov stability analysis to set up a PI controller for a mass flow system in case of a non-saturating input
  5. Springback prediction and reduction in deep drawing under influence of unloading modulus degradation
  6. Should learners use their hands for learning? Results from an eye-tracking study
  7. Modeling of Logistic Processes in Assembly Areas
  8. Different kinds of interactive exercises with response analysis on the web
  9. A sensor fault detection scheme as a functional safety feature for DC-DC converters
  10. Harvesting information from captions for weakly supervised semantic segmentation
  11. Understanding the socio-technical aspects of low-code adoption for software development
  12. Introduction Mobile Digital Practices. Situating People, Things, and Data
  13. Fast, Fully Automated Analysis of Voriconazole from Serum by LC-LC-ESI-MS-MS with Parallel Column-Switching Technique
  14. Exact and approximate inference for annotating graphs with structural SVMs
  15. Exploration strategies, performance, and error consequences when learning a complex computer task
  16. Lessons learned for spatial modelling of ecosystem services in support of ecosystem accounting
  17. How to support synchronous net-based learning discourses
  18. Construct Objectification and De-Objectification in Organization Theory
  19. Development and validation of a method for the determination of trace alkylphenols and phthalates in the atmosphere
  20. Modeling and numerical simulation of multiscale behavior in polycrystals via extended crystal plasticity
  21. A fast sequential injection analysis system for the simultaneous determination of ammonia and phosphate
  22. Taking the pulse of Earth's tropical forests using networks of highly distributed plots
  23. Backstepping-based Input-Output Linearization of a Peltier Element for Ice Clamping using an Unscented Kalman Filter
  24. A simple nonlinear PD control for faster and high-precision positioning of servomechanisms with actuator saturation
  25. How, when and why do negotiators use reference points?
  26. A lyapunov approach in the derivative approximation using a dynamic system
  27. Hierarchical trait filtering at different spatial scales determines beetle assemblages in deadwood
  28. Transductive support vector machines for structured variables
  29. Training effects of two different unstable shoe constructions on postural control in static and dynamic testing situations