Dynamically adjusting the k-values of the ATCS rule in a flexible flow shop scenario with reinforcement learning

Research output: Journal contributions › Journal articles › Research › peer-review

Authors

Because finding the optimal sequence in a flexible flow shop is usually an NP-hard problem, priority-based sequencing rules are applied in many real-world scenarios. In this contribution, an innovative reinforcement learning approach is used as a hyper-heuristic to dynamically adjust the k-values of the ATCS sequencing rule in a complex manufacturing scenario. The reinforcement learning approach is trained for different product mixes and different utilisation levels, and is compared with the k-values found in an extensive simulation study. This contribution presents a human-comprehensible hyper-heuristic that can adjust the k-values in response to internal and external stimuli and can reduce the mean tardiness by up to 5%.
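For readers unfamiliar with the rule, the sketch below illustrates what the two k-values control. It assumes the commonly cited textbook form of the ATCS (Apparent Tardiness Cost with Setups) index, in which k1 scales the due-date slack term and k2 scales the setup term; the exact formulation, scenario and reinforcement learning agent used in the article are not given in this record. All names (`Job`, `atcs_index`, `pick_next`) are hypothetical, and averaging processing and setup times over the currently waiting jobs is an assumption made for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class Job:
    name: str
    weight: float      # priority weight of the job
    proc_time: float   # processing time on the machine
    due_date: float    # due date of the job


def atcs_index(job, now, setup_time, mean_proc, mean_setup, k1, k2):
    """Textbook-style ATCS priority index (assumed form, not taken from the article)."""
    # Slack until the job would become late; never negative.
    slack = max(job.due_date - job.proc_time - now, 0.0)
    # k1 scales how strongly remaining slack lowers the priority.
    slack_term = math.exp(-slack / (k1 * mean_proc))
    # k2 scales how strongly the required setup lowers the priority.
    setup_term = math.exp(-setup_time / (k2 * mean_setup)) if mean_setup > 0 else 1.0
    return (job.weight / job.proc_time) * slack_term * setup_term


def pick_next(jobs, now, setups, k1, k2):
    """Return the waiting job with the highest ATCS index.

    `setups` maps a job name to the setup time needed if that job runs next.
    Means are taken over the waiting jobs (an illustrative assumption).
    """
    mean_proc = sum(j.proc_time for j in jobs) / len(jobs)
    mean_setup = sum(setups[j.name] for j in jobs) / len(jobs)
    return max(
        jobs,
        key=lambda j: atcs_index(j, now, setups[j.name], mean_proc, mean_setup, k1, k2),
    )
```

In this sketch, a small k1 combined with a large k2 favours urgent jobs even when they need a long setup, whereas a large k1 combined with a small k2 favours jobs that avoid setups. A hyper-heuristic of the kind described in the abstract would choose k1 and k2 depending on the current state of the shop instead of keeping them fixed.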
Original language: English
Journal: International Journal of Production Research
Volume: 61
Issue number: 1
Pages (from-to): 147-161
Number of pages: 15
ISSN: 0020-7543
DOIs
Publication status: Published - 2023

Bibliographical note

Publisher Copyright:
© 2021 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.
Issue title: Analytics and Machine Learning in Scheduling and Routing Optimization

    Research areas

  • Engineering - Sequencing rules, dynamic adjustment, simulation study, reinforcement learning, production planning and control
