RL4CO: An Extensive Reinforcement Learning for Combinatorial Optimization Benchmark

Federico Berto; Chuanbo Hua; Junyoung Park; Laurin Luttmann; Yining Ma; Fanchen Bu; Jiarui Wang; Haoran Ye; Minsu Kim; Sanghyeok Choi; Nayeli Gast Zepeda; André Hottung; Jianan Zhou; Jieyi Bi; Yu Hu; Fei Liu; Hyeonah Kim; Jiwoo Son; Haeyeon Kim; Davide Angioni; Wouter Kool; Zhiguang Cao; Qingfu Zhang; Joungho Kim; Jie Zhang; Kijung Shin; Cathy Wu; Sungsoo Ahn; Guojie Song; Changhyun Kwon; Kevin Tierney; Lin Xie; Jinkyoo Park

doi:10.1145/3711896.3737433

RL4CO: An Extensive Reinforcement Learning for Combinatorial Optimization Benchmark

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Authors

Federico Berto
Chuanbo Hua
Junyoung Park
Yining Ma
Fanchen Bu
Jiarui Wang
Haoran Ye
Minsu Kim
Sanghyeok Choi
Nayeli Gast Zepeda
André Hottung
Jianan Zhou
Jieyi Bi
Yu Hu
Fei Liu
Hyeonah Kim
Jiwoo Son
Haeyeon Kim
Davide Angioni
Wouter Kool
Zhiguang Cao
Qingfu Zhang
Joungho Kim
Jie Zhang
Kijung Shin
Cathy Wu
Sungsoo Ahn
Guojie Song
Changhyun Kwon
Kevin Tierney
Lin Xie
Jinkyoo Park

Professorship for Information Systems, in particular Data Science

Combinatorial optimization (CO) is fundamental to several real-world applications, from logistics and scheduling to hardware design and resource allocation. Deep reinforcement learning (RL) has recently shown significant benefits in solving CO problems, reducing reliance on domain expertise and improving computational efficiency. However, the absence of a unified benchmarking framework leads to inconsistent evaluations, limits reproducibility, and increases engineering overhead, raising barriers to adoption for new researchers. To address these challenges, we introduce RL4CO, a unified and extensive benchmark with in-depth library coverage of 27 CO problem environments and 23 state-of-the-art baselines. Built on efficient software libraries and best practices in implementation, RL4CO features modularized implementation and flexible configurations of diverse environments, policy architectures, RL algorithms, and utilities with extensive documentation. RL4CO helps researchers build on existing successes while exploring and developing their own designs, facilitating the entire research process by decoupling science from heavy engineering. We finally provide extensive benchmark studies to inspire new insights and future work. RL4CO has already attracted numerous researchers in the community and is open-sourced at https://github.com/ai4co/rl4co.

Original language	English
Title of host publication	KDD 2025 - Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining
Editors	Luiza Antonie, Jian Pei, Xiaohui Yu
Number of pages	12
Publisher	Association for Computing Machinery
Publication date	03.08.2025
Pages	5278-5289
ISBN (electronic)	9798400714542
DOIs	https://doi.org/10.1145/3711896.3737433
Publication status	Published - 03.08.2025
Event	31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2025 - Toronto Convention Centre, Toronto, Canada Duration: 03.08.2025 → 07.08.2025 Conference number: 31 https://kdd2025.kdd.org/

Bibliographical note

Publisher Copyright:
© 2025 Association for Computing Machinery. All rights reserved.

Research areas

benchmark, combinatorial optimization, neural combinatorial optimization, open research community, reinforcement learning
Business informatics

ASJC Scopus Subject Areas

Software
Information Systems

Other publications by the same author(s)

Neural Combinatorial Optimization on Heterogeneous Graphs: An Application to the Picker Routing Problem in Mixed-Shelves Warehouses

Luttmann, L. & Xie, L., 30.05.2024, In: Proceedings International Conference on Automated Planning and Scheduling, ICAPS. 34, p. 351-359 9 p.

Research output: Journal contributions › Conference article in journal › Research › peer-review

Comparison of Backpropagation and Kalman Filter-based Training for Neural Networks

Luttmann, L. & Mercorelli, P., 20.10.2021, 2021 25th International Conference on System Theory, Control and Computing (ICSTCC): October 20 – 23, 2021 Iași, ROMANIA, Proceedings. Ferariu, L., Matcovschi, M.-H. & Ungureanu, F. (eds.). Piscataway: Institute of Electrical and Electronics Engineers Inc., p. 234-241 8 p. (International Conference on System Theory, Control and Computing; no. 25).

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Formulating and solving integrated order batching and routing in multi-depot AGV-assisted mixed-shelves warehouses

Xie, L., Li, H. & Luttmann, L., 01.06.2023, In: European Journal of Operational Research . 307, 2, p. 713-730 18 p.

Research output: Journal contributions › Journal articles › Research › peer-review

DOI

https://doi.org/10.1145/3711896.3737433
Final published version

RL4CO: An Extensive Reinforcement Learning for Combinatorial Optimization Benchmark

Authors

Bibliographical note

Research areas

ASJC Scopus Subject Areas

Other publications by the same author(s)

Neural Combinatorial Optimization on Heterogeneous Graphs: An Application to the Picker Routing Problem in Mixed-Shelves Warehouses

Comparison of Backpropagation and Kalman Filter-based Training for Neural Networks

Formulating and solving integrated order batching and routing in multi-depot AGV-assisted mixed-shelves warehouses

DOI

Recently viewed

Researchers

Projects

Activities

Prizes

Publications