ML-guided Search Algorithms for Multi-Agent Path Finding

ML-guided Search Algorithms for Multi-Agent Path Finding

Abstract

Multi-agent path finding (MAPF) is the problem of finding collision-free paths for agents in a shared environment that minimizes their total travel time. It is an NP-hard problem that has important applications for distribution centers, traffic management and computer games. Various search algorithms have been proposed to solve MAPF combinatorially. Search algorithms, such as Conflict-Based Search (CBS), are guaranteed to find optimal solutions. To trade off runtime with solution quality, bounded-suboptimal search algorithms, such as Enhanced CBS (ECBS), have been proposed to find solutions with a guaranteed approximation ratio and are more scalable than optimal search algorithms. To find solutions even faster, unbounded-suboptimal search algorithms, such as Prioritized Planning (PP) and Large Neighborhood Search (LNS), drop optimality guarantees to find solutions even faster. There are a handful of decisions in these search algorithms that concern partitioning the search space, prioritizing search space exploration and pruning the search space. Thus, they typically have a big impact on the efficiency and/or effectiveness of the search. In the past decade of research on MAPF, those decisions have mainly been made manually by humans. In this research, we show that one can leverage machine learning to improve decision-making in various types of search algorithms for MAPF and introduce CBS+ML, ECBS+ML, LNS+ML and PP+ML. Specifically, we apply general machine learning techniques to learn (1) which conflict to resolve next in CBS, (2) which search tree node to expand next in ECBS, (3) which part of the solutions to improve next in LNS and (4) which priority to assign to agents in PP. In these four settings, we deploy imitation learning to imitate slow but effective experts and reduce the ML task to learning-to-rank problems where the ML models rank available decisions. Empirically, we demonstrate that our ML-guided search algorithms show substantial improvement in terms of the success rates, runtimes and/or solution qualities over their state-of-the-art non-ML-guided counterparts on several different types of grid maps from the MAPF benchmark.

Poster

BibTeX

Please cite our papers if you find them useful in your research.


@inproceedings{huang2021conflict,
  title={Learning to resolve conflicts for multi-agent path finding with conflict-based search},
  author={Huang, Taoan and Dilkina, Bistra and Koenig, Sven},
  booktitle={AAAI Conference on Artificial Intelligence},
  year={2021},
  pages={11246--11253}
}
@inproceedings{huang2021node,
  title={Learning node-selection strategies in bounded-suboptimal conflict-based search for multi-agent path finding},
  author={Huang, Taoan and Dilkina, Bistra and Koenig, Sven},
  booktitle={International Conference on Autonomous Agents and Multiagent Systems},
  pages={611--619},
  year={2021}
}
@inproceedings{huang2022anytime,
  author    = {Taoan Huang and Jiaoyang Li and Sven Koenig and Bistra Dilkina},
  title     = {Anytime Multi-Agent Path Finding via Machine Learning-Guided Large Neighborhood Search},
  booktitle = {Proceedings of the AAAI Conference on Artificial Intelligence (AAAI)},
  pages     = {9368--9376},
  year      = {2022}
}
@inproceedings{zhang2022learning,
  title={Learning a Priority Ordering for Prioritized Planning in Multi-Agent Path Finding},
  author={Zhang, Shuyang and Li, Jiaoyang and Huang, Taoan and Koenig, Sven and Dilkina, Bistra},
  booktitle={Proceedings of the International Symposium on Combinatorial Search},
  year={2022}
}