AI Speeds Up Robot Team Navigation by Splitting Planning

TL;DR

A new method helps hundreds of robots navigate crowded spaces with fewer delays, resolving route conflicts after planning and achieving 100% success rates.

Coordinating multiple robots or autonomous agents in shared spaces, such as warehouses or urban environments, presents a fundamental in artificial intelligence and robotics. Known as Multi-Agent Path Finding (MAPF), this problem requires computing collision-free trajectories for each agent while minimizing total travel time, but traditional approaches often struggle with scalability as the number of agents increases. Many existing solvers rely on time-expanded models that synchronize agents at every step, leading to computational bottlenecks and excessive waiting in dense scenarios. Researchers from the University of Zaragoza have developed a novel framework that decouples geometric planning from execution-time coordination, enabling efficient pathfinding for up to 1,000 agents with minimal delays.

The key finding is that by separating route planning from real-time conflict resolution, achieves near-linear runtime scaling and maintains a 100% success rate on instances where geometric feasibility is assured. In experiments on standard benchmark maps, including a large open grid and a constrained room layout with narrow corridors, the approach consistently produced collision-free trajectories. It outperformed or matched state-of-the-art s like Cooperative A* and PIBT in terms of sum-of-costs (SOC), which measures the total number of actions taken by all agents, particularly in bottleneck-heavy environments where waiting actions typically dominate costs. For example, on the room map with 1,000 agents, reduced SOC by 23.1% compared to a variant without geometric cost inflation, demonstrating that preemptive detours can significantly cut down on execution-time delays.

Ology involves a two-stage process: Geometric Conflict Preemption (GCP) and Decentralized Local Controller (DLC). In the first stage, GCP plans paths sequentially for each agent using a prioritized ordering, such as by shortest path distance or conflict scores, on the original graph without time expansion. It inflates costs for transitions into vertices already used by higher-priority paths, encouraging spatial detours to reduce future congestion. This is done by applying a penalty parameter Cp, where costs are increased based on the visit indices of higher-priority agents, as defined in equation (3) of the paper. In the second stage, DLC executes these geometric paths using per-vertex FIFO authorization queues, inserting wait actions only when necessary to avoid vertex and edge-swap conflicts, ensuring collision-free asynchronous movement without global synchronization.

From the experimental evaluation show that scales efficiently, with runtime exhibiting an empirically near-linear trend, as illustrated in Figure 2 of the paper. On the bottleneck-heavy room map, it achieved a 100% success rate across all solvable instances, while baselines like ECBS and PIBT showed degradation at high agent densities. The data in Table I reveals that enabling GCP reduces temporal costs (waiting actions) by about 70% at 1,000 agents, even though spatial costs increase by 16%, leading to an overall SOC improvement. Additionally, Table II indicates that using a priority heuristic like Conflicting Path Last (CL) minimizes SOC, with CL yielding 18% lower costs than Longest Path First (LPF) in dense scenarios, highlighting the importance of ordering in congestion mitigation.

Of this research extend to real-world applications in logistics, robotics, and large-scale system control, where efficient coordination of autonomous vehicles or drones is critical. By avoiding time-expanded models and centralized conflict resolution, the framework reduces computational overhead and waiting times, making it suitable for dynamic environments with asynchronous agents. For instance, in warehouse automation, this could enable faster item retrieval with fewer bottlenecks, while in urban mobility, it might improve traffic flow for self-driving cars. 's decentralized nature also aligns with trends toward distributed control architectures, enhancing robustness in systems with communication delays or limited synchronization capabilities.

Limitations of the approach include its reliance on Assumption 3.1, which requires that each agent's goal remains reachable after removing higher-priority goals from the graph, potentially restricting applicability to instances where this geometric feasibility holds. The paper notes that future work could address adaptive tuning of the inflation parameter, online priority re-ordering based on observed contention, and extensions to continuous-time or heterogeneous team models. While demonstrates strong performance in simulated discrete-time environments, its effectiveness in real-world continuous settings with kinematic constraints remains to be validated, as acknowledged in the conclusion section.

About the Author

Guilherme A.

Former dentist (MD) from Brazil, 41 years old, husband, and AI enthusiast. In 2020, he transitioned from a decade-long career in dentistry to pursue his passion for technology, entrepreneurship, and helping others grow.

Connect on LinkedIn