Dongping Zhao, Hui Li, Ziyang Wang, Hang Li
To address the challenges of low efficiency, instability, and difficulty in satisfying multiple constraints simultaneously in multi-AGV (Automated Guided Vehicle) task scheduling for intelligent manufacturing and logistics, this paper introduces a scheduling method based on multi-feature constraints and an improved deep reinforcement learning (DRL) approach, Improved Proximal Policy Optimization (IPPO). The method integrates multiple constraints into the scheduling optimization process, including minimizing task completion time, reducing penalty levels, and minimizing scheduling time deviation. Building on the conventional PPO algorithm, several enhancements are introduced: a dynamic penalty mechanism adaptively adjusts constraint weights, a structured reward function boosts learning efficiency, and sampling bias correction combined with global state awareness improves training stability and global coordination. Simulation experiments show that, after 10,000 iterations, the minimum task completion time drops from 98.2 s to 30 s, the penalty level decreases from 130 to 82, and the scheduling time deviation falls from 12 s to 0.5 s, representing improvements of 69.4%, 37%, and 95.8%, respectively, in the same scenario. Compared with genetic algorithms (GAs) and rule-based scheduling methods, the IPPO approach shows significant advantages in average task completion time, total system makespan, and overall throughput, along with faster convergence and better stability. These findings demonstrate that the proposed method enables effective multi-objective collaborative optimization and efficient task scheduling in complex dynamic environments, offering significant value for intelligent manufacturing and logistics systems.
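The abstract describes a dynamic penalty mechanism that adaptively reweights the three objectives (completion time, penalty level, scheduling time deviation) inside a structured reward. The following is a minimal illustrative sketch of that idea, not the authors' implementation: the function names, the linear weighted reward, and the multiplicative weight-update rule are all assumptions introduced here for clarity.

```python
# Hypothetical sketch of a multi-feature reward with a dynamic penalty
# mechanism for AGV scheduling. All names and update rules are illustrative
# assumptions, not the paper's actual reward design.

def scheduling_reward(completion_time, penalty_level, time_deviation, weights):
    """Collapse the three objectives into one scalar reward (higher is better).

    Each objective is to be minimized, so the weighted sum is negated.
    """
    w_time, w_penalty, w_dev = weights
    return -(w_time * completion_time
             + w_penalty * penalty_level
             + w_dev * time_deviation)


def update_weights(weights, violations, lr=0.1):
    """Dynamic penalty step: scale each weight up in proportion to how
    strongly its constraint is currently violated (violation in [0, 1])."""
    return tuple(w * (1.0 + lr * v) for w, v in zip(weights, violations))


if __name__ == "__main__":
    weights = (1.0, 0.5, 2.0)          # initial objective weights (assumed)
    # Reward for the converged metrics reported in the abstract.
    r = scheduling_reward(30.0, 82.0, 0.5, weights)
    print(f"reward = {r}")             # -(1*30 + 0.5*82 + 2*0.5) = -72.0
    # Suppose the penalty-level constraint is 30% violated this episode:
    weights = update_weights(weights, violations=(0.0, 0.3, 0.0))
    print(f"updated weights = {weights}")
```

Under this scheme the agent is pushed hardest on whichever constraint it currently violates most, which is one plausible way to realize the adaptive constraint weighting the abstract attributes to IPPO.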