Dame Seck DiopSamuel Yanes LuisManuel A. Perales‐EsteveDaniel Gutiérrez ReinaS. L. Toral
Plastic pollution in water bodies threatens and disrupts aquatic life, requiring effective cleanup solutions. This paper proposes a strategy for plastic cleanup using a fleet of autonomous surface vehicles in a multitask scenario, with a focus on both exploration and cleaning tasks. The mission is decoupled into two phases: an exploration phase for locating trash and a cleaning phase for collection. A Multitask Deep Q‐Network with two heads estimates Q ‐values for each task, and all ASVs share the same policy through an egocentric state formulation to enhance scalability. A multiobjective learning approach is applied, resulting in distinct policies that balance the duration of the exploration and cleaning phases, leading to the construction of a Pareto front, which provides a visual representation of trade‐offs between task priorities. The framework adapts to various environmental conditions, demonstrated in both the larger Malaga Port and the smaller Alamillo Lake. The study also highlights the importance of a dedicated exploration phase for larger areas, while minimal exploration is sufficient for smaller spaces. Compared to the decomposition weighting sum strategy, the approach consistently produces superior Pareto‐optimal policies, ensuring broader and more effective exploration of the objective space.
Yuxiang MaiYifan ZangQiyue YinWancheng NiKaiqi Huang
Shang ZhangRui YangZhen ChenMing Li
Vanita JainAditya ChaudhryManas BatraPrakhar Gupta
Rishikesh Venkatrao KadamVishakha VidhaniBhavika ValechaAnushree BaneNupur Giri