A MAS architecture consisting of service centers is proposed. Within each service center, a mediator coordinates service delivery by allocating individual tasks to corresponding task specialist agents depending on their prior performance while anticipating performance of newly entering agents. By basing mediator behavior on a novel multicriteria-driven (including quality of service, deadline, reputation, cost, and user preferences) reinforcement learning algorithm, integrating the exploitation of acquired knowledge with optimal, undirected, continual exploration, adaptability to changes in agent availability and performance is ensured. The reported experiments indicate the algorithm behaves as expected and outperforms two standard approaches.
Ivan JuretaStéphane FaulknerYoussef AchbanyMarco Saerens
Qian LinZhu MeiJun YuGuangxin ZhuMei FengWenda LuLin WangHengmao PangMingjie XuHaiyang Chen
Thomas WeishäuplErich Schikuta
ChakHuah TanZhao YizhiMing LuoSomchaya LiemhetcharatJingbing ZhangMingMao WongJun-Hong Zhou