Locally Centralized Execution for Less Redundant Computation in Multi-Agent Cooperation

Bibliographic Details
Title: Locally Centralized Execution for Less Redundant Computation in Multi-Agent Cooperation
Authors: Yidong Bai, Toshiharu Sugawara
Source: Information, Vol 15, Iss 5, p 279 (2024)
Publisher Information: MDPI AG, 2024.
Publication Year: 2024
Collection: LCC:Information technology
Subject Terms: cooperation, multi-agent deep reinforcement learning, redundant computation, Information technology, T58.5-58.64
More Details: Decentralized execution is a widely used framework in multi-agent reinforcement learning. However, it has a well-known but neglected shortcoming, redundant computation, that is, the same/similar computation is performed redundantly in different agents owing to their overlapping observations. This study proposes a novel method, the locally centralized team transformer (LCTT), to address this problem. This method first proposes a locally centralized execution framework that autonomously determines some agents as leaders that generate instructions and other agents as workers to act according to the received instructions without running their policy networks. For the LCTT, we subsequently propose the team-transformer (T-Trans) structure, which enables leaders to generate targeted instructions for each worker, and the leadership shift, which enables agents to determine those that should instruct or be instructed by others. The experimental results demonstrated that the proposed method significantly reduces redundant computations without decreasing rewards and achieves faster learning convergence.
Document Type: article
File Description: electronic resource
Language: English
ISSN: 2078-2489
Relation: https://www.mdpi.com/2078-2489/15/5/279; https://doaj.org/toc/2078-2489
DOI: 10.3390/info15050279
Access URL: https://doaj.org/article/4d99cc3690504944808f13cc531c5018
Accession Number: edsdoj.4d99cc3690504944808f13cc531c5018
Database: Directory of Open Access Journals
More Details
ISSN:20782489
DOI:10.3390/info15050279
Published in:Information
Language:English