IDEAS home Printed from https://ideas.repec.org/a/epw/ejece0/v4y2020i6id19262.html

Research on Tensor-Based Cooperative and Competitive in Multi-Agent Reinforcement Learning

Author

Listed:
  • Tsega Weldu Araya

    (Northwestern Polytechnical University, China)

  • Md Rashed Ibn Nawab

    (Northwestern Polytechnical University, China)

  • A. P. Yuan Ling

    (Huazhong University of Science and Technology, China)

Abstract

As technology overgrows, the assortment of information and the density of work becomes demanding to manage. To resolve the density of employment and human labor, machine-learning (ML) technology developed. Reinforcement learning (RL) is the recent advancement of ML studies. Multi-agent reinforcement learning (MARL) is useful to train multiple agents in the surrounding environment. The previous research studies focused on two-agent cooperation. Their data representation was held in a two-dimensional array, which is called a matrix. The limitation of this two-dimensional array appears as the training data of agents increases. The growth in the training data of agents creates storage drawbacks and data redundancy. Our first aim in this research is to improve an algorithm that can represent MARL training in tensor. In MARL, multiple agents are work together to achieve joint work. To share the training records and data of numerous agents, we need to collect the previous cumulative experience of agents in tensor. Secondly, we will discover the agent's cooperation and competition, with local and global goals of agents in MARL. Local goals are the cooperation of agents in a group or team where we use the training model as a student and teacher agent. The global goal is the competition between two contrary teams to acquire the reward. All learning agents have their Q table for storing the individual agent's training data in an environment. The growth in the number of learning agents, their training experience in Q tables, and the requirement for representing multiple data become the most challenging issue. We introduce tensor to store various data to resolve the challenges for data representation in multiple agent associations. Tensor is expressed as the three-dimensional array, although it is an N-way array, which is useful for representing and accessing numerous data. Finally, we will implement an algorithm for learning three cooperative agents against the opposed team using a tensor-based framework in the Q learning algorithm. We will provide an algorithm that can store the training records and data of multiple agents. Tensor advances to get a small storage size than the matrix for the training records of agents. Although three agent cooperation benefits to having maximum optimal reward.

Suggested Citation

  • Tsega Weldu Araya & Md Rashed Ibn Nawab & A. P. Yuan Ling, 2020. "Research on Tensor-Based Cooperative and Competitive in Multi-Agent Reinforcement Learning," European Journal of Electrical Engineering and Computer Science, European Open Science, vol. 4(6), November.
  • Handle: RePEc:epw:ejece0:v:4:y:2020:i:6:id:19262
    DOI: 10.24018/ejece.2020.4.6.262
    as

    Download full text from publisher

    File URL: https://eu-opensci.org/index.php/ejece/article/view/19262
    File Function: Abstract page
    Download Restriction: no

    File URL: https://eu-opensci.org/index.php/ejece/article/download/19262/11136
    File Function: Full text
    Download Restriction: no

    File URL: https://libkey.io/10.24018/ejece.2020.4.6.262?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:epw:ejece0:v:4:y:2020:i:6:id:19262. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: support (email available below). General contact details of provider: https://eu-opensci.org/index.php/ejece .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.