The theoretical analysis of the simple transfer method is based on spectral analysis of the graph Laplacian. Low-order basis functions of the graph Laplacian capture most of the structure of the value functions, while high-order basis functions capture finer detail. If the low-order basis functions of two tasks are similar, the simple transfer method performs well. In other words, similar tasks tend to share structure in their low-order basis functions, so transferring the weights fitted in one task to another yields a good approximate policy. The experimental results show that when two tasks are similar, the policy transferred by the simple transfer method can be very close to the optimal one. However, although the simple transfer method performs well in the domain transfer cases, it cannot be used for task transfer. Furthermore, more theoretical analysis is still needed to determine how topologically similar two tasks must be for the simple transfer method to guarantee a transferred policy close to the optimal one.

The transfer method, in contrast, can be used in three transfer types: the scaling domain transfer, the topological domain transfer, and the task transfer. However, the transfer method is not always better than the simple transfer method. The experimental results show that the policy transferred by the transfer method converges earlier than a random policy; in other words, the evidence demonstrates the accelerating effect of the transfer method. The reason the transfer method works for task transfer is that it takes rewards into consideration through the modified graph Laplacian. However, evaluating this accelerating effect in a more objective manner remains a challenge, because different tasks tend to show different effects.

In this chapter, we have proposed the transfer method based on the topology of state transitions for reinforcement learning.
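The simple transfer scheme discussed above can be sketched as follows. This is a minimal illustration, not the chapter's exact algorithm: it builds the low-order Laplacian basis for each task, fits weights to the source value function by least squares, and reuses those weights on the target basis. The adjacency matrices, value vector, and the choice of `k` low-order basis functions are all hypothetical.

```python
import numpy as np

def laplacian_basis(adjacency, k):
    """Return the k low-order eigenvectors of the combinatorial
    graph Laplacian L = D - A (the smooth basis functions)."""
    degrees = np.diag(adjacency.sum(axis=1))
    laplacian = degrees - adjacency
    _, eigvecs = np.linalg.eigh(laplacian)  # eigenvalues in ascending order
    return eigvecs[:, :k]

def simple_transfer(adj_source, v_source, adj_target, k=3):
    """Fit weights of the source value function on the source low-order
    basis, then reuse those weights on the target task's basis."""
    phi_source = laplacian_basis(adj_source, k)
    weights, *_ = np.linalg.lstsq(phi_source, v_source, rcond=None)
    phi_target = laplacian_basis(adj_target, k)
    return phi_target @ weights  # approximate target value function

# Hypothetical example: a 4-state chain task transferred onto a task
# with the same topology (so the low-order bases coincide).
A = np.array([[0, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [0, 0, 1, 0]], dtype=float)
v_src = np.array([0.0, 1.0, 2.0, 3.0])
v_hat = simple_transfer(A, v_src, A, k=3)
```

Note that when the two graphs differ, the eigenvectors of each Laplacian are only defined up to sign (and ordering, under eigenvalue multiplicity), so in practice the source and target bases must be aligned before the weights are reused.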
It can be used in three transfer types: the scaling domain transfer, the topological domain transfer, and the task transfer. Because the transfer method transfers the state-value function, a perfect transition model is needed to obtain the policy. However, a perfect transition model is sometimes not easy to obtain, so extending this idea to the action-value function might be an approach to avoid this problem. Likewise, because the transfer method only deals with discrete tasks, mapping continuous tasks to discrete tasks might be an approach to handle transfer in continuous tasks.
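The dependence on a transition model mentioned above comes from the standard one-step lookahead used to turn a state-value function into a policy. The sketch below illustrates this (it is not the chapter's implementation): given assumed arrays `P` of transition probabilities, `R` of expected rewards, and a value function `V`, the greedy policy picks the action maximizing the one-step backup. Extending transfer to an action-value function Q(s, a) would let the greedy action be read off directly, without `P`.

```python
import numpy as np

def greedy_policy(P, R, V, gamma=0.95):
    """Greedy policy from a state-value function via one-step lookahead:
    pi(s) = argmax_a [ R[a, s] + gamma * sum_s' P[a, s, s'] * V[s'] ].
    P has shape (A, S, S), R has shape (A, S), V has shape (S,).
    This requires the (assumed perfect) transition model P."""
    q = R + gamma * np.einsum('asx,x->as', P, V)  # Q-values, shape (A, S)
    return q.argmax(axis=0)                       # best action per state

# Hypothetical 2-state, 2-action MDP: in state 0, action 1 moves to the
# valuable state 1 with reward 1; action 0 stays put with reward 0.
P = np.zeros((2, 2, 2))
P[0, 0, 0] = 1.0  # action 0 from state 0 -> state 0
P[0, 1, 1] = 1.0  # action 0 from state 1 -> state 1
P[1, 0, 1] = 1.0  # action 1 from state 0 -> state 1
P[1, 1, 0] = 1.0  # action 1 from state 1 -> state 0
R = np.array([[0.0, 0.0],
              [1.0, 0.0]])
V = np.array([0.0, 10.0])
pi = greedy_policy(P, R, V)  # state 0 moves toward state 1; state 1 stays
```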
Yi-Ting Tsao, Ke-Ting Xiao, Von-Wun Soo