Annak érdekében, hogy Önnek a legjobb élményt nyújtsuk "sütiket" használunk honlapunkon. Az oldal használatával Ön beleegyezik a "sütik" használatába.

Applying Expert Heuristic as an a Priori Knowledge for FRIQ-Learning

  •  Minősített cikkek
  • 2023-02-02 11:15:00
Many Reinforcement Learning methods start the learning phase from an empty, or randomly filled knowledge-base. Having some a priori knowledge about the way as the studied system could be controlled, e.g. in the form of some state-action control rules, the convergence speed of the learning process can be significantly improved. In this case, the learning stage could start from a sketch, from a knowledge-base formed based upon the already existing knowledge. In this paper. the a priori (expert) knowledge is considered to be given in the form state-action fuzzy control rules of a Fuzzy Rule Interpolation (FRI) reasoning model and the studied reinforcement learning method is restricted to be a Fuzzy Rule Interpolation-based Q-Learning (FRIQ-Learning) method. The main goal of this paper is the introduction of a methodology, which is suitable for merging the a priori stateaction fuzzy control rule-base to the initial state-action-value function (Q-function) representation. For demonstrating the benefits of the suggested methodology, the a priori knowledge-base accelerated FRIQ-Learning solution of the “mountain car” benchmark is also discussed briefly in the paper

A teljes cikk innen tölthető le.

 

 

Hivatkozás

MLA: Tompa, Tamás, and Szilveszter Kovács. "Applying expert heuristic as an a priori knowledge for FRIQ-learning." Acta Polytechnica Hungarica 17.4 (2020): 27-45.

APA:  Tompa, T., & Kovács, S. (2020). Applying expert heuristic as an a priori knowledge for FRIQ-learning. Acta Polytechnica Hungarica17(4), 27-45.

ISO690: TOMPA, Tamás; KOVÁCS, Szilveszter. Applying expert heuristic as an a priori knowledge for FRIQ-learning. Acta Polytechnica Hungarica, 2020, 17.4: 27-45.

BibTeX:

@article{tompa2020applying,
  title={Applying expert heuristic as an a priori knowledge for FRIQ-learning},
  author={Tompa, Tam{'a}s and Kov{'a}cs, Szilveszter},
  journal={Acta Polytechnica Hungarica},
  volume={17},
  number={4},
  pages={27--45},
  year={2020}
}

 

 

 

Megosztás