Técnicas de aprendizagem por reforço na resolução do Mundo de Wumpus

RODRIGUES, Rodrigo Moraes

Artigo

Técnicas de aprendizagem por reforço na resolução do Mundo de Wumpus

This work aims to analyze the performance of an agent based on Reinforcement Learning. Your learning engine is based on three algorithms: Qlearning (QL), Deep Q-Network (DQN) and Double Deep Q-Network (DDQN). To validate the agent and its methods, it was defined as environment the World of Wumpus, w...

ver descrição completa

Autor principal:	RODRIGUES, Rodrigo Moraes
Grau:	Artigo
Publicado em:	2023
Assuntos:	Q-learning Deep Q-Network Double Deep Q-Network Mundo de Wumpus CNPQ::ENGENHARIAS CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO
Acesso em linha:	https://bdm.ufpa.br:8443/jspui/handle/prefix/5176

Resumo:
This work aims to analyze the performance of an agent based on Reinforcement Learning. Your learning engine is based on three algorithms: Qlearning (QL), Deep Q-Network (DQN) and Double Deep Q-Network (DDQN). To validate the agent and its methods, it was defined as environment the World of Wumpus, which was modeled according to the environment standards adopted by DeepMind Lab. From the experiments performed and their respective configurations, it was observed that the agents managed to reach the main objective only in two configurations of environments. In the 4x4 environment the winning percentage of the QL, DQN algorithms and DDQN were 0.005, 22.96, 18.73% respectively, which drastically reduced specifically for the 10x10 scenario and failing to meet the objective for the other environments.

Técnicas de aprendizagem por reforço na resolução do Mundo de Wumpus

Registros relacionados