Reinforcement Learning Speedup Method Using Q-value Initialization


Proceedings of the IEEK Conference

The Institute of Electronics and Information Engineers


최정환

Published: 2001.06.01


Abstract

In reinforcement learning, Q-learning converges quite slowly to a good policy, because searching for the goal state takes a very long time in a large stochastic domain. I therefore propose a speedup method that uses Q-value initialization for model-free reinforcement learning. The method learns a naive model of the domain and constructs boundaries around the goal state. Using these boundaries, it assigns initial Q-values to the state-action pairs and then runs Q-learning starting from those values. The initial Q-values guide the agent toward the goal state in the early stages of learning, so that Q-learning updates Q-values efficiently. The method therefore saves the exploration time spent searching for the goal state and performs better than Q-learning. I present the Speedup Q-learning algorithm to implement the speedup method. The algorithm is evaluated in a grid-world domain and compared to Q-learning.
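
The idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how the boundaries are built or which values are assigned, so everything below is an assumption made for illustration: boundaries are taken to be breadth-first distance levels around the goal, initial Q-values are discounted by the level of the successor state, and the grid is deterministic rather than stochastic. The names `boundary_levels`, `initial_q`, and `q_learning` are hypothetical and are not taken from the paper.

```python
import random
from collections import deque

ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action, size):
    """Deterministic move (assumption); hitting a wall leaves the state unchanged."""
    r, c = state
    dr, dc = ACTIONS[action]
    nr, nc = r + dr, c + dc
    return (nr, nc) if 0 <= nr < size and 0 <= nc < size else state


def boundary_levels(goal, size):
    """Assumed boundary construction: BFS distance of every cell from the goal."""
    level = {goal: 0}
    queue = deque([goal])
    while queue:
        s = queue.popleft()
        for a in range(len(ACTIONS)):
            n = step(s, a, size)
            if n not in level:
                level[n] = level[s] + 1
                queue.append(n)
    return level


def initial_q(goal, size, gamma=0.9, reward=1.0):
    """Assumed initialization: Q(s, a) shrinks with the level of the successor state."""
    level = boundary_levels(goal, size)
    q = {}
    for s in level:
        for a in range(len(ACTIONS)):
            n = step(s, a, size)
            q[(s, a)] = 0.0 if s == goal else reward * gamma ** level[n]
    return q


def q_learning(q, start, goal, size, episodes=50, alpha=0.5, gamma=0.9,
               epsilon=0.1, max_steps=1000, seed=0):
    """Tabular epsilon-greedy Q-learning; returns the step count of each episode."""
    rng = random.Random(seed)
    steps_per_episode = []
    for _ in range(episodes):
        s, steps = start, 0
        while s != goal and steps < max_steps:
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))
            else:
                a = max(range(len(ACTIONS)), key=lambda b: q[(s, b)])
            n = step(s, a, size)
            r = 1.0 if n == goal else 0.0  # reward only on reaching the goal
            best_next = 0.0 if n == goal else max(
                q[(n, b)] for b in range(len(ACTIONS)))
            q[(s, a)] += alpha * (r + gamma * best_next - q[(s, a)])
            s, steps = n, steps + 1
        steps_per_episode.append(steps)
    return steps_per_episode
```

Calling `q_learning(initial_q((4, 4), 5), (0, 0), (4, 4), 5)` runs Q-learning from the initialized table, while passing a table of zeros instead gives the plain Q-learning baseline for comparison. In this deterministic toy grid the assumed initial values already point straight at the goal; in a stochastic domain such as the one the abstract describes they would only be a rough guide that learning then has to refine.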
