통합 검색 | Korea Science

강화 학습에 기초한 로봇 축구 에이전트의 설계 및 구현 (Design and implementation of Robot Soccer Agent Based on Reinforcement Learning)

김인철
- 정보처리학회논문지B
- /
- 제9B권2호
- /
- pp.139-146
- /
- 2002
로봇 축구 시뮬레이션 게임은 하나의 동적 다중 에이전트 환경이다. 본 논문에서는 그러한 환경 하에서 각 에이전트의 동적 위치 결정을 위한 새로운 강화학습 방법을 제안한다. 강화학습은 한 에이전트가 환경으로부터 받는 간접적 지연 보상을 기초로 누적 보상값을 최대화할 수 있는 최적의 행동 전략을 학습하는 기계학습 방법이다. 따라서 강화학습은 입력-출력 쌍들이 훈련 예로 직접 제공되지 않는 다는 점에서 교사학습과 크게 다르다. 더욱이 Q-학습과 같은 비-모델 기반의 강화학습 알고리즘들은 주변 환경에 대한 어떤 모델도 학습하거나 미리 정의하는 것을 요구하지 않는다. 그럼에도 불구하고 이 알고리즘들은 에이전트가 모든 상태-행동 쌍들을 충분히 반복 경험할 수 있다면 최적의 행동전략에 수렴할 수 있다. 하지만 단순한 강화학습 방법들의 가장 큰 문제점은 너무 큰 상태 공간 때문에 보다 복잡한 환경들에 그대로 적용하기 어렵다는 것이다. 이런 문제점을 해결하기 위해 본 연구에서는 기존의 모듈화 Q-학습방법(MQL)을 개선한 적응적 중재에 기초한 모듈화 Q-학습 방법(AMMQL)을 제안한다. 종래의 단순한 모듈화 Q-학습 방법에서는 각 학습 모듈들의 결과를 결합하는 방식이 매우 단순하고 고정적이었으나 AMMQL학습 방법에서는 보상에 끼친 각 모듈의 기여도에 따라 모듈들에 서로 다른 가중치를 부여함으로써 보다 유연한 방식으로 각 모듈의 학습결과를 결합한다. 따라서 AMMQL 학습 방법은 큰 상태공간의 문제를 해결할 수 있을 뿐 아니라 동적인 환경변화에 보다 높은 적응성을 제공할 수 있다. 본 논문에서는 로봇 축구 에이전트의 동적 위치 결정을 위한 학습 방법으로 AMMQL 학습 방법을 사용하였고 이를 기초로 Cogitoniks 축구 에이전트 시스템을 구현하였다.
https://doi.org/10.3745/KIPSTB.2002.9B.2.139 인용 PDF KSCI

On a Multi-Agent System for Assisting Human Intention

Tawaki, Hajime;Tan, Joo Kooi;Kim, Hyoung-Seop;Ishikawa, Seiji
- 제어로봇시스템학회:학술대회논문집
- /
- 제어로봇시스템학회 2003년도 ICCAS
- /
- pp.1126-1129
- /
- 2003
In this paper, we propose a multi-agent system for assisting those who need help in taking objects around him/her. One may imagine this kind of situation when a person is lying in bed and wishes to take an object on a distant table that cannot be reached only by stretching his/her hand. The proposed multi-agent system is composed of three main independent agents; a vision agent, a robot agent, and a pass agent. Once a human expresses his/her intention by pointing to a particular object using his/her hand and a finger, these agents cooperatively bring the object to him/her. Natural communication between a human and the multi-agent system is realized in this way. Performance of the proposed system is demonstrated in an experiment, in which a human intends to take one of the four objects on the floor and the three agents successfully cooperate to find out the object and to bring it to the human.
PDF

Dynamic Positioning of Robot Soccer Simulation Game Agents using Reinforcement learning

Kwon, Ki-Duk;Cho, Soo-Sin;Kim, In-Cheol
- 한국지능정보시스템학회:학술대회논문집
- /
- 한국지능정보시스템학회 2001년도 The Pacific Aisan Confrence On Intelligent Systems 2001
- /
- pp.59-64
- /
- 2001
The robot soccer simulation game is a dynamic multi-agent environment. In this paper we suggest a new reinforcement learning approach to each agent's dynamic positioning in such dynamic environment. Reinforcement learning is the machine learning in which an agent learns from indirect, delayed reward an optimal policy to chose sequences of actions that produce the greatest cumulative reward. Therefore the reinforcement learning is different from supervised learning in the sense that there is no presentation of input pairs as training examples. Furthermore, model-free reinforcement learning algorithms like Q-learning do not require defining or learning any models of the surrounding environment. Nevertheless it can learn the optimal policy if the agent can visit every state- action pair infinitely. However, the biggest problem of monolithic reinforcement learning is that its straightforward applications do not successfully scale up to more complex environments due to the intractable large space of states. In order to address this problem. we suggest Adaptive Mediation-based Modular Q-Learning (AMMQL)as an improvement of the existing Modular Q-Learning (MQL). While simple modular Q-learning combines the results from each learning module in a fixed way, AMMQL combines them in a more flexible way by assigning different weight to each module according to its contribution to rewards. Therefore in addition to resolving the problem of large state effectively, AMMQL can show higher adaptability to environmental changes than pure MQL. This paper introduces the concept of AMMQL and presents details of its application into dynamic positioning of robot soccer agents.
PDF

Reinforcement Learning Approach to Agents Dynamic Positioning in Robot Soccer Simulation Games

Kwon, Ki-Duk;Kim, In-Cheol
- 한국시뮬레이션학회:학술대회논문집
- /
- 한국시뮬레이션학회 2001년도 The Seoul International Simulation Conference
- /
- pp.321-324
- /
- 2001
The robot soccer simulation game is a dynamic multi-agent environment. In this paper we suggest a new reinforcement learning approach to each agent's dynamic positioning in such dynamic environment. Reinforcement Beaming is the machine learning in which an agent learns from indirect, delayed reward an optimal policy to choose sequences of actions that produce the greatest cumulative reward. Therefore the reinforcement loaming is different from supervised teaming in the sense that there is no presentation of input-output pairs as training examples. Furthermore, model-free reinforcement loaming algorithms like Q-learning do not require defining or loaming any models of the surrounding environment. Nevertheless it can learn the optimal policy if the agent can visit every state-action pair infinitely. However, the biggest problem of monolithic reinforcement learning is that its straightforward applications do not successfully scale up to more complex environments due to the intractable large space of states. In order to address this problem, we suggest Adaptive Mediation-based Modular Q-Learning(AMMQL) as an improvement of the existing Modular Q-Learning(MQL). While simple modular Q-learning combines the results from each learning module in a fixed way, AMMQL combines them in a more flexible way by assigning different weight to each module according to its contribution to rewards. Therefore in addition to resolving the problem of large state space effectively, AMMQL can show higher adaptability to environmental changes than pure MQL. This paper introduces the concept of AMMQL and presents details of its application into dynamic positioning of robot soccer agents.
PDF

다개체 협력 시스템을 위한 비젼 기반 축구 로봇 시스템의 개발 (Development of vision-based soccer robots for multi-agent cooperative systems)

심현식;정명진;최인환;김종환
- 제어로봇시스템학회:학술대회논문집
- /
- 제어로봇시스템학회 1997년도 한국자동제어학술회의논문집; 한국전력공사 서울연수원; 17-18 Oct. 1997
- /
- pp.608-611
- /
- 1997
The soccer robot system consists of multi agents, with highly coordinated operation and movements so as to fulfill specific objectives, even under adverse situation. The coordination of the multi-agents is associated with a lot of supplementary work in advance. The associated issues are the position correction, prevention of communication congestion, local information sensing in addition to the need for imitating the human-like decision making. A control structure for soccer robot is designed and several behaviors and actions for a soccer robot are proposed. Variable zone defense as a basic strategy and several special strategies for fouls are applied to SOTY2 team.
PDF

URL 분석을 위한 웹 로봇 구현 및 성능분석 (Implementation and Performance Analysis of Web Robot for URL Analysis)

김원;김희철;진용옥
- 한국통신학회논문지
- /
- 제27권3C호
- /
- pp.226-233
- /
- 2002
This paper proposes the web robot based on Multi-Agent which the mutual dependency should be minimized each other with dividing the function each to collect Webpage. In result it is written to make a foundation for producing the effective statistics to analyze the domestic webpages and text, multimedia file composition ratio through performance analysis of the implemented system. It is easy that Web robot of the sequential processing method to collect Webpage on the same resource environment produces the limit of collecting performance. So to speak Webpages have "Dead-links" URL which is produced by the temporary host down and instability of network resource. If there are much "Dead-links" URL in the webpages, it takes a lot of time for web robot to collect HTML. The propose of this paper to be proposed, makes the maximum improvement to extract the webpages to process "Dead-links" URL on the Inactive URL scanner Agent.
PDF KSCI

Cognitive and Emotional Structure of a Robotic Game Player in Turn-based Interaction

Yang, Jeong-Yean
- International journal of advanced smart convergence
- /
- 제4권2호
- /
- pp.154-162
- /
- 2015
This paper focuses on how cognitive and emotional structures affect humans during long-term interaction. We design an interaction with a turn-based game, the Chopstick Game, in which two agents play with numbers using their fingers. While a human and a robot agent alternate turn, the human user applies herself to play the game and to learn new winning skills from the robot agent. Conventional valence and arousal space is applied to design emotional interaction. For the robotic system, we implement finger gesture recognition and emotional behaviors that are designed for three-dimensional virtual robot. In the experimental tests, the properness of the proposed schemes is verified and the effect of the emotional interaction is discussed.
https://doi.org/10.7236/IJASC.2015.4.2.154 인용 PDF KSCI

다 개체 시스템의 협동 행동제어기 (Cooperative Action Controller of Multi-Agent System)

김용백;장홍민;김대준;최영규;김성신
- 대한전기학회:학술대회논문집
- /
- 대한전기학회 1999년도 하계학술대회 논문집 G
- /
- pp.3024-3026
- /
- 1999
This paper presents a cooperative action controller of a multi-agent system. To achieve an object, i.e. win a game, it is necessary that a robot has its own roles, actions and work with each other. The presented incorporated action controller consists of the role selection, action selection and execution layer. In the first layer, a fuzzy logic controller is used. Each robot selects its own action and makes its own path trajectory in the second layer. In the third layer, each robot performs their own action based on the velocity information which is sent from main computer. Finally, simulation shows that each robot selects proper roles and incorporates actions by the proposed controller.
PDF

지능로봇에서 에이전트와 ESB를 사용한 서비스 지향 애플리케이션의 자가 재구성 (Self-Reconfiguration of Service-Oriented Application using Agent and ESB in Intelligent Robot)

이재정;김진한;이창호;이병정
- 한국정보과학회논문지:컴퓨팅의 실제 및 레터
- /
- 제14권8호
- /
- pp.813-817
- /
- 2008
지능로봇(Intelligent Robot)은 주변환경을 감지하는 센서로부터 실시간 정보를 수집하고 지능적인 기능을 수행한다. 지능로봇의 자가 재구성(Self-Reconfiguration) 능력은 외부 환경의 변화에 대응하기 위해 기능을 재구성하고, 오류가 발생하였을 때 중지 없이 스스로 회복할 수 있는 중요한 요소이다. 본 논문에서는 ESB(Enterprise Service Bus)를 사용한 지능로봇의 에이전트 기반 자가 재구성 프레임워크를 제안한다. 본 논문의 프궤임워크는 멀티에이전트 시스템을 이용한 서비스 지향 애플리케이션의 동적인 발견과 자가 재구성에 초점을 맞춘다. 지능로봇이 예외적인 상황을 만났을 때, 지능로봇은 외부의 서비스 저장소로부터 새로운 서비스를 다운로드 후 실행시켜 상황을 해결한다. 에이전트 기술은 로봇들이 상호작용하기 위한 지능적인 접근법을 제공하고, ESB는 분산된 서비스 또는 지식을 활용하고 조직하기 위한 방법을 제공한다. 또한 본 연구의 유효성을 보여주기 위해 프로토타입을 구현하였다.
PDF KSCI

축구 로봇을 위한 제어기 설계 (The design of controllers for soccer robots)

김광춘;김동한;김종환
- 제어로봇시스템학회:학술대회논문집
- /
- 제어로봇시스템학회 1997년도 한국자동제어학술회의논문집; 한국전력공사 서울연수원; 17-18 Oct. 1997
- /
- pp.612-616
- /
- 1997
In this paper, two kinds of controller are proposed for a soccer robot system.. One for Supervisor and defense mode, and the other for attack mode. Robot soccer game has very dynamic characteristics. Furthermore, there exist competitions between agents. The soccer-playing robot should take an appropriate action according to its surroundings. Initially, an attack mode controller using a vector field concept is designed, then a supervisor and a defense mode controller are designed with a Petri-net. The efficiency and applicability of the proposed controllers are demonstrated through a real robot soccer game(MiroSot 97).
PDF

검색결과 147건 처리시간 0.029초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)