• Title/Summary/Keyword: Action based learning

Search Result 374, Processing Time 0.032 seconds

Strategy of Object Search for Distributed Autonomous Robotic Systems

  • Kim Ho-Duck;Yoon Han-Ul;Sim Kwee-Bo
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.6 no.3
    • /
    • pp.264-269
    • /
    • 2006
  • This paper presents the strategy for searching a hidden object in an unknown area for using by multiple distributed autonomous robotic systems (DARS). To search the target in Markovian space, DARS should recognize th ε ir surrounding at where they are located and generate some rules to act upon by themselves. First of all, DARS obtain 6-distances from itself to environment by infrared sensor which are hexagonally allocated around itself. Second, it calculates 6-areas with those distances then take an action, i.e., turn and move toward where the widest space will be guaranteed. After the action is taken, the value of Q will be updated by relative formula at the state. We set up an experimental environment with five small mobile robots, obstacles, and a target object, and tried to research for a target object while navigating in a un known hallway where some obstacles were placed. In the end of this paper, we present the results of three algorithms - a random search, an area-based action making process to determine the next action of the robot and hexagon-based Q-learning to enhance the area-based action making process.

A Study on the Development of a Training Program to Reinforce the Teachers' Performance as Facilitators (교원의 퍼실리테이터 수행지원 강화를 위한 연수 프로그램 개발 연구)

  • Jung, Ju-Young;Hong, Kwang-Pyo
    • Journal of Fisheries and Marine Sciences Education
    • /
    • v.22 no.3
    • /
    • pp.431-444
    • /
    • 2010
  • This research aims at developing a teachers' training program to reinforce teachers' capability to perform the action learning program. To accomplish this goal, the key value of the training program based on action learning, the process of the core learning activities, and the elements to support learners and facilitators respectively were deducted on the foundation of documentary research and case study, based on which, the program was developed through the formative test by professionals and application to the field. This research was applied to 105 middle or high school teachers, the participants of the in-service training on creative problem solving hosted by B metropolitan city for one week (30 hours) from 9 a.m. on Monday, January 25th, 2010 to 4 p.m. on Friday, January 29th. The result of this research is as follows. First, as for the key values of this study, (1) the team-based learning centered on the trainees, not lecturers-oriented, knowledge-transmitting training, is possible, (2)for each process, guidelines, related information, tools, and various kinds of media are supported just in time, and (3)a focus is given on fostering facilitators centered on teachers. Second, the process of the core learning activities of the teachers' training program based on action learning consists of the procedure of a prior lecture${\rightarrow}$break${\rightarrow}$investigation into problems${\rightarrow}$clarification of problems${\rightarrow}$drawing possible solutions${\rightarrow}$decision on the priority${\rightarrow}$making an action plan${\rightarrow}$performance${\rightarrow}$evaluation, and on each stage, the contents for the activities of teachers and learners and detailed supportive elements are offered.

Motivation based Behavior Sequence Learning for an Autonomous Agent in Virtual Reality

  • Song, Wei;Cho, Kyung-Eun;Um, Ky-Hyun
    • Journal of Korea Multimedia Society
    • /
    • v.12 no.12
    • /
    • pp.1819-1826
    • /
    • 2009
  • To enhance the automatic performance of existing predicting and planning algorithms that require a predefined probability of the states' transition, this paper proposes a multiple sequence generation system. When interacting with unknown environments, a virtual agent needs to decide which action or action order can result in a good state and determine the transition probability based on the current state and the action taken. We describe a sequential behavior generation method motivated from the change in the agent's state in order to help the virtual agent learn how to adapt to unknown environments. In a sequence learning process, the sensed states are grouped by a set of proposed motivation filters in order to reduce the learning computation of the large state space. In order to accomplish a goal with a high payoff, the learning agent makes a decision based on the observation of states' transitions. The proposed multiple sequence behaviors generation system increases the complexity and heightens the automatic planning of the virtual agent for interacting with the dynamic unknown environment. This model was tested in a virtual library to elucidate the process of the system.

  • PDF

Multi-Agent Reinforcement Learning Model based on Fuzzy Inference (퍼지 추론 기반의 멀티에이전트 강화학습 모델)

  • Lee, Bong-Keun;Chung, Jae-Du;Ryu, Keun-Ho
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.10
    • /
    • pp.51-58
    • /
    • 2009
  • Reinforcement learning is a sub area of machine learning concerned with how an agent ought to take actions in an environment so as to maximize some notion of long-term reward. In the case of multi-agent, especially, which state space and action space gets very enormous in compared to single agent, so it needs to take most effective measure available select the action strategy for effective reinforcement learning. This paper proposes a multi-agent reinforcement learning model based on fuzzy inference system in order to improve learning collect speed and select an effective action in multi-agent. This paper verifies an effective action select strategy through evaluation tests based on Robocup Keepaway which is one of useful test-beds for multi-agent. Our proposed model can apply to evaluate efficiency of the various intelligent multi-agents and also can apply to strategy and tactics of robot soccer system.

Comparison of learning performance of character controller based on deep reinforcement learning according to state representation (상태 표현 방식에 따른 심층 강화 학습 기반 캐릭터 제어기의 학습 성능 비교)

  • Sohn, Chaejun;Kwon, Taesoo;Lee, Yoonsang
    • Journal of the Korea Computer Graphics Society
    • /
    • v.27 no.5
    • /
    • pp.55-61
    • /
    • 2021
  • The character motion control based on physics simulation using reinforcement learning continue to being carried out. In order to solve a problem using reinforcement learning, the network structure, hyperparameter, state, action and reward must be properly set according to the problem. In many studies, various combinations of states, action and rewards have been defined and successfully applied to problems. Since there are various combinations in defining state, action and reward, many studies are conducted to analyze the effect of each element to find the optimal combination that improves learning performance. In this work, we analyzed the effect on reinforcement learning performance according to the state representation, which has not been so far. First we defined three coordinate systems: root attached frame, root aligned frame, and projected aligned frame. and then we analyze the effect of state representation by three coordinate systems on reinforcement learning. Second, we analyzed how it affects learning performance when various combinations of joint positions and angles for state.

Aspect-based Sentiment Analysis of Product Reviews using Multi-agent Deep Reinforcement Learning

  • M. Sivakumar;Srinivasulu Reddy Uyyala
    • Asia pacific journal of information systems
    • /
    • v.32 no.2
    • /
    • pp.226-248
    • /
    • 2022
  • The existing model for sentiment analysis of product reviews learned from past data and new data was labeled based on training. But new data was never used by the existing system for making a decision. The proposed Aspect-based multi-agent Deep Reinforcement learning Sentiment Analysis (ADRSA) model learned from its very first data without the help of any training dataset and labeled a sentence with aspect category and sentiment polarity. It keeps on learning from the new data and updates its knowledge for improving its intelligence. The decision of the proposed system changed over time based on the new data. So, the accuracy of the sentiment analysis using deep reinforcement learning was improved over supervised learning and unsupervised learning methods. Hence, the sentiments of premium customers on a particular site can be explored to other customers effectively. A dynamic environment with a strong knowledge base can help the system to remember the sentences and usage State Action Reward State Action (SARSA) algorithm with Bidirectional Encoder Representations from Transformers (BERT) model improved the performance of the proposed system in terms of accuracy when compared to the state of art methods.

Human Action Recognition Using Pyramid Histograms of Oriented Gradients and Collaborative Multi-task Learning

  • Gao, Zan;Zhang, Hua;Liu, An-An;Xue, Yan-Bing;Xu, Guang-Ping
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.8 no.2
    • /
    • pp.483-503
    • /
    • 2014
  • In this paper, human action recognition using pyramid histograms of oriented gradients and collaborative multi-task learning is proposed. First, we accumulate global activities and construct motion history image (MHI) for both RGB and depth channels respectively to encode the dynamics of one action in different modalities, and then different action descriptors are extracted from depth and RGB MHI to represent global textual and structural characteristics of these actions. Specially, average value in hierarchical block, GIST and pyramid histograms of oriented gradients descriptors are employed to represent human motion. To demonstrate the superiority of the proposed method, we evaluate them by KNN, SVM with linear and RBF kernels, SRC and CRC models on DHA dataset, the well-known dataset for human action recognition. Large scale experimental results show our descriptors are robust, stable and efficient, and outperform the state-of-the-art methods. In addition, we investigate the performance of our descriptors further by combining these descriptors on DHA dataset, and observe that the performances of combined descriptors are much better than just using only sole descriptor. With multimodal features, we also propose a collaborative multi-task learning method for model learning and inference based on transfer learning theory. The main contributions lie in four aspects: 1) the proposed encoding the scheme can filter the stationary part of human body and reduce noise interference; 2) different kind of features and models are assessed, and the neighbor gradients information and pyramid layers are very helpful for representing these actions; 3) The proposed model can fuse the features from different modalities regardless of the sensor types, the ranges of the value, and the dimensions of different features; 4) The latent common knowledge among different modalities can be discovered by transfer learning to boost the performance.

Behavior-based Learning Controller for Mobile Robot using Topological Map (Topolgical Map을 이용한 이동로봇의 행위기반 학습제어기)

  • Yi, Seok-Joo;Moon, Jung-Hyun;Han, Shin;Cho, Young-Jo;Kim, Kwang-Bae
    • Proceedings of the KIEE Conference
    • /
    • 2000.07d
    • /
    • pp.2834-2836
    • /
    • 2000
  • This paper introduces the behavior-based learning controller for mobile robot using topological map. When the mobile robot navigates to the goal position, it utilizes given information of topological map and its location. Under navigating in unknown environment, the robot classifies its situation using ultrasonic sensor data, and calculates each motor schema multiplied by respective gain for all behaviors, and then takes an action according to the vector sum of all the motor schemas. After an action, the information of the robot's location in given topological map is incorporated to the learning module to adapt the weights of the neural network for gain learning. As a result of simulation, the robot navigates to the goal position successfully after iterative gain learning with topological information.

  • PDF

Design of Autonomous Mobile Robot System Based on Artificial Immune Network and Internet (인공 면역망과 인터넷에 의한 자율이동로봇 시스템 설계)

  • Lee, Dong-Je;Lee, Min-Jung;Choi, Young-Kiu
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • v.50 no.11
    • /
    • pp.522-531
    • /
    • 2001
  • Recently conventional artificial intelligence(AI) approaches have been employed to build action selectors for the autonomous mobile robot(AMR). However, in these approaches, the decision making process to choose an action from multiple competence modules is still an open question. Many researches have been focused on the reactive planning systems such as the biological immune system. In this paper, we attempt to construct an action selector for an AMR based on the artificial immune network and internet. The information from vision sensors is used for antibody. We propose a learning method for artificial immune network using evolutionary algorithm to produce antibody automatically. The internet environment for an AMR action selector shows the usefulness of the proposed learning artificial immune network application.

  • PDF

A Study on Intention to Solve the Problem via the Prior Experience of Creativity Programs based on the Action Learning Methodology (액션러닝방법론에 기반한 창의성프로그램 참여경험에 따른 문제해결의도의 영향에 관한 연구)

  • Kim, Soung-Hyun;Park, JaeSung
    • Journal of Digital Convergence
    • /
    • v.19 no.6
    • /
    • pp.73-83
    • /
    • 2021
  • This study identified the factors that influence the problem solving intention of students who completed the action learning-based creativity programs based on the theory of reasoned action, and examined the influence relationship between these factors. As a result of the study, first, subjective norms of problem solving had a positive effect on problem solving intention, and second, attitude toward problem solving had a significant effect on problem solving intention. Third, it was found that students who had experience of participating in a patent education program applying the action learning methodology had a positive effect on the relationship between their attitude toward problem solving and their intention to solve the problem. These findings confirmed that in order to enhance the problem-solving intention, which is the basis for creativity of university students, it is important to spread the positive value of problem-solving within the university and to give students a positive attitude toward problem solving.