• Title/Summary/Keyword: multi-task learning

Learning soccer robot using genetic programming

  • Wang, Xiaoshu;Sugisaka, Masanori
    • Institute of Control, Robotics and Systems: Conference Proceedings (제어로봇시스템학회:학술대회논문집) / 1999.10a / pp.292-297 / 1999
  • Evolving an artificial agent is an extremely difficult problem, but it is also a challenging task. At present, studies mainly center on the single-agent learning problem. In our case, we use simulated soccer to investigate multi-agent cooperative learning. Considering the fundamental differences in learning mechanisms, existing reinforcement learning algorithms can be roughly classified into two types: those based on evaluation functions and those that search the policy space directly. Genetic Programming, developed from Genetic Algorithms, is one of the best-known approaches belonging to the latter. In this paper, we first give a detailed algorithm description as well as the data construction necessary for learning single-agent strategies. In the following step, we extend the developed methods to multi-robot domains. We investigate and contrast two different methods, simple team learning and sub-group learning, and conclude the paper with some experimental results.

Additional Learning Framework for Multipurpose Image Recognition

  • Itani, Michiaki;Iyatomi, Hitoshi;Hagiwara, Masafumi
    • Proceedings of the Korean Institute of Intelligent Systems Conference / 2003.09a / pp.480-483 / 2003
  • We propose a new framework that aims at multi-purpose image recognition, a difficult task for conventional rule-based systems. The framework is built on the idea of a computer-based learning algorithm. In this research, we introduce two new functions, additional learning and knowledge reconstruction, into the Fuzzy Inference Neural Network (FINN) (1) so that the system can accommodate new objects and improve its accuracy as necessary. We examine the capability of the proposed framework using two examples. The first is the capital letter recognition task from the UCI machine learning repository, used to estimate the effectiveness of the framework itself. Even though the whole training data was not given in advance, the proposed framework operated with only a small loss of accuracy thanks to the additional learning and knowledge reconstruction functions. The second is scenery image recognition. We confirmed that the proposed framework could recognize images with high accuracy and accommodate new objects recursively.

A Reinforcement learning-based for Multi-user Task Offloading and Resource Allocation in MEC

  • Xiang, Tiange;Joe, Inwhee
    • Proceedings of the Korea Information Processing Society Conference / 2022.05a / pp.45-47 / 2022
  • Mobile edge computing (MEC), which enables mobile terminals to offload computational tasks to a server located at the user's edge, is considered an effective way to reduce the heavy computational burden and achieve efficient computational offloading. In this paper, we study a multi-user MEC system in which multiple user devices (UEs) can offload computation to the MEC server via a wireless channel. To solve the resource allocation and task offloading problem, we take the total cost of latency and energy consumption of all UEs as our optimization objective. To minimize the total cost of the considered MEC system, we propose a DRL-based method to solve the resource allocation problem in wireless MEC. Specifically, we propose an Asynchronous Advantage Actor-Critic (A3C)-based scheme. A3C is applied to this framework and compared with DQN and Double Q-Learning. Simulation results show that this scheme significantly reduces the total cost compared to other resource allocation schemes.
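
The optimization objective described in this abstract, a total cost combining latency and energy over all UEs, might be sketched as follows. This is only an illustration: the abstract does not give the cost model, so the local-execution model (energy proportional to `kappa * f^2` per cycle), the offloading model (upload time plus server execution time), the weights `w_time` and `w_energy`, and all function names are assumptions, not the authors' formulation. The A3C agent itself is not shown; it would choose the offloading decisions that this cost evaluates.

```python
from dataclasses import dataclass


@dataclass
class Task:
    data_bits: float   # input data to upload if the task is offloaded
    cpu_cycles: float  # CPU cycles required to complete the task


def local_cost(task, f_local_hz, kappa, w_time, w_energy):
    """Weighted latency/energy cost of running the task on the UE itself (assumed model)."""
    latency = task.cpu_cycles / f_local_hz
    energy = kappa * f_local_hz ** 2 * task.cpu_cycles
    return w_time * latency + w_energy * energy


def offload_cost(task, rate_bps, tx_power_w, f_server_hz, w_time, w_energy):
    """Weighted latency/energy cost of offloading the task to the MEC server (assumed model)."""
    upload_time = task.data_bits / rate_bps
    latency = upload_time + task.cpu_cycles / f_server_hz
    energy = tx_power_w * upload_time  # UE only spends energy on transmission
    return w_time * latency + w_energy * energy


def total_cost(local_costs, offload_costs, offload_flags):
    """Sum over all UEs of the cost of whichever option each UE was assigned."""
    return sum(
        off if flag else loc
        for loc, off, flag in zip(local_costs, offload_costs, offload_flags)
    )
```

A learning agent would treat the negative of `total_cost` as its reward signal under these assumptions, so that lower combined latency and energy yields higher reward.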

Explicit Dynamic Coordination Reinforcement Learning Based on Utility

  • Si, Huaiwei;Tan, Guozhen;Yuan, Yifu;Peng, Yanfei;Li, Jianping
    • KSII Transactions on Internet and Information Systems (TIIS) / v.16 no.3 / pp.792-812 / 2022
  • Multi-agent systems often need coordination to learn a task more effectively. Although the introduction of deep learning has addressed the state space problem, multi-agent learning remains infeasible because of the joint action space. Large-scale joint action spaces can be made sparse through an implicit or explicit coordination structure, which ensures reasonable coordinated actions. In general, a multi-agent system is dynamic, which makes both the relations among agents and the coordination structure dynamic. An explicit coordination structure can therefore better represent the coordinative relationships among agents and achieve better coordination between them. Inspired by the maximization of social group utility, we dynamically construct a factor graph as an explicit coordination structure that expresses the coordinative relationships according to the utility among agents, and we estimate the joint action values based on the local utility transfer among factor graphs. We present an application of these techniques to multiple intelligent vehicle systems, where the state and action spaces are problematic and the agents have too many interactions. The results on the multiple intelligent vehicle systems demonstrate the efficiency and effectiveness of the proposed methods.

Direct Learning Control for a Class of Multi-Input Multi-Output Nonlinear Systems (다입력 다출력 비선형시스템에 대한 직접학습제어)

  • 안현식
    • Journal of the Institute of Electronics Engineers of Korea SC / v.40 no.2 / pp.19-25 / 2003
  • For a class of multi-input multi-output nonlinear systems that perform a given task repetitively, an extended type of direct learning control (DLC) is proposed using information on the (vector) relative degree of the multi-input multi-output system. Existing DLC methods are observed to apply only to a limited class of systems with relative degree one, and a new DLC law is suggested that can be applied to systems with higher relative degree. Using the proposed control law, the control input corresponding to a new desired output trajectory is synthesized directly from the control inputs obtained in the learning process for other output trajectories. To show the validity and performance of the proposed DLC, simulations are performed for trajectory tracking control of a two-axis SCARA robot.

A study on the performance improvement of the quality prediction neural network of injection molded products reflecting the process conditions and quality characteristics of molded products by process step based on multi-tasking learning structure (다중 작업 학습 구조 기반 공정단계별 공정조건 및 성형품의 품질 특성을 반영한 사출성형품 품질 예측 신경망의 성능 개선에 대한 연구)

  • Hyo-Eun Lee;Jun-Han Lee;Jong-Sun Kim;Gu-Young Cho
    • Design & Manufacturing / v.17 no.4 / pp.72-78 / 2023
  • Injection molding is a plastic manufacturing process widely used in various industries because of its high production speed and ease of mass production; the product is molded by injecting molten plastic into a mold at high speed and pressure. Since process conditions such as resin and mold temperature mutually affect the process and the quality of the molded product, it is difficult to predict quality accurately through mathematical or statistical methods. Recently, studies that predict the quality of injection molded products with artificial neural networks, which are known to be very useful for analyzing nonlinear problems, have been actively underway. In this study, structural optimization of the neural network was conducted by applying multi-task learning techniques according to the characteristics of the network's input and output parameters. A structure reflecting the characteristics of each process step was applied to the input parameters, and a structure reflecting the quality characteristics of the injection molded part was applied to the output parameters using multi-task learning. We built an artificial neural network that predicts three qualities (mass, diameter, height) of an injection-molded product from six process conditions (melt temperature, mold temperature, injection speed, packing pressure, packing time, cooling time) and compared its performance with the existing neural network, observing improvements in prediction accuracy for mass, diameter, and height of approximately 69.38%, 24.87%, and 39.87%, respectively.
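
To make the multi-task structure in this abstract more concrete, here is a minimal sketch of a hard-parameter-sharing network: one shared layer over the six process conditions, followed by a separate head for each quality (mass, diameter, height). The layer sizes, the ReLU activation, the random initialization, and the names `MultiTaskNet` and `multitask_loss` are all illustrative assumptions; the abstract does not specify the authors' architecture, and the per-process-step input grouping it mentions is not modelled here.

```python
import random


def dense_relu(inputs, weights, biases):
    """Fully connected layer followed by ReLU."""
    return [
        max(0.0, sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]


def init_matrix(rng, rows, cols):
    """Small random weight matrix (rows x cols)."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


class MultiTaskNet:
    TASKS = ("mass", "diameter", "height")

    def __init__(self, n_inputs=6, n_shared=8, n_head=4, seed=0):
        rng = random.Random(seed)
        # Shared trunk: learns features common to all three quality targets.
        self.shared_w = init_matrix(rng, n_shared, n_inputs)
        self.shared_b = [0.0] * n_shared
        # One small task-specific head per quality characteristic.
        self.heads = {
            task: {
                "w1": init_matrix(rng, n_head, n_shared),
                "b1": [0.0] * n_head,
                "w2": [rng.uniform(-0.5, 0.5) for _ in range(n_head)],
                "b2": 0.0,
            }
            for task in self.TASKS
        }

    def forward(self, x):
        """Return one scalar prediction per task for a (normalized) input vector."""
        shared = dense_relu(x, self.shared_w, self.shared_b)
        out = {}
        for task, p in self.heads.items():
            hidden = dense_relu(shared, p["w1"], p["b1"])
            out[task] = sum(w * h for w, h in zip(p["w2"], hidden)) + p["b2"]
        return out


def multitask_loss(pred, target, task_weights=None):
    """Weighted sum of per-task squared errors; equal weights by default."""
    task_weights = task_weights or {task: 1.0 for task in pred}
    return sum(task_weights[t] * (pred[t] - target[t]) ** 2 for t in pred)
```

Training would minimize `multitask_loss` over all three targets at once, so the shared layer is updated by gradients from every task while each head specializes on its own quality.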

Multi-Object Goal Visual Navigation Based on Multimodal Context Fusion (멀티모달 맥락정보 융합에 기초한 다중 물체 목표 시각적 탐색 이동)

  • Jeong Hyun Choi;In Cheol Kim
    • KIPS Transactions on Software and Data Engineering / v.12 no.9 / pp.407-418 / 2023
  • Multi-Object Goal Visual Navigation (MultiOn) is a visual navigation task in which an agent must visit multiple object goals in an unknown indoor environment in a given order. Existing models for the MultiOn task are limited in that they cannot exploit an integrated view of multimodal context because they use only a unimodal context map. To overcome this limitation, we propose a novel deep neural network-based agent model for the MultiOn task. The proposed model, MCFMO, uses a multimodal context map containing visual appearance features, semantic features of environmental objects, and goal object features. Moreover, the proposed model effectively fuses these three heterogeneous features into a global multimodal context map by using a point-wise convolutional neural network module. Lastly, the proposed model adopts an auxiliary task learning module that predicts the observation status, goal direction, and goal distance, which helps the agent learn the navigation policy efficiently. Through various quantitative and qualitative experiments using the Habitat-Matterport3D simulation environment and scene dataset, we demonstrate the superiority of the proposed model.
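
The point-wise fusion step mentioned in this abstract can be illustrated with a small sketch. A point-wise (1x1) convolution applies the same linear map to the channel vector at every map cell, so fusing several feature maps amounts to concatenating their channels per cell and mixing them. The function name `fuse_pointwise`, the nested-list map layout, and the ReLU activation are assumptions made for illustration; this is not the MCFMO implementation.

```python
def fuse_pointwise(maps, weights, biases):
    """Fuse several feature maps cell by cell with a 1x1 convolution.

    maps:    list of feature maps with identical height and width,
             each indexed as feature_map[row][col] -> list of channel values.
    weights: out_channels x total_in_channels matrix shared by every cell.
    biases:  one bias per output channel.
    """
    height, width = len(maps[0]), len(maps[0][0])
    fused = []
    for r in range(height):
        fused_row = []
        for c in range(width):
            # Concatenate the channels of all modalities at this cell.
            cell = [v for feature_map in maps for v in feature_map[r][c]]
            # Same weights at every cell: this is what makes it point-wise.
            fused_row.append([
                max(0.0, sum(w * v for w, v in zip(w_row, cell)) + b)
                for w_row, b in zip(weights, biases)
            ])
        fused.append(fused_row)
    return fused
```

For example, three single-channel 1x2 maps (standing in for visual, semantic, and goal features) fused with `weights=[[1.0, 1.0, 1.0]]` and `biases=[0.0]` simply sum the three modalities at each cell.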

The Effect of Worker Heterogeneity in Learning and Forgetting on System Productivity (학습과 망각에 대한 작업자들의 이질성 정도가 시스템 생산성에 미치는 영향)

  • Kim, Sungsu
    • Journal of the Korean Operations Research and Management Science Society / v.40 no.4 / pp.145-156 / 2015
  • Incorporating individual learning and forgetting behaviors into worker-task assignment models produces a mixed integer nonlinear program (MINLP), which is NP-hard and difficult to solve because of the nonlinearity of its objective function. Previous studies of workforce scheduling that account for learning and forgetting commonly assume homogeneity among workers. This paper extends previous research by considering heterogeneous individual learning and forgetting, and investigates the impact of worker heterogeneity in initial expertise, steady-state productivity, learning, and forgetting on system performance, in order to assist managers' decision-making in worker-task assignments without tackling complex MINLP models. To understand the performance implications of workforce heterogeneity, this paper examines analytically how heterogeneity in each of the four parameters of the exponential learning and forgetting (L/F) model affects system performance in three cases: consecutive assignments with no break, n breaks of length s each, and a total of b break periods occurring over T periods. The study presents the direction of change in worker performance under different assignment schedules as the variance in initial expertise, steady-state productivity, learning, or forgetting increases. It thus indicates whether a more heterogeneous workforce, in terms of each of the four L/F model parameters, is desirable under different schedules from the perspective of system productivity.

Light-weight Gender Classification and Age Estimation based on Ensemble Multi-tasking Deep Learning (앙상블 멀티태스킹 딥러닝 기반 경량 성별 분류 및 나이별 추정)

  • Huy Tran, Quoc Bao;Park, JongHyeon;Chung, SunTae
    • Journal of Korea Multimedia Society / v.25 no.1 / pp.39-51 / 2022
  • Image-based gender classification and age estimation of humans are classic problems in computer vision. Most research in this field focuses on only one of the two tasks, and most of the reported methods for each task focus on accuracy and are not computationally light. Thus, running both tasks simultaneously on low-cost mobile or embedded systems with limited CPU speed and memory capacity is practically prohibitive. In this paper, we propose a novel light-weight gender classification and age estimation method based on ensemble multitasking deep learning with a light-weight neural network architecture, which processes both gender classification and age estimation simultaneously and in real time, even on embedded systems. Experiments on various well-known datasets show that the proposed method performs comparably to state-of-the-art gender classification and/or age estimation methods with respect to accuracy and runs fast enough (14 fps on average) on a Jetson Nano embedded board.

Korean Ironic Expression Detector (한국어 반어 표현 탐지기)

  • Seung Ju Bang;Yo-Han Park;Jee Eun Kim;Kong Joo Lee
    • The Transactions of the Korea Information Processing Society / v.13 no.3 / pp.148-155 / 2024
  • Despite the increasing importance of irony and sarcasm detection in the field of natural language processing, research on the Korean language is relatively scarce compared to other languages. This study aims to experiment with various models for irony detection in Korean text. The study conducted irony detection experiments using KoBERT, a BERT-based model, and ChatGPT. For KoBERT, two methods of additional training on sentiment data were applied (Transfer Learning and MultiTask Learning). Additionally, for ChatGPT, the Few-Shot Learning technique was applied by increasing the number of example sentences entered as prompts. The results of the experiments showed that the Transfer Learning and MultiTask Learning models, which were trained with additional sentiment data, outperformed the baseline model without additional sentiment data. On the other hand, ChatGPT exhibited significantly lower performance compared to KoBERT, and increasing the number of example sentences did not lead to a noticeable improvement in performance. In conclusion, this study suggests that a model based on KoBERT is more suitable for irony detection than ChatGPT, and it highlights the potential contribution of additional training on sentiment data to improve irony detection performance.