Integrated Search | Korea Science

Improved Deep Q-Network Algorithm Using Self-Imitation Learning

선우영민;이원창
- 전기전자학회논문지 / Vol. 25, No. 4 / pp.644-649 / 2021
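Self-Imitation Learning is a simple off-policy actor-critic algorithm that lets an agent exploit its past good experiences to find an optimal policy. Combined with reinforcement learning algorithms that have an actor-critic structure, it has shown substantial improvements across a variety of environments. However, even though Self-Imitation Learning greatly helps reinforcement learning, its application has been limited to reinforcement learning algorithms with an actor-critic architecture. In this paper, we propose a method for applying the Self-Imitation Learning algorithm to DQN, a value-based reinforcement learning algorithm, and train the resulting DQN algorithm in a variety of environments. By comparing the results with those of the existing algorithm, we show that Self-Imitation Learning can also be applied to DQN and can improve its performance.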
https://doi.org/10.7471/ikeee.2021.25.4.644
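
The abstract above does not spell out how self-imitation is attached to DQN. As a rough illustration only, the sketch below borrows the clipped-advantage idea from the original Self-Imitation Learning formulation and applies it to a Q-value: a stored transition contributes to the loss only when its observed discounted return exceeds the current estimate Q(s, a). The function names `discounted_returns` and `sil_q_loss` and the exact loss form are assumptions made for illustration, not the authors' method.

```python
from typing import List


def discounted_returns(rewards: List[float], gamma: float) -> List[float]:
    """Return the discounted return R_t for every step of one finished episode."""
    returns: List[float] = []
    running = 0.0
    # Walk backwards so that R_t = r_t + gamma * R_{t+1}.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def sil_q_loss(q_value: float, observed_return: float) -> float:
    """Hypothetical self-imitation loss for a value-based agent.

    Only transitions whose observed return beats the current Q estimate
    are imitated; all others contribute zero loss.
    """
    gap = max(observed_return - q_value, 0.0)
    return 0.5 * gap * gap


if __name__ == "__main__":
    episode_returns = discounted_returns([1.0, 0.0, 2.0], gamma=0.5)
    print(episode_returns)
    print(sil_q_loss(q_value=1.0, observed_return=episode_returns[0]))
```

In a full agent this term would be added to the usual temporal-difference loss and evaluated on transitions sampled from the replay buffer; how the two are weighted is left open here.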

Research Status on Machine Learning for Self-Organizing Network-II

권동승;나지현
- 전자통신동향분석 / Vol. 35, No. 4 / pp.115-134 / 2020
Several studies on machine learning (ML) based self-organizing networks (SONs) have been conducted, specifically for LTE, since studies applying ML to optimize mobile communication systems started with 2G. However, they are still in their infancy. Owing to the complicated KPIs and stringent user requirements of 5G, it is necessary to design the 5G SON engine with intelligence so that users can achieve seamless, unrestricted connectivity regardless of the state of the mobile communication network. Therefore, in this study, we analyze and summarize the current state of machine learning studies applied to SONs as solutions to the complicated optimization problems caused by the unpredictable context of mobile communication scenarios.
https://doi.org/10.22648/ETRI.2020.J.350411

Intelligent Agent System by Self Organizing Neural Network

Cho, Young-Im
- 제어로봇시스템학회:학술대회논문집 / 제어로봇시스템학회 2005년도 ICCAS / pp.1468-1473 / 2005
In this paper, I propose the INTelligent Agent System by Kohonen's Self Organizing Neural Network (INTAS). INTAS creates each user's profile from the information. Based on the profile, learning-community grouping suited to each individual is executed automatically using an unsupervised learning algorithm. In INTAS, grouping and learning are performed automatically in real time by multiagents, regardless of the number of learners. A new framework is proposed to generate multiagents, and a new negotiation mode between multiagents allows them to be executed efficiently.

The Effect of Problem Based Learning on Nursing Students' Interaction and Self-directed Learning: A Social Network Analysis

박미화;김정은
- Perspectives in Nursing Science / Vol. 13, No. 1 / pp.29-35 / 2016
Purpose: This study aimed to explore the underlying structures of students' interaction networks, to monitor network changes during the year, to verify the relationship with self-directed learning, and to identify the effect of problem-based learning on interaction and self-directed learning. Methods: A longitudinal study was designed that included 3 parts (A=25, B=27, C=26) with a total of 78 second-year nursing students from 2013 to 2014. The interaction indicators were group network centralization and density, and individual in-degree centrality. Results: Group network centralization showed mean reversion patterns; however, centralization and density showed a slight increase from 2013 to 2014 (Centralization of A part from 52.78 to 36.96, B part from 20.56 to 32.20, C part from 34.40 to 37.24; Density of A part from 0.122 to 0.123, B part from 0.111 to 0.121, C part from 0.109 to 0.121). Individual in-degree centrality was significantly correlated with self-directed learning, and the correlation coefficient increased during the year (r=.274 in 2013, r=.356 in 2014, p<.001). Conclusion: Students share information more interactively during the year, and the more they share, the higher their self-directed learning scores.
https://doi.org/10.16952/pns.2016.13.1.29
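
The abstract names density and in-degree centrality without defining them. As a hedged illustration, the sketch below uses the common textbook definitions for a directed network: density is the number of distinct ties divided by n(n-1), and normalized in-degree centrality is a node's incoming ties divided by n-1. Whether the study normalized its indicators this way is an assumption, and the three-student network is made up for the example.

```python
from typing import Dict, List, Tuple

Edge = Tuple[str, str]  # (source, target): source shares information with target


def density(nodes: List[str], edges: List[Edge]) -> float:
    """Share of possible directed ties that are actually present."""
    unique = {(u, v) for u, v in edges if u != v}  # ignore duplicates and self-loops
    n = len(nodes)
    return len(unique) / (n * (n - 1))


def in_degree_centrality(nodes: List[str], edges: List[Edge]) -> Dict[str, float]:
    """Incoming ties per node, normalized by the n - 1 possible senders."""
    unique = {(u, v) for u, v in edges if u != v}
    counts = {node: 0 for node in nodes}
    for _, target in unique:
        counts[target] += 1
    return {node: count / (len(nodes) - 1) for node, count in counts.items()}


if __name__ == "__main__":
    students = ["a", "b", "c"]  # hypothetical students
    ties = [("a", "b"), ("c", "b"), ("b", "a")]
    print(density(students, ties))
    print(in_degree_centrality(students, ties))
```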

The dynamics of self-organizing feature map with constant learning rate and binary reinforcement function

석진욱;조성원
- 제어로봇시스템학회논문지 / Vol. 2, No. 2 / pp.108-114 / 1996
We present proofs of the stability and convergence of the self-organizing feature map (SOFM) neural network with a time-invariant learning rate and a binary reinforcement function. One of the major problems in the self-organizing feature map neural network concerns the learning rate (the "Kalman filter" gain in the stochastic control field), which is a monotonically decreasing function that converges to 0 in order to satisfy the minimum variance property. In this paper, we show the stability and convergence of the self-organizing feature map neural network with a time-invariant learning rate. The analysis of the proposed algorithm shows that stability and convergence are guaranteed, with exponentially stable and weak convergence properties as well.

Identification of suspension systems using error self recurrent neural network and development of sliding mode controller

송광현;이창구;김성중
- 제어로봇시스템학회:학술대회논문집 / 제어로봇시스템학회 1997년도 한국자동제어학술회의논문집; 한국전력공사 서울연수원; 17-18 Oct. 1997 / pp.625-628 / 1997
In this paper, a new neural network and a sliding mode suspension controller are proposed. The neural network is an error self-recurrent neural network. For fast on-line learning, this paper uses the recursive least squares method. The new neural network converges considerably faster than the backpropagation algorithm and has the advantage of being less affected by poor initial weights and learning rate. The controller for suspension systems is designed according to the sliding mode technique, based on the newly proposed neural network.

Learning Control of Inverted Pendulum Using Neural Networks

이재강;김일환
- 산업기술연구 / Vol. 24, No. A / pp.99-107 / 2004
This paper considers reinforcement learning control with the self-organizing map. Reinforcement learning uses the observable states of the objective system and signals from the interaction between the system and the environment as input data. For fast learning in neural network training, it is necessary to reduce the learning data. In this paper, we use the self-organizing map to partition the observable states. Partitioning the states reduces the amount of learning data used for training the neural networks. The neural dynamic programming design method is used for the controller. To evaluate the designed reinforcement learning controller, an inverted pendulum on a cart system is simulated. The designed controller is composed of a serial connection of a self-organizing map and two multi-layer feed-forward neural networks.

A self creating and organizing neural network

최두일;박상희
- 제어로봇시스템학회:학술대회논문집 / 제어로봇시스템학회 1991년도 한국자동제어학술회의논문집(국내학술편); KOEX, Seoul; 22-24 Oct. 1991 / pp.768-772 / 1991
Self Creating and Organizing (SCO) is a new architecture and an unsupervised learning algorithm for artificial neural networks. SCO begins with only one output node, which has a sufficiently wide response range, and the response ranges of all the nodes decrease with time. The Self Creating and Organizing Neural Network (SCONN) decides automatically whether to adapt the weights of an existing node or to create a new node. It is compared with Kohonen's Self Organizing Feature Map (SOFM). The results show that SCONN has many advantages over other competitive learning architectures.

Stable Predictive Control of Chaotic Systems Using Self-Recurrent Wavelet Neural Network

Yoo Sung Jin;Park Jin Bae;Choi Yoon Ho
- International Journal of Control, Automation, and Systems / Vol. 3, No. 1 / pp.43-55 / 2005
In this paper, a predictive control method using a self-recurrent wavelet neural network (SRWNN) is proposed for chaotic systems. Since the SRWNN has a self-recurrent mother wavelet layer, it can well attract the complex nonlinear system even though it has fewer mother wavelet nodes than the wavelet neural network (WNN). Thus, the SRWNN is used as a model predictor for predicting the dynamic properties of chaotic systems. The gradient descent method with adaptive learning rates is applied to train the parameters of the SRWNN-based predictor and controller. The adaptive learning rates are derived from the discrete Lyapunov stability theorem and are used to guarantee the convergence of the predictive controller. Finally, chaotic systems are used to demonstrate the effectiveness of the proposed control strategy.

On the Clustering Networks using the Kohonen's Self-Organization Architecture

이지영
- 정보학연구 / Vol. 8, No. 1 / pp.119-124 / 2005
The learning procedure in a neural network consists of updating the weights between neurons. An inadequate initial learning coefficient causes excessive iterations of the learning process or incorrect learning results, and degrades learning efficiency. In this paper, an adaptive learning algorithm is proposed to increase the efficiency of the learning algorithms of Kohonen's Self-Organization Neural networks. The algorithm updates the weights adaptively while the learning procedure runs. To demonstrate its efficiency, the algorithm is tested on clustering of random weights. The results show a learning rate improved by about 42~55%, with fewer iterations and correct answers.
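
For orientation, the weight update that such work builds on can be sketched as follows. This is the standard Kohonen step with a fixed learning rate `lr` and a Gaussian neighborhood on a one-dimensional chain of units; it is not the adaptive scheme proposed in the paper, whose details the abstract does not give. The function names and the toy weights are assumptions made for illustration.

```python
import math
from typing import List

Vector = List[float]


def best_matching_unit(weights: List[Vector], x: Vector) -> int:
    """Index of the unit whose weight vector is closest to the input x."""
    distances = [sum((wi - xi) ** 2 for wi, xi in zip(w, x)) for w in weights]
    return distances.index(min(distances))


def som_step(weights: List[Vector], x: Vector, lr: float, sigma: float) -> List[Vector]:
    """One Kohonen update: pull every unit toward x, scaled by its
    Gaussian neighborhood distance from the best matching unit."""
    bmu = best_matching_unit(weights, x)
    updated: List[Vector] = []
    for j, w in enumerate(weights):
        h = math.exp(-((j - bmu) ** 2) / (2.0 * sigma ** 2))
        updated.append([wi + lr * h * (xi - wi) for wi, xi in zip(w, x)])
    return updated


if __name__ == "__main__":
    units = [[0.0, 0.0], [1.0, 1.0]]  # toy two-unit map
    print(som_step(units, [1.0, 1.0], lr=0.5, sigma=1.0))
```

An adaptive variant would replace the constant `lr` with a value recomputed during training, which is the part the abstract leaves unspecified.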

416 search results; processing time 0.033 s
