통합 검색 | Korea Science

Visual Analysis of Deep Q-network

Seng, Dewen;Zhang, Jiaming;Shi, Xiaoying
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제15권3호
- /
- pp.853-873
- /
- 2021
In recent years, deep reinforcement learning (DRL) models are enjoying great interest as their success in a variety of challenging tasks. Deep Q-Network (DQN) is a widely used deep reinforcement learning model, which trains an intelligent agent that executes optimal actions while interacting with an environment. This model is well known for its ability to surpass skilled human players across many Atari 2600 games. Although DQN has achieved excellent performance in practice, there lacks a clear understanding of why the model works. In this paper, we present a visual analytics system for understanding deep Q-network in a non-blind matter. Based on the stored data generated from the training and testing process, four coordinated views are designed to expose the internal execution mechanism of DQN from different perspectives. We report the system performance and demonstrate its effectiveness through two case studies. By using our system, users can learn the relationship between states and Q-values, the function of convolutional layers, the strategies learned by DQN and the rationality of decisions made by the agent.
https://doi.org/10.3837/tiis.2021.03.003 인용 PDF KSCI HTML

칼만 필터를 이용한 도시철도 열차 정위치 정차에 관한 연구 (A Study on the Accurate Stopping Control of a Train for the Urban Rail Transit Using Kalman Filter)

김정태;이재호;김무선;박철홍
- 한국산학기술학회논문지
- /
- 제17권11호
- /
- pp.655-662
- /
- 2016
역사에 스크린도어의 설치가 보편화 되면서 도시철도에서 열차가 정차지점에 정확히정차하도록 하는 기술의 중요성이 더욱 커지게 되었다. 이를 위한 기술로써 이론적으로는 여러 가지 제어 방안이 제안되었으나 실제 구현 상의 어려움으로 인하여 국내 열차에서는 대부분 PID 제어와 같은 고전 제어기법을 사용한다. PID 제어는 비교적 간단하게 정확한 제어를 수행할 수 있다는 장점이 있다. 그러나 열차 시스템에서는 제어 주기가 비교적 길고 제동기의 지연 시간이 있으며 위치 및 속도 검지 과정에서의 오차 등으로 인하여 PID 제어로는 만족할 만한 결과를 얻지못하는 경우가 있으며 또한 적절한 계수를 찾는데 많은 시간이 필요하다. 본 논문에서는 상태나 신호 추정에 사용되는 칼만 필터를 활용하여 제어를 수행하는 방안을 제안한다. 칼만 필터를 통해 제동기가 동작할 시점의 상태를 예측하고 이에 맞는 제동 입력을 구하여 제어를 수행한다. 제동기 특성 및 제어 주기를 반영하여 열차 모델링과 정위치 정차제어 시뮬레이션을 수행하였으며 정차 오차 분석을 통해 제안하는 방안의 우수성을 보인다.
https://doi.org/10.5762/KAIS.2016.17.11.655 인용 PDF KSCI

The Importance of the University Education System for Trade Workforce: the Person-Organization Fit Perspective

Kim, So Yeon;Moretti, Raul
- 무역상무연구
- /
- 제56권
- /
- pp.57-76
- /
- 2012
As globalization accelerates and the trade environment rapidly changes, a more prepared trade workforce is required for the business world. The trade department is an important educational institution that educates and trains future trade talent. Thus, in the evolving trade environment, their role has significant implications for the economic growth of Korea. The present paper is a comparative study of the importance of the university education system in terms of the trade work force through the Person-Organization fit (P-O fit) perspective. We observe that the American trade education system is more oriented to meet and reflect the needs from the real world. It contains a support structure through various governmental organizations that offer financial incentives as well as educational opportunities through internships and other hands-on experiences. The trade education systems in universities in Korea are still static by comparison and are recommended to implement changes that will give students more practical exposure to trade. The governmental and business sectors are also encouraged to support universities in achieving these goals through building a strong cooperative network with university trade departments.
PDF

전차선 전압제약조건을 고려한 AT 급전계통 해석 (Analysis of AT Feeding Systems considering the Voltage Constraint Conditions of the Catenary.)

김백;정광우
- 한국철도학회:학술대회논문집
- /
- 한국철도학회 2005년도 춘계학술대회 논문집
- /
- pp.27-33
- /
- 2005
Constant load model is generally used for an electric train to perform the static analysis of AT feeding systems. In this model, the train will be considered as a constant load model when it drives or as a constant source model when it applies regenerative brake. However there must be some voltage constraints on the catenary in actual operations. These constraints are established for the reason of protecting the feeding facilities from excessive rise of regenerative braking voltage or guaranteeing the minimum traction power of train. In normal operating situation, the pantagraph voltage of the train should be maintained within these limits. Keeping these facts in minds, we suggest new methods of analyzing AT feeding systems using the constant power models with the conditions of voltage constraints. The simulation results from a sample system using the proposed method illustrate both the states of system variables and the supply-demand relation of power among the trains and the systems very clearly, so it is believed that the proposed method yields more accurate results than conventional methods do. The proposed methods are believed to contribute to the assessment of TCR-TSC for compensating reactive powers too.
PDF

팬타그래프 전압제약조건을 고려한 AT급전계통 해석 (Analysis Of AT Feeding Systems Considering The Voltage Constraint Conditions Of The Pantagraph)

문영현;김백
- 한국철도학회논문집
- /
- 제9권6호
- /
- pp.652-656
- /
- 2006
Constant load model is widely used for an electric train to perform the static analysis of AT (Auto Transformer) feeding systems. In this model, the train will be considered as a constant load model when it drives or as a constant source model when it applies regenerative brake. However there must be some constraints imposed on the pantagraph voltage in actual operations. These constraints are established for the reason of protecting the feeding facilities from excessive rise of regenerative braking voltage or guaranteeing the minimum traction power of train. In normal operating situation, the pantagraph voltage of the train should be maintained within these limits. Keeping these facts in minds, we suggest new methods or analyzing AT feeding systems using the constant power models with the conditions of voltage constraints. The simulation results from a sample system using the proposed method illustrate both the states of system variables and the supply-demand relation of power among the trains and the systems very clearly, so it is believed that the proposed method yields more accurate results than conventional methods do. The proposed methods are believed to contribute to the assessment of TCR-TSC for compensating reactive powers too.
PDF KSCI

소형 및 저비용화를 위한 전자석-스프링 구동장치 연구 (A Study on Electromagnetic-Spring Actuator for Low Cost Miniature Actuators)

김세웅;이창섭;최현영
- 한국군사과학기술학회지
- /
- 제22권3호
- /
- pp.392-400
- /
- 2019
This paper provides a fin actuation system of missile based on electromagnetic-spring mechanism to miniaturize the system and lower the cost. Compared with proportional electro-mechanical actuators, the output of Electromagnetic-Spring Actuators(EMSA) has two or three discrete states, but the mechanical configuration of EMSA is simple since it does not need power trains like gears. The simple mechanism of EMSA makes it easy to build small size, low cost, and relatively high torque actuators. However, fast response time is required to improve the dynamic performance and accuracy of missiles since bang-off-bang operation of EMSA affects the flight performance of missile. In this paper the development of EMSA including parameter optimization and mathematical modeling is described. The simulation results using Simulink and experimental test results of prototype EMSAs are presented.
https://doi.org/10.9766/KIMST.2019.22.3.392 인용 PDF KSCI HTML

중국 게임 산업의 발전을 위한 전문 인력 양성 방안 -중국내 대학의 게임 전문 인력 양성 커리큘럼 제안 (A plan for the Development of Chinese Game Industry -Curriculum Suggestions for Training Professionals in Chinese Universities)

정수;이동열
- 디지털융복합연구
- /
- 제18권3호
- /
- pp.325-332
- /
- 2020
중국 게임시장에서 개발 또는 시판되는 게임의 대부분은 외국에서 개발되어 서비스 되거나 혹은 국내(중국)에서 제작되어 외국 게임회사의 서비스로 시판되는 게임이 대부분이다. 중국이 세계적 게임 개발 대국이 되고자 한다면 여러 방법 중에 전문교육양성이 최우선이라고 생각한다. 특히 중국 의 대학교에서 전문적 게임학과가 꼭 있어야 한다고 본다. 본 논문에서 게임개발 대국 미국과 아시아 게임 개발 대국 한국의 대학교에 있는 게임디자인학과에 대해서 조사하여 게임학과 교육방식을 분석하고자 한다. 비교분석을 통해 중국 대학교에서 게임학과를 신설할 때 필요한 요소를 찾아 중국의 디지털미디어예술(數字媒體藝術)학과에서 게임분야의 고급 게임개발자 양성을 위한 교육 커리큘럼을 제시하고자 한다. 이 연구는 중국 게임교육에 대해 도움이 되면 향후에 석사과정과 박사과정 게임교육 커리큘럼을 연구를 진행할 예정이다.
https://doi.org/10.14400/JDC.2020.18.3.325 인용 PDF KSCI

단말간 직접 통신 네트워크를 위한 심층 강화학습 기반 분산적 스케쥴링 알고리즘 (A Distributed Scheduling Algorithm based on Deep Reinforcement Learning for Device-to-Device communication networks)

정무웅;김륜우;반태원
- 한국정보통신학회논문지
- /
- 제24권11호
- /
- pp.1500-1506
- /
- 2020
본 논문에서는 오버레이 단말 간 직접 (Device-to-Device : D2D) 통신 네트워크를 위한 강화학습 기반 스케쥴링 문제를 연구한다. 강화학습 모델 중 하나인 Q-learning을 이용한 D2D 통신 기술들이 연구되었지만, Q-learning은 상태와 행동의 개수가 증가함에 따라서 높은 복잡도를 유발한다. 이러한 문제를 해결하기 위하여 Deep Q Network (DQN) 기반 D2D 통신 기술들이 연구되었다. 본 논문에서는 무선 통신 시스템 특성을 고려한 DQN 모델을 디자인하고, 피드백 및 시그널링 오버헤드를 줄일 수 있는 DQN 기반 분산적 스케쥴링 방식을 제안한다. 제안 방식은 중앙집중식으로 변수들을 학습시키고, 최종 학습된 파라미터를 모든 단말들에게 전달한다. 모든 단말들은 최종 학습된 파라미터를 이용하여 각자의 행동을 개별적으로 결정한다. 제안 방식의 성능을 컴퓨터 시뮬레이션을 통하여 분석하고, 최적방식, 기회주의적 선택 방식, 전체 전송 방식과 비교한다.
https://doi.org/10.6109/jkiice.2020.24.11.1500 인용 PDF KSCI

군소 세포의 발화 형태에 미치는 열자극 효과 (The Effect of Heat on the Spiking Patterns of the Cells in Aplysia)

현남규
- 한국의학물리학회지:의학물리
- /
- 제18권2호
- /
- pp.73-80
- /
- 2007
제주 해안에서 잡은 60여 개의 군소를 대상으로 한 실험 데이터 중에서 5개의 세포에 대한 실험 결과를 분석해 보니 주목할 만한 결과들이 나왔다. 이들 5개의 세포들 중에 3개의 세포에서는 $32^{\circ}C$에서 $10^{\circ}C$까지 온도를 내리는 동안에 이완된 진동 상태, 연속발화상태 사이의 과분극 기간이 짧은 버스팅, 혼돈 양상의 버스팅, 주기 배가 양상, 연속발화 기간은 길고 휴지 기간은 짧은 버스팅, 일정하게 반복되는 비팅 상태이거나 타원형 버스팅, 휴지상태가 차례차례로 나타났으며, $10^{\circ}C$에서 $32^{\circ}C$까지 온도를 올리는 동안에는 그 반대 순서로 발화 형태가 변화하였다. 같은 온도 범위에서 80분 정도의 주기로 계속해서 열변화 자극을 줄 때마다 이러한 발화 형태 변화는 일정하게 나타났다. 그러나 나머지 두 세포의 경우에는 이와는 다른 발화 형태의 온도에 따른 변화를 보였다. 이들 경우에 이러한 열변화 자극을 24시간 이상 지속하였으나 발화 진폭은 크게 줄어들지 않았다. 그리고 평균발화 진동수, 버스팅 신호 사이의 평균 과분극상태 지속시간, 활동전위 진폭과 활동전위의 최저값 등을 $C^{++}$로 짠 프로그램을 실행시켜서 비교 분석 가능하였다. 온도에 따른 발화 진동수의 분포는 저온과 고온에서는 낮으나 중간 영역에서는 높은 종 모양의 형태를 보였는데, 온도를 내리는 동안에는 온도를 올리는 동안 보다 발화 진동수가 높았다.
PDF

확률적 모델예측제어를 이용한 물리기반 제어기 지도 학습 프레임워크 (A Supervised Learning Framework for Physics-based Controllers Using Stochastic Model Predictive Control)

한다성
- 한국컴퓨터그래픽스학회논문지
- /
- 제27권1호
- /
- pp.9-17
- /
- 2021
본 논문에서는 확률적 모델예측제어(model predictive control) 기법을 이용하여 예제 동작 데이터가 주어지면 물리 기반 시뮬레이션 환경에서 그 동작을 모방할 수 있는 캐릭터 동작 제어기를 빠르게 학습할 수 있는 간편한 지도 학습(supervised learning) 프레임워크를 제안한다. 제안된 프레임워크는 크게 학습 데이터 생성과 오프라인 학습의 두 컴포넌트로 구성된다. 첫번째 컴포넌트는 예제 동작 데이터가 주어지면 확률적 모델예측제어를 통해 그 동작 데이터를 추적하기 위한 최적 제어기를 캐릭터의 현재 상태로부터 시작하여 가까운 미래 상태까지의 시간 윈도우에 대해 주기적으로 업데이트하면서 그 최적 제어기를 통해 캐릭터의 동작을 확률적으로 제어한다. 이러한 주기적인 최적 제어기의 업데이트와 확률적 제어는 주어진 예제 동작 데이터를 모방하는 동안 캐릭터가 가질 수 있는 다양한 상태들을 효과적으로 탐색하게 하여 지도 학습에 유용한 학습 데이터를 수집할 수 있게 해준다. 이렇게 학습 데이터가 수집되면, 오프라인 학습 컴포넌트에서는 그 수집된 데이터를 정규화 시켜서 데이터에 내제된 크기와 단위의 차이를 조정하고 지도 학습을 통해 제어기를 위한 간단한 구조의 인공 신경망을 학습시킨다. 걷기 동작과 달리기 동작에 대한 실험은 본 논문에서 제안한 학습 프레임워크가 물리 기반 캐릭터 동작 제어기를 빠르고 효과적으로 생성할 수 있음을 보여준다.
https://doi.org/10.15701/kcgs.2021.27.1.9 인용 PDF KSCI

검색결과 21건 처리시간 0.028초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)