• Title/Summary/Keyword: Markov feature


Policy Modeling for Efficient Reinforcement Learning in Adversarial Multi-Agent Environments (적대적 멀티 에이전트 환경에서 효율적인 강화 학습을 위한 정책 모델링)

  • Kwon, Ki-Duk;Kim, In-Cheol
    • Journal of KIISE:Software and Applications / v.35 no.3 / pp.179-188 / 2008
  • An important issue in multiagent reinforcement learning is how an agent should learn its optimal policy through trial-and-error interactions in a dynamic environment that contains other agents able to influence its performance. Most previous work on multiagent reinforcement learning either applies single-agent reinforcement learning techniques without any extensions or, even when it builds and uses explicit models of other agents, relies on unrealistic assumptions. In this paper, the basic concepts that form the common foundation of multiagent reinforcement learning techniques are first formulated, and then, based on these concepts, previous works are compared in terms of their characteristics and limitations. After that, a policy model of the opponent agent and a new multiagent reinforcement learning method using this model are introduced. Unlike previous works, the proposed method utilizes a policy model of the opponent agent instead of a Q-function model. Moreover, this learning method can improve learning efficiency by using a simpler policy model than richer but more time-consuming ones such as Finite State Machines (FSM) and Markov chains. In this paper, the Cat and Mouse game is introduced as an adversarial multiagent environment, and the effectiveness of the proposed method is analyzed through experiments using this game as a testbed.
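
The abstract does not say how the opponent's policy model is represented. As a minimal sketch only, one simple possibility is an empirical frequency table of the opponent's actions per state, used to average a joint-action Q table. The class `OpponentPolicyModel`, the function `expected_value`, and the state and action labels are all hypothetical names chosen for illustration, not the authors' method.

```python
from collections import defaultdict


class OpponentPolicyModel:
    """Hypothetical policy model: estimates P(opponent action | state) from counts."""

    def __init__(self, actions):
        self.actions = list(actions)
        # One counter per state, created lazily with all counts at zero.
        self.counts = defaultdict(lambda: {a: 0 for a in self.actions})

    def observe(self, state, opp_action):
        """Record one observed opponent action in the given state."""
        self.counts[state][opp_action] += 1

    def prob(self, state, opp_action):
        """Return the estimated probability; uniform if the state is unseen."""
        seen = self.counts[state]
        total = sum(seen.values())
        if total == 0:
            return 1.0 / len(self.actions)
        return seen[opp_action] / total


def expected_value(q, model, state, my_action):
    """Average a joint-action Q table over the modeled opponent policy.

    q maps (state, my_action, opp_action) to a value; missing entries count as 0.
    """
    return sum(
        model.prob(state, opp) * q.get((state, my_action, opp), 0.0)
        for opp in model.actions
    )
```

A learner built this way would pick the action with the highest `expected_value` in each state while continuing to call `observe` after every opponent move.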

Automatic speech recognition using acoustic doppler signal (초음파 도플러를 이용한 음성 인식)

  • Lee, Ki-Seung
    • The Journal of the Acoustical Society of Korea / v.35 no.1 / pp.74-82 / 2016
  • In this paper, a new automatic speech recognition (ASR) method is proposed in which ultrasonic Doppler signals are used instead of conventional speech signals. The proposed method has advantages over conventional speech/non-speech-based ASR, including robustness against acoustic noise and user comfort associated with the use of a non-contact sensor. In the proposed method, a 40 kHz ultrasonic signal was radiated toward the mouth and the reflected ultrasonic signals were then received. The frequency shift caused by the Doppler effect was used to implement ASR. The proposed method employed multi-channel ultrasonic signals acquired from various locations, unlike the previous method, which employed a single-channel ultrasonic signal. PCA (Principal Component Analysis) coefficients were used as the ASR features, and a left-right hidden Markov model (HMM) was adopted. To verify the feasibility of the proposed ASR, a speech recognition experiment was carried out on 60 Korean isolated words obtained from six speakers. The experimental results showed that the overall word recognition rates were comparable with those of conventional speech-based ASR methods and that the performance of the proposed method was superior to that of the conventional single-channel ASR method. In particular, an average recognition rate of 90 % was maintained under noisy conditions.
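
To illustrate what a left-right HMM topology means, the sketch below builds a transition matrix in which each state may only stay where it is or advance to the next state, with the final state absorbing. The function name `left_right_transitions` and the `stay_prob` value are assumptions made for illustration; the abstract does not report the number of states or any transition probabilities.

```python
def left_right_transitions(n_states, stay_prob=0.6):
    """Build a left-right (Bakis-style) HMM transition matrix as nested lists.

    State i keeps probability stay_prob on itself and puts the rest on state i + 1.
    The last state is absorbing. stay_prob is an illustrative value only.
    """
    if n_states < 1:
        raise ValueError("n_states must be at least 1")
    if not 0.0 <= stay_prob <= 1.0:
        raise ValueError("stay_prob must lie in [0, 1]")

    matrix = [[0.0] * n_states for _ in range(n_states)]
    for i in range(n_states):
        if i == n_states - 1:
            matrix[i][i] = 1.0  # final state: no forward move is possible
        else:
            matrix[i][i] = stay_prob
            matrix[i][i + 1] = 1.0 - stay_prob
    return matrix
```

Because every entry below the diagonal is zero, the model can never move back to an earlier state, which suits isolated-word recognition where the sound sequence unfolds in one direction.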

A Study on the Comovements and Structural Changes of Global Business Cycles using MS-VAR models (MS-VAR 모형을 이용한 글로벌 경기변동의 동조화 및 구조적 변화에 대한 연구)

  • Lee, Kyung-Hee;Kim, Kyung-Soo
    • Management & Information Systems Review / v.35 no.3 / pp.1-22 / 2016
  • We analyzed the international comovements and structural changes in quarterly real GDP using the Markov-switching vector autoregressive (MS-VAR) model from 1971(1) to 2016(1). The main results of this study are as follows. First, the MS-VAR models captured the business cycle phenomenon that occurs in the models or in the individual real GDP time series. Unlike previous studies, this study showed significant comovements, asymmetry, and structural changes in the MS-VAR model using real GDP across countries. Second, although there were partial differences, the MS-VAR model identified remarkable structural changes in the economic contraction regime (recession), such as 1988(2), ending the global oil shock crisis, and 2007(3), starting the global financial crisis. Third, large-scale structural changes were generated in the economic expansion and/or contraction regimes simultaneously among countries. We found that the second world oil shocks, which occurred after the first global oil shocks of 1973 and 1974, were the main cause of the large-scale comovements of international real GDP among countries. In addition, the spillover between Korea and 5 countries was weak during the Asian currency crisis from 1997 to 1999, but there was strong transmission between Korea and 5 countries at the end of 2007, including the period of the global financial crisis. Fourth, the simultaneous correlation appeared to be high due to the country-specific shocks generated for each country under regime switching using real GDP since 1973. Thus, we confirmed that the conclusions were consistent with much of the available theoretical and empirical evidence, and that the macroeconomic changes were mainly caused by global shocks over the past 30 years.
This study found that the global business cycles were due to large-scale asymmetric shocks in addition to general changes, and then showed the main international comovements and/or structural changes through country-specific shocks.
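
As a rough illustration of the regime-switching idea behind MS-VAR, the sketch below treats the expansion and contraction regimes as a two-state Markov chain and derives the expected duration of each regime and the long-run share of time spent in expansion. The function `regime_summary` and the probabilities in the usage note are hypothetical; none of these numbers come from the study.

```python
def regime_summary(p_stay_expansion, p_stay_recession):
    """Summarize a two-regime Markov chain from its two staying probabilities.

    Expected duration of a regime is 1 / (1 - p_stay).
    The stationary share of expansion follows from balancing the flows
    between the two regimes: pi_exp * leave_exp = pi_rec * leave_rec.
    """
    for p in (p_stay_expansion, p_stay_recession):
        if not 0.0 <= p < 1.0:
            raise ValueError("staying probabilities must lie in [0, 1)")

    leave_expansion = 1.0 - p_stay_expansion
    leave_recession = 1.0 - p_stay_recession
    stationary_expansion = leave_recession / (leave_expansion + leave_recession)

    return {
        "expansion_duration": 1.0 / leave_expansion,
        "recession_duration": 1.0 / leave_recession,
        "stationary_expansion": stationary_expansion,
        "stationary_recession": 1.0 - stationary_expansion,
    }
```

For example, with assumed quarterly staying probabilities of 0.9 for expansion and 0.75 for recession, `regime_summary(0.9, 0.75)` gives expected durations of about 10 and 4 quarters, which shows how asymmetric transition probabilities translate into long expansions and short recessions.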
