• Title/Summary/Keyword: Simulation-based Policy Improvement

Search Result 43, Processing Time 0.02 seconds

A Simulation Sample Accumulation Method for Efficient Simulation-based Policy Improvement in Markov Decision Process (마르코프 결정 과정에서 시뮬레이션 기반 정책 개선의 효율성 향상을 위한 시뮬레이션 샘플 누적 방법 연구)

  • Huang, Xi-Lang;Choi, Seon Han
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.7
    • /
    • pp.830-839
    • /
    • 2020
  • As a popular mathematical framework for modeling decision making, Markov decision process (MDP) has been widely used to solve problem in many engineering fields. MDP consists of a set of discrete states, a finite set of actions, and rewards received after reaching a new state by taking action from the previous state. The objective of MDP is to find an optimal policy, that is, to find the best action to be taken in each state to maximize the expected discounted reward of policy (EDR). In practice, MDP is typically unknown, so simulation-based policy improvement (SBPI), which improves a given base policy sequentially by selecting the best action in each state depending on rewards observed via simulation, can be a practical way to find the optimal policy. However, the efficiency of SBPI is still a concern since many simulation samples are required to precisely estimate EDR for each action in each state. In this paper, we propose a method to select the best action accurately in each state using a small number of simulation samples, thereby improving the efficiency of SBPI. The proposed method accumulates the simulation samples observed in the previous states, so it is possible to precisely estimate EDR even with a small number of samples in the current state. The results of comparative experiments on the existing method demonstrate that the proposed method can improve the efficiency of SBPI.

A Study on the Effect of the Inventory Policy on Military Supply Chain Performance - Focused on System Dynamics - (재고정책에 따른 군 공급체인 성과에 관한 연구 - 시스템 다이나믹스를 중심으로 -)

  • 안병기;김태현;문성임
    • Journal of the military operations research society of Korea
    • /
    • v.28 no.2
    • /
    • pp.1-19
    • /
    • 2002
  • This study shows the effect of inventory policy change from supplier-based to customer-based. We focus on the service level, cost, and information distortion of the Military Supply Chain(MSC) with System Dynamics. We design MSC model according to field practician interviews by using Vensim. The simulation makes a comparison between supply-based inventory policy performances and order-based inventory policy performances. In order to evaluate the MSC performances, we measure the accumulation of backlog(service level), supply chain cost, and order percentage overshoot(information distortion). The results show that 1) changing inventory policy from supplier-based to end customer order-based gets a good customer service, reduces MSC cost, and prevents information distortion, 2) changing inventory policy from supplier-based to immediate customer order-based reduces a small amount of MSC cost and deteriorates customer service, and 3) supply level is main factor for MSC performances improvement. This study implicates the policy change makes a improvement of MSC performance without introducing information system.

A Study on the Improvement of Outpatient Process Using Simulation (시뮬레이션을 이용한 외래프로세스 개선방안에 관한 연구)

  • Choi, Hyun-Sook;Ji, Eun Hee;Kang, Sung-Hong
    • Journal of Digital Convergence
    • /
    • v.12 no.8
    • /
    • pp.377-387
    • /
    • 2014
  • The purpose of this study is to suggest improvement ways of outpatient process via a simulation model and to improve operational efficiency. Three experimentation scenarios were implemented into the simulation model to determine which proposed scenario provides better improvement in terms of the following performance measures: LOS(Length of Stay), patient waiting time, patient travel time, and staff utilization. The hospital medical data collection and statistical tools used to analyze the process mining tools. And the PIOS simulation tool was used and the validity of the model was verified by using t-test. The simulation results demonstrated that oupatient process of center type is most efficient. Simulation approach is a powerful technique that supports efficient decision-making compared to traditional healthcare management approach based on past experience, feelings, and intuition. Therefore, the proposed experimentation model has wide applicability in healthcare systems.

Field Service Engineer Replenishment Policy Assessment Using a Discrete-Event and Agent-Based Simulation Model : A Case Study (Discrete-event와 Agent 기반의 시뮬레이션을 이용한 현장 서비스 요원 보급 정책 평가 사례 연구)

  • Suh, Eun Suk
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.41 no.6
    • /
    • pp.588-598
    • /
    • 2015
  • In this paper, a simulation model for assessing the impact of alternative field service engineer replenishment policies is introduced. The end-to-end supply chain simulation model is created using a discrete-event and agent-based simulation model, which enables accurate description of key individual entities in the investigated supply chain, such as field service engineers. Once the model is validated with the historical data, it is used to assess the impacts of field service engineer replenishment policies for a major printing equipment manufacturing firm.In the case study, newly proposed replenishment policies for post-sale distribution supply chain are assessed for the level of service improvement to end customers.

Improvement of AS/RS Class-based Storage Policy by Common Zone Allocation (공동영역의 설정에 의한 AS/RS의 등급별 저장정책 개선 방안 연구)

  • 문기주;김광필
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.24 no.3
    • /
    • pp.39-47
    • /
    • 1999
  • It has been concluded that the performance of class-based storage policy is better than the performance of random storage policy in the literature. However, the rack shortage problem assigned to the 1st class items makes the decision hard to apply the class-based storage policy in practice. In this paper, a new common zone concept is introduced between two classes to resolve the problem with class-based storage policy. The common zone is the area to accept items from both classes. An AS/RS model is developed for computer simulation study and the effect of common area sizes with various AS/RS operation conditions is analyzed.

  • PDF

An Analysis of Operating Policies for Multi-Product Unit Load AS/RS (다품종 단위적재 자동창고 시스템의 운영정책 분석)

  • Park, Yang-Byeong
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.15 no.1
    • /
    • pp.1-15
    • /
    • 1989
  • In the past few years, increasing numbers of automatic storage/retrieval system (AS/RS) using computer controlled storage/retrieval machine have been installed. This paper introduces two modeling approaches to determine the best operating policy for AS/RS : an M/G/1 queueing model and a computer simulation model. The operating policy consists of three elements. : the operation command cycle, the storage location method, and the operation dispatching rule. The analysis based on M/G/1 model is suitable for a quick and approximate evaluation, due to its inherent strict assumptions. The computer simulation can be used to perform a more realistic analysis. It is shown through the study that a significant improvement in the throughput and/or the space requirement can be expected by determining the best operating policy to a particular system. Most important, the computer simulation demonstrates its powerful capability in evaluating dynamic stochatic systems with imperfect information.

  • PDF

Integrated Multiple Simulation for Optimizing Performance of Stock Trading Systems based on Neural Networks (통합 다중 시뮬레이션에 의한 신경망 기반 주식 거래 시스템의 성능 최적화)

  • Lee, Jae-Won;O, Jang-Min
    • The KIPS Transactions:PartB
    • /
    • v.14B no.2
    • /
    • pp.127-134
    • /
    • 2007
  • There are many researches about the intelligent stock trading systems with the help of the advance of the artificial intelligence such as machine learning techniques, Though the establishment of the reasonable trading policy plays an important role in the performance of the trading systems most researches focused on the improvement of the predictability. Also some previous works, which treated the trading policy, treated the simplified versions dependent on the predictors in less systematic ways. In this paper, we propose the integrated multiple simulation' as a method of optimizing trading performance of stock trading systems. The propose method is adopted in the NXShell a development environment for neural network based stock trading systems. Under the proposed integrated multiple simulation', we simulate the multiple tradings for all combinations of the neural network's outputs and the trading policy parameters, evaluate the learning performance according to the various metrics and establish the optimal policy for a given prediction module based on the resulting performance. In the experiment, we present the trading policy comparison results using the stock value data from the KOSPI and KOSDAQ.

A Study on Control Scheme for Fairness Improvement of Assuared Forwarding Services in Differentiated Service Network (DiffServ 망에서 AF 서비스의 공평성 향상을 위한 제어 기법)

  • Kim, Byun-gon;Jeong, Dong-su
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2015.05a
    • /
    • pp.649-652
    • /
    • 2015
  • Previous marking policy for the AF service of TCP traffic in the Diffserv network have no sufficient consideration on the effect of RTT and target rate. In this paper, in order to improve fairness Index by the effect RTT difference of TCP traffic, we propose the modified TSW3CDM(Time Sliding Window Three Color Dynamic Marker) based on average transfer rate estimation and the flow state. The proposed algorithm is dynamic marking policy that do allocate band width in proportion to transmission rate. To evaluate the performance of the proposed algorithm, We accomplished a computer simulation using NS-2. From simulation results, the proposed TSW3CDM algorithm improves fairness index by comparison with TSW3CM.

  • PDF

Reducing the frequency of processor thrashing using guarantee/reservation in process migration (작업 이주시 보장/예약 기법을 이용한 프로세서 쓰레싱 빈도 감소)

  • Lee, Jun-Yeon;Im, Jae-Hyeon
    • The KIPS Transactions:PartA
    • /
    • v.8A no.2
    • /
    • pp.133-146
    • /
    • 2001
  • In a dynamic load distribution policies, each node gathers the current system sates information before making a decision on load balancing. Load balancing policies based on this strategy can suffer from processor thrashing. In this paper, we propose a new algorithm which attempts to decrease the frequency of the processor thrashing, the algorithm is based on the integration of three components. The first, the algorithm of which determine the size of jobs be transferred. The second, negotiation protocol with obtains a mutual agreement between a sender and a receiver on the transferring job size. And the third, a symmetrically-initiated location policy. The algorithm proposed in this paper used Siman IV as simulation tool to prove the improvement of performance. I analyzed the result of simulation, and compared with related works. The mean response time shows that there are no difference with existing policy, but appear a outstanding improvement in high load. The thrashing coefficient that shows the average response time, CPU overhead and the thrashing ratio at both the receiving and sending node has been used in the analysis. A significant improvement in the average response time and the CPU overhead ratio was detected using our algorithm when an overhead occurred in the system over other algorithm. The thrashing coefficient differed in the sending node and the receiving node of the system. Using our algorithm, the thrashing coefficient at the sending node showed more improvement when there was an overhead in the system, proving to be more useful. Therefore, it can be concluded that the thrashing ratio can be reduce by properly setting the maximum and minimum value of the system’s threshold queue.

  • PDF

The Supply and Demand Projection of Nurses in Korea (2010년까지의 간호사 인력 수요 및 공급 추계)

  • 박현애;최영희;이선자
    • Health Policy and Management
    • /
    • v.3 no.1
    • /
    • pp.146-168
    • /
    • 1993
  • The study was conducted to project supply and demand of the nurses till year 2010 based on analysis of supply and demand of nurses up to year 1991. Results of the study will provide invaluable information for nurses manpower planning as well as overall health manpower planning for the 21th century. It is projected that nurses will be oversupplied based on the current prductivity which is undesirable situation if the quality of care is considered, and undersupplied based on the the medical law as well as optimal productivity. Thus, it is desirable to increase active supply of nurses. One of the ways of increasing active supply would be increasing the size of training and education. But, considering low employment rate of nurses which is about 59% better way of solving problems related to nurses shortage would be improvement in nurses' employment rate. According to simulation study done as part of this study, if nurses' employment rate goes up to 80%, there is no need for increasing the size of training to meet the demand at the level of medical law.

  • PDF