• 제목/요약/키워드: Data Set Records

검색결과 197건 처리시간 0.026초

디리슈레 혼합모형을 이용한 함정 전투체계 부품의 고장시간 분포 추정 (An Application of Dirichlet Mixture Model for Failure Time Density Estimation to Components of Naval Combat System)

  • 이진환;김정훈;정봉주;김경택
    • 산업경영시스템학회지
    • /
    • 제42권4호
    • /
    • pp.194-202
    • /
    • 2019
  • Reliability analysis of the components frequently starts with the data that manufacturer provides. If enough failure data are collected from the field operations, the reliability should be recomputed and updated on the basis of the field failure data. However, when the failure time record for a component contains only a few observations, all statistical methodologies are limited. In this case, where the failure records for multiple number of identical components are available, a valid alternative is combining all the data from each component into one data set with enough sample size and utilizing the useful information in the censored data. The ROK Navy has been operating multiple Patrol Killer Guided missiles (PKGs) for several years. The Korea Multi-Function Control Console (KMFCC) is one of key components in PKG combat system. The maintenance record for the KMFCC contains less than ten failure observations and a censored datum. This paper proposes a Bayesian approach with a Dirichlet mixture model to estimate failure time density for KMFCC. Trends test for each component record indicated that null hypothesis, that failure occurrence is renewal process, is not rejected. Since the KMFCCs have been functioning under different operating environment, the failure time distribution may be a composition of a number of unknown distributions, i.e. a mixture distribution, rather than a single distribution. The Dirichlet mixture model was coded as probabilistic programming in Python using PyMC3. Then Markov Chain Monte Carlo (MCMC) sampling technique employed in PyMC3 probabilistically estimated the parameters' posterior distribution through the Dirichlet mixture model. The simulation results revealed that the mixture models provide superior fits to the combined data set over single models.

분담목록에서의 전거통제와 전거일파공유 (Authority control and authority files in the cooperative cataloging)

  • 최달현
    • 한국도서관정보학회지
    • /
    • 제25권
    • /
    • pp.257-293
    • /
    • 1996
  • This paper reviews various aspects of authouity control system and presents prerequistes for an effective authority control in our future cooperative cataloging. It can be summarized as follows. First, numerous factors affecting authority control must be analyzed and consistent procedures and policies on the authority control have to be established. Second, to make an effective bibiographic data base there must be a standard for the information processing and a systematic organization for information sharing and communicating. Third, for this objective we have to build a MARC format, establish a network for the exchange of automatic authority records among systems, standardize the transcription of multscripts, and establish a centralized automatic authority system for a consistent maintenance of authorityrecords of the union data base. Fourth, it would be one of the best way of achieving cooperative cataloging to set up such a nation-wide authority control system as the NACO in Japan.

  • PDF

Proposal for an Inundation Hazard Index of Road Links for Safer Routing Services in Car Navigation Systems

  • Kim, Ji-Young;Lee, Jae-Bin;Lee, Won-Hee;Yu, Ki-Yun
    • ETRI Journal
    • /
    • 제32권3호
    • /
    • pp.430-439
    • /
    • 2010
  • Inundation of roads by heavy rainfall has attracted more attention than traffic accidents, traffic congestion, and construction because it simultaneously causes travel delays and threatens driver safety. For these reasons, in this paper, we propose an inundation hazard index (IHI) of road links, which shows the possibility of inundation of road links caused by rainfall. To generate the index, we have used two key data sources, namely the digital elevation model (DEM) and past rainfall records of when inundation has occurred. IHI is derived by statistically analyzing the relationships between the normalized relative height of the road links calculated from DEM within the watershed and past rainfall records. After analyzing the practical applicability of the proposed index with a commercial car navigation system through a set of tests, we confirmed that the proposed IHI could be implemented to choose safer routes, with reduced chances of encountering roads having inundation risks.

Genetic Parameters and Annual Trends for Birth and Weaning Weights of a Northeastern Thai Indigenous Cattle Line

  • Intaratham, W.;Koonawootrittriron, S.;Sopannarath, P.;Graser, H.-U.;Tumwasorn, S.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제21권4호
    • /
    • pp.478-483
    • /
    • 2008
  • Records of a Northeastern Thai indigenous cattle line population were used to estimate genetic parameters and annual trends for calf weights. The data set comprised records of 1,922 and 1,489 animals for birth and weaning weight, respectively born from 1993 to 2004. A bivariate analysis was carried out for variance and covariance components estimations using average information restricted maximum likelihood procedure. Average estimated breeding value and maternal breeding value of the animals born in 1993 were set to zero as a base group. Genetic trends of each trait were calculated by regressing average estimated breeding values and maternal breeding values on birth year of calves. Phenotypic trends for each trait were calculated by regressing the yearly adjusted weight on birth year of calves. The results revealed that the estimate of direct heritability, maternal heritability and maternal permanent environmental variance as a proportion of phenotypic variance for birth and weaning weight was 0.40, 0.14 and 0.04; 0.27, 0.05 and 0.23, respectively. Direct heritability was moderately heritable and genetic improvement through selection can be achieved. The estimate of phenotypic, direct genetic, maternal genetic and maternal permanent environmental correlation between birth and weaning weight was 0.48, 0.65, 0.98 and 0.73, respectively. The phenotypic trend, genetic trends of estimated breeding value and maternal breeding value for birth weight was 0.18, 0.04 and 0.01 kg/year, respectively. The phenotypic trend, genetic trends of estimated breeding value and maternal breeding value for weaning weight was -1.36, 0.32 and 0.03 kg/year, respectively. As maternal genetic effect was considerably less important than direct genetic effect, selection for improved weaning weight of this Northeastern Thai indigenous cattle line can place more emphasis on the direct genetic effect.

두류식품의 지역 이름 브랜드화의 효과: 한국 소비자의 종적 데이터 분석을 중심으로 (The Effects of Regional Branding on Soybean Products: Evidence from Consumer Longitudinal Data in Korea)

  • 김태경;정구현
    • 유통과학연구
    • /
    • 제14권10호
    • /
    • pp.109-116
    • /
    • 2016
  • Purpose - This study investigates the purchase pattern relating to soybean products in Korea. Specifically, the effect of branding based on a regional name was analyzed in terms of consumer purchase frequencies. The primary purpose of this study is to understand why family characteristics affect product selection for a regional brand in the soybean food category. Research design, data, and methodology - We used data collected by the Rural Development Administration (RDA) of Korea. The RDA has monitored agricultural food consumers for years in order to obtain purchase records. Panel participants live in regions near the capital city of Seoul, Korea. Examining data from January 2010 to May 2016, 667 families were selected for analysis. The final data set was 1,335,402. Each purchase item by each individual family was aggregated to a countable weekly observation. To analyze the data set quantitatively, zero-inflation regression was adopted, which was appropriate to avoid biases from overly dispersed observations. Results - We hypothesized the effects of regional branding from the viewpoint of the family characteristics. The first hypothesis was that the number of children would be positively associated with the purchase of a regional brand of soybean products. The result strongly supported this hypothesis. The second hypothesis was that the number of family members would be negatively associated with the purchase of the soybean products of a regional brand. Based on empirical analysis, we concluded that this hypothesis was partially supported. The third hypothesis was the presence of an interaction effect between the number of children and the family size, which was supported by the results. As a supplementary analysis, we also tested mean-variance differences in terms of categories and regional branding with corporate branding. Conclusion - The results of this study provide insights for regional branding strategies in agricultural food management. This study appears to be one of the seminal studies trying to analyze purchase patterns from longitudinal observations. In addition, this study adopted variables characterizing family lifestyle. This study confirmed that children and family size should be considered when soybean product brands are introduced.

다중 연속 스카이라인 질의의 효율적인 처리 기법 (Multiple Continuous Skyline Query Processing Over Data Streams)

  • 이유원;이기용;김명호
    • 한국전자거래학회지
    • /
    • 제15권4호
    • /
    • pp.165-179
    • /
    • 2010
  • 최근 들어 e-비즈니스 환경에서도 증권 거래, 시세, 주문 및 과금 데이터와 같이 지속적으로 유입되는 데이터 스트림에 대한 처리가 중요해지고 있다. 이 중에서도 데이터 스트림에 대한 다기준 의사 결정에 사용되는 스카이라인(skyline) 질의의 사용이 증가하고 있다. 다차원 튜플의 집합이 주어졌을 때, 스카이라인 집합은 다른 튜플에 의해 지배(dominate)되지 않는 튜플들의 집합을 반환한다. 고정된 데이터에 대한 단일 스카이라인 질의 처리에 대해서는 최근까지 많은 연구가 이루어져 왔으나, 데이터 스트림 환경에서 다중 연속 스카이라인 질의 처리에 대해서는 아직까지 많은 연구가 수행되지 않았다. 본 논문에서는 데이터 스트림 환경에서 하나 이상의 연속 스카이라인 질의들이 주어졌을 때, 이들을 효율적으로 처리할 수 있는 방법을 제안한다. 제안하는 방법은 각 튜플이 어떤 질의의 결과에 포함될지를 효율적으로 파악함으로써, 여러 개의 연속 스카이라인 질의들도 적은 비용으로 동시에 처리할 수 있다. 다양한 실험을 통해 제안하는 방법의 우수성을 보인다.

한국남자프로농구 경기기록 분석을 통한 승패결정요인 추정: 2010-2011시즌, 2011-2012시즌 정규리그 기록 적용 (Estimating the determinants of victory and defeat through analyzing records of Korean pro-basketball)

  • 김세형;이준우;이미숙
    • Journal of the Korean Data and Information Science Society
    • /
    • 제23권5호
    • /
    • pp.993-1003
    • /
    • 2012
  • 한국남자프로농구 경기기록을 이용하여 승패결정요인을 분석하였다. 2010년 10월부터 2011년 3월까지, 2011년 10월부터 2012년 3월까지 치러진 정규리그 (540경기)의 기록을 분석하여 승패결정요인을 추정하였다. 한국농구연맹은 7개 공격변인과 7개 수비변인에 대한 자료를 제공하고 있다. 이들 자료 중에 공헌도와 공격력에 적용되는 6개 공격변인 (2점슛 성공률, 3점슛 성공률, 자유투 성공률, 공격리바운드, 어시스트, 턴오버)과 4개 수비변인 (수비리바운드, 스틸, 굿디펜스, 블록슛)이 승패에 미치는 영향을 통계적으로 분석하기 위해 로지스틱회귀분석과 의사결정나무분석을 적용하였다. 두 분석은 PASW와 Answer Tree 통계프로그램을 사용하였으며 모든 유의수준은 .05로 설정하였다. 로지스틱회귀분석 결과, 6개 공격변인 중 2점슛 성공률, 3점슛 성공률, 턴오버가 통계적으로 승패에 유의미한 영향을 미치고 4개 수비변인 중 굿디펜스를 제외한 수비리바운드, 스틸, 블록슛이 통계적으로 승패에 유의미한 영향을 미치는 것으로 나타났다. 그리고 공격변인 의사결정나무분석 결과에서는 2점슛 성공률이 51%-58%이며, 3P%가 31%를 초과하고 TO가 11개 이하일때 승리할 수 있는 확률이 80.85%로 가장 높게 나타났다. 이에 반해 수비변인 의사결정나무분석 결과, 수비리바운드가 24개를 초과하고 스틸이 6개를 초과하며, 블록슛이 2개를 초과할 때 승리할 수 있는 확률이 94.12%로 가장 높게 나타났다.

A Data-Consistency Scheme for the Distributed-Cache Storage of the Memcached System

  • Liao, Jianwei;Peng, Xiaoning
    • Journal of Computing Science and Engineering
    • /
    • 제11권3호
    • /
    • pp.92-99
    • /
    • 2017
  • Memcached, commonly used to speed up the data access in big-data and Internet-web applications, is a system software of the distributed-cache mechanism. But it is subject to the severe challenge of the loss of recently uncommitted updates in the case where the Memcached servers crash due to some reason. Although the replica scheme and the disk-log-based replay mechanism have been proposed to overcome this problem, they generate either the overhead of the replica synchronization or the persistent-storage overhead that is caused by flushing related logs. This paper proposes a scheme of backing up the write requests (i.e., set and add) on the Memcached client side, to reduce the overhead resulting from the making of disk-log records or performing the replica consistency. If the Memcached server fails, a timestamp-based recovery mechanism is then introduced to replay the write requests (buffered by relevant clients), for regaining the lost-data updates on the rebooted Memcached server, thereby meeting the data-consistency requirement. More importantly, compared with the mechanism of logging the write requests to the persistent storage of the master server and the server-replication scheme, the newly proposed approach of backing up the logs on the client side can greatly decrease the time overhead by up to 116.8% when processing the write workloads.

데이타 스트림에서의 다중 조인 질의 최적화 방법 (Optimizing Multi-way Join Query Over Data Streams)

  • 박홍규;이원석
    • 한국정보과학회논문지:데이타베이스
    • /
    • 제35권6호
    • /
    • pp.459-468
    • /
    • 2008
  • 데이타 스트림이란 실시간에 연속적으로 빠르게 생성되는 데이타 집합을 의미한다. 이러한 데이타 스트림들은 최근 사회가 발달과 더불어 정보 환경도 급속도로 발전함에 따라 센서 데이타, 교통상황 수집 자료, 웹 클릭 모니터링 등과 같은 많은 응용 분야에서 적용되고 있다. 이러한 형태의 데이트 스트립을 처리하기 위해서는 미리 등록된 질의에 대하여 새롭게 들어오는 스트림 데이타의 결과를 계속적으로 생성하게 된다. 이와 같은 이유로 끊임없이 들어오는 스트링 데이타들을 빠르게 처리하는 것이 이 분야에서 주된 이슈가 되었으며, 이를 위한 방법으로 등록된 질의들을 효율적으로 처리하기 위한 질의 최적화분야에 많은 연구가 있었다. 그러므로 본 논문에서는 기존 연구에서 사용되었던 그리디 방법을 기반으로 비용 모델을 이용하여 최소의 비용을 갖는 질의 계획을 선택하는 확장된 그리디 방법(EGA)을 제시한다. 화장된 그리디 방법은 알고리즘의 정확성이 떨어지는 그리디 알고리즘의 단점을 극복하기 위하여 비용이 가장 작은 연산하나를 선택하는 대신 비용이 자은 연산들의 집합을 선택한다. 이 연산들의 집합의 크기는 알고리즘의 정확성과 수행 시간에 영향을 끼치며, 투 개의 변수에 의해서 적응적으로 조절 수 있다. 실험에서는 다양한 스트림 환경에서 대부분 그리디 알고리즘보다 향상된 성능을 보장하고, 두 변수에 의한 알고리즘의 성능 및 수행 시간 차이를 보여줌으로써 본 알고리즘의 효율성을 검증하였다.

A DATABASE FOR HUMAN PERFORMANCE UNDER SIMULATED EMERGENCIES OF NUCLEAR POWER PLANTS

  • Park, Jin-Kyun;Jung, Won-Dea
    • Nuclear Engineering and Technology
    • /
    • 제37권5호
    • /
    • pp.491-502
    • /
    • 2005
  • Reliable human performance is a prerequisite in securing the safety of complicated process systems such as nuclear power plants. However, the amount of available knowledge that can explain why operators deviate from an expected performance level is so small because of the infrequency of real accidents. Therefore, in this study, a database that contains a set of useful information extracted from simulated emergencies was developed in order to provide important clues for understanding the change of operators' performance under stressful conditions (i.e., real accidents). The database was developed under Microsoft Windows TM environment using Microsoft Access $97^{TM}$ and Microsoft Visual Basic $6.0^{TM}$. In the database, operators' performance data obtained from the analysis of over 100 audio-visual records for simulated emergencies were stored using twenty kinds of distinctive data fields. A total of ten kinds of operators' performance data are available from the developed database. Although it is still difficult to predict operators' performance under stressful conditions based on the results of simulated emergencies, simulation studies remain the most feasible way to scrutinize performance. Accordingly, it is expected that the performance data of this study will provide a concrete foundation for understanding the change of operators' performance in emergency situations.