• Title/Summary/Keyword: Distribution data

Search Result 17,544, Processing Time 0.14 seconds

On the STSP Normal Distribution

  • Choi, Jeen-Kap
    • Journal of the Korean Data and Information Science Society
    • /
    • v.16 no.2
    • /
    • pp.451-456
    • /
    • 2005
  • We introduce the standard two-sided power normal distribution and consider the relation between the probability in standard two-sided power distribution and the probability in standard two-sided power normal distribution and obtain the even moment of the special two-sided power normal distribution including the cases considered by Gupta and Nadarajah(2004)

  • PDF

Vessel traffic geometric probability approaches with AIS data in active shipping lane for subsea pipeline quantitative risk assessment against third-party impact

  • Tanujaya, Vincent Alvin;Tawekal, Ricky Lukman;Ilman, Eko Charnius
    • Ocean Systems Engineering
    • /
    • v.12 no.3
    • /
    • pp.267-284
    • /
    • 2022
  • A subsea pipeline designed across active shipping lane prones to failure against external interferences such as anchorage activities, hence risk assessment is essential. It requires quantifying the geometric probability derived from ship traffic distribution based on Automatic Identification System (AIS) data. The actual probability density function from historical vessel traffic data is ideal, as for rapid assessment, conceptual study, when the AIS data is scarce or when the local vessels traffic are not utilised with AIS. Recommended practices suggest the probability distribution is assumed as a single peak Gaussian. This study compares several fitted Gaussian distributions and Monte Carlo simulation based on actual ship traffic data in main ship direction in an active shipping lane across a subsea pipeline. The results shows that a Gaussian distribution with five peaks is required to represent the ship traffic data, providing an error of 0.23%, while a single peak Gaussian distribution and the Monte Carlo simulation with one hundred million realisation provide an error of 1.32% and 0.79% respectively. Thus, it can be concluded that the multi-peak Gaussian distribution can represent the actual ship traffic distribution in the main direction, but it is less representative for ship traffic distribution in other direction. The geometric probability is utilised in a quantitative risk assessment (QRA) for subsea pipeline against vessel anchor dropping and dragging and vessel sinking.

Notes on a skew-symmetric inverse double Weibull distribution

  • Woo, Jung-Soo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.2
    • /
    • pp.459-465
    • /
    • 2009
  • For an inverse double Weibull distribution which is symmetric about zero, we obtain distribution and moment of ratio of independent inverse double Weibull variables, and also obtain the cumulative distribution function and moment of a skew-symmetric inverse double Weibull distribution. And we introduce a skew-symmetric inverse double Weibull generated by a double Weibull distribution.

  • PDF

Symbolic Cluster Analysis for Distribution Valued Dissimilarity

  • Matsui, Yusuke;Minami, Hiroyuki;Misuta, Masahiro
    • Communications for Statistical Applications and Methods
    • /
    • v.21 no.3
    • /
    • pp.225-234
    • /
    • 2014
  • We propose a novel hierarchical clustering for distribution valued dissimilarities. Analysis of large and complex data has attracted significant interest. Symbolic Data Analysis (SDA) was proposed by Diday in 1980's, which provides a new framework for statistical analysis. In SDA, we analyze an object with internal variation, including an interval, a histogram and a distribution, called a symbolic object. In the study, we focus on a cluster analysis for distribution valued dissimilarities, one of the symbolic objects. A hierarchical clustering has two steps in general: find out step and update step. In the find out step, we find the nearest pair of clusters. We extend it for distribution valued dissimilarities, introducing a measure on their order relations. In the update step, dissimilarities between clusters are redefined by mixture of distributions with a mixing ratio. We show an actual example of the proposed method and a simulation study.

A Study on Constructing the Prediction System Using Data Mining Techniques to Find Medium-Voltage Customers Causing Distribution Line Faults (특별고압 수전설비 관리에 데이터 마이닝 기법을 적용한 파급고장 발생가능고객 예측시스템 구현 연구)

  • Bae, Sung-Hwan;Kim, Ja-Hee;Lim, Han-Seung
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.58 no.12
    • /
    • pp.2453-2461
    • /
    • 2009
  • Faults caused by medium-voltage customers have been increased and enlarged their portion in total distribution faults even though we have done many efforts. In the previous paper, we suggested the fault prediction model and fault prevention method for these distribution line faults. However we can't directly apply this prediction model in the field. Because we don't have an useful program to predict those customers causing distribution line faults. This paper presents the construction method of data warehouse in ERP system and the program to find customers who cause distribution line faults in medium-voltage customer's electric facility management applying data mining techniques. We expect that this data warehouse and prediction program can effectively reduce faults resulted from medium-voltage customer facility.

Cubic normal distribution and its significance in structural reliability

  • Zhao, Yan-Gang;Lu, Zhao-Hui
    • Structural Engineering and Mechanics
    • /
    • v.28 no.3
    • /
    • pp.263-280
    • /
    • 2008
  • Information on the distribution of the basic random variable is essential for the accurate analysis of structural reliability. The usual method for determining the distributions is to fit a candidate distribution to the histogram of available statistical data of the variable and perform approximate goodness-of-fit tests. Generally, such candidate distribution would have parameters that may be evaluated from the statistical moments of the statistical data. In the present paper, a cubic normal distribution, whose parameters are determined using the first four moments of available sample data, is investigated. A parameter table based on the first four moments, which simplifies parameter estimation, is given. The simplicity, generality, flexibility and advantages of this distribution in statistical data analysis and its significance in structural reliability evaluation are discussed. Numerical examples are presented to demonstrate these advantages.

Implementation of Multicore-Aware Load Balancing on Clusters through Data Distribution in Chapel (클러스터 상에서 다중 코어 인지 부하 균등화를 위한 Chapel 데이터 분산 구현)

  • Gu, Bon-Gen;Carpenter, Patrick;Yu, Weikuan
    • The KIPS Transactions:PartA
    • /
    • v.19A no.3
    • /
    • pp.129-138
    • /
    • 2012
  • In distributed memory architectures like clusters, each node stores a portion of data. How data is distributed across nodes influences the performance of such systems. The data distribution scheme is the strategy to distribute data across nodes and realize parallel data processing. Due to various reasons such as maintenance, scale up, upgrade, etc., the performance of nodes in a cluster can often become non-identical. In such clusters, data distribution without considering performance cannot efficiently distribute data on nodes. In this paper, we propose a new data distribution scheme based on the number of cores in nodes. We use the number of cores as the performance factor. In our data distribution scheme, each node is allocated an amount of data proportional to the number of cores in it. We implement our data distribution scheme using the Chapel language. To show our data distribution is effective in reducing the execution time of parallel applications, we implement Mandelbrot Set and ${\pi}$-Calculation programs with our data distribution scheme, and compare the execution times on a cluster. Based on experimental results on clusters of 8-core and 16-core nodes, we demonstrate that data distribution based on the number of cores can contribute to a reduction in the execution times of parallel programs on clusters.

Excel macro for applying Bayes' rule (베이즈 법칙의 활용을 위한 엑셀 매크로)

  • Kim, Jae-Hyun;Baek, Hoh-Yoo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.6
    • /
    • pp.1183-1197
    • /
    • 2011
  • The prior distribution is the probability distribution we have before observing data. Using Bayes' rule, we can compute the posterior distribution, the new probability distribution, after observing data. Computing the posterior distribution is much easier than before by using Excel VBA macro. In addition, we can conveniently compute the successive updating posterior distributions after observing the independent and sequential outcomes. In this paper we compose some Excel VBA macros for applying Bayes' rule and give some examples.

On Estimating the Hazard Rate for Samples from Weighted Distributions

  • Ahmad, Ibrahim A.
    • International Journal of Reliability and Applications
    • /
    • v.1 no.2
    • /
    • pp.133-143
    • /
    • 2000
  • Data from weighted distributions appear, among other situations, when some of the data are missing or are damaged, a case that is important in reliability and life testing. The kernel method for hazard rate estimation is discussed for these data where the basic large sample properties are given. As a by product, the basic properties of the kernel estimate of the distribution function for data from weighted distribution are presented.

  • PDF

THE BIVARIATE GAMMA EXPONENTIAL DISTRIBUTION WITH APPLICATION TO DROUGHT DATA

  • Nadarajah, Saralees
    • Journal of applied mathematics & informatics
    • /
    • v.24 no.1_2
    • /
    • pp.221-230
    • /
    • 2007
  • The exponential and the gamma distributions have been the traditional models for drought duration and drought intensity data, respectively. However, it is often assumed that the drought duration and drought intensity are independent, which is not true in practice. In this paper, an application of the bivariate gamma exponential distribution is provided to drought data from Nebraska. The exact distributions of R=X+Y, P=XY and W=X/(X+Y) and the corresponding moment properties are derived when X and Y follow this bivariate distribution.