• Title/Summary/Keyword: Skewed Distribution

Search Result 228, Processing Time 0.024 seconds

Tree Size Distribution Modelling: Moving from Complexity to Finite Mixture

  • Ogana, Friday Nwabueze;Chukwu, Onyekachi;Ajayi, Samuel
    • Journal of Forest and Environmental Science
    • /
    • v.36 no.1
    • /
    • pp.7-16
    • /
    • 2020
  • Tree size distribution modelling is an integral part of forest management. Most distribution yield systems rely on some flexible probability models. In this study, a simple finite mixture of two components two-parameter Weibull distribution was compared with complex four-parameter distributions in terms of their fitness to predict tree size distribution of teak (Tectona grandis Linn f) plantations. Also, a system of equation was developed using Seemingly Unrelated Regression wherein the size distributions of the stand were predicted. Generalized beta, Johnson's SB, Logit-Logistic and generalized Weibull distributions were the four-parameter distributions considered. The Kolmogorov-Smirnov test and negative log-likelihood value were used to assess the distributions. The results show that the simple finite mixture outperformed the four-parameter distributions especially in stands that are bimodal and heavily skewed. Twelve models were developed in the system of equation-one for predicting mean diameter, seven for predicting percentiles and four for predicting the parameters of the finite mixture distribution. Predictions from the system of equation are reasonable and compare well with observed distributions of the stand. This simplified mixture would allow for wider application in distribution modelling and can also be integrated as component model in stand density management diagram.

Distribution of Path Loss for Wireless Personal Networks Operating in a Square Region

  • Yang, Rumin;Shen, Bin;Kwak, Kyung-Sup
    • ETRI Journal
    • /
    • v.33 no.2
    • /
    • pp.283-286
    • /
    • 2011
  • Path loss plays fundamental roles in system design, spectrum management, and performance evaluation. The traditional path loss model has a slight inconvenience; it depends on the unknown distance. In this letter, we explore the probability distribution function (PDF) of path loss in an indoor office environment by randomizing out the distance variable. It is shown that the resulting PDF is not Gaussian-like but is skewed to the right, and both the PDF and the moments are related to the size of the office instead of the unknown distance. To be specific, we incorporate the IEEE 802.15.4a channel parameters into our model and tabulate the cumulative distribution function with respect to different room sizes. Through a simple example, we show how our model helps a cognitive spectrum user to infer path loss information of primary users without necessarily knowing their transmitter-receiver distance.

Design of Median Control Chart for Unsymmetrical Weibull Distribution (비대칭(非對稱)와이블분포공정(分布工程)에서 메디안특수관리도(特殊管理圖)의 설계(設計))

  • Sin, Yong-Baek;Hwang, Ui-Mi
    • Journal of Korean Society for Quality Management
    • /
    • v.14 no.2
    • /
    • pp.2-8
    • /
    • 1986
  • This thesis is concerned with the design of control chart based on the sample median which is easy to use in practical situations and to analyze the properties for non-normally distributed Weibull process. In this cases are use to the quality characteristics of the process are not normally distributed but skewed due to the intermitted production, small lot size and sample size is small one n=3 or n=5, etc. And when it relates unsymmetrically distributed process, model designed median control chart is more effective than Shewhart $\bar{x}$-chart which assumed on normal distribution, when we exactly should be known Weibull distribution or estimated. The median control chart in this thesis is more robustness compared with other conventionally developed control chart.

  • PDF

A spatial heterogeneity mixed model with skew-elliptical distributions

  • Farzammehr, Mohadeseh Alsadat;McLachlan, Geoffrey J.
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.3
    • /
    • pp.373-391
    • /
    • 2022
  • The distribution of observations in most econometric studies with spatial heterogeneity is skewed. Usually, a single transformation of the data is used to approximate normality and to model the transformed data with a normal assumption. This assumption is however not always appropriate due to the fact that panel data often exhibit non-normal characteristics. In this work, the normality assumption is relaxed in spatial mixed models, allowing for spatial heterogeneity. An inference procedure based on Bayesian mixed modeling is carried out with a multivariate skew-elliptical distribution, which includes the skew-t, skew-normal, student-t, and normal distributions as special cases. The methodology is illustrated through a simulation study and according to the empirical literature, we fit our models to non-life insurance consumption observed between 1998 and 2002 across a spatial panel of 103 Italian provinces in order to determine its determinants. Analyzing the posterior distribution of some parameters and comparing various model comparison criteria indicate the proposed model to be superior to conventional ones.

Design of Median Control Chart for Nonnormally Distributed Processes (비정규분포공정(非正規分布工程)에서 메디안특수관리도(特殊管理圖)의 모형설계(模型設計))

  • Sin, Yong-Baek
    • Journal of Korean Society for Quality Management
    • /
    • v.15 no.2
    • /
    • pp.10-19
    • /
    • 1987
  • Statistical control charts are useful tools to monitor and control the manufacturing processes and are widely used in most Korean industries. Many Korean companies, however, do not always obtain desired results from the traditional control charts by Shewhart such as the $\overline{X}$-chart, X-chart, $\widetilde{X}$-chart, etc. This is partly because the quality charterstics of the process are not distributed normally but are skewed due to the intermittent production, small lot size, etc. In the Shewhart $\overline{X}$-chart, which is the most widely used one in Korea, such skewed distributions make the plots to be inclined below or above the central line or outside the control limits although no assignable causes can be found. To overcome such shortcomings in nonnormally distributed processes, a distribution-free type of confidence interval can be used, which should be based on order statistics. This thesis is concerned with the design of control chart based on a sample median which is easy to use in practical situation and therefore properties for nonnormal distributions may be easily analyzed. Control limits and central lines are given for the more famous nonnormal distributions, such as Gamma, Beta, Lognormal, Weibull, Pareto, and Truncated-normal distributions.

  • PDF

Use of beta-P distribution for modeling hydrologic events

  • Murshed, Md. Sharwar;Seo, Yun Am;Park, Jeong-Soo;Lee, Youngsaeng
    • Communications for Statistical Applications and Methods
    • /
    • v.25 no.1
    • /
    • pp.15-27
    • /
    • 2018
  • Parametric method of flood frequency analysis involves fitting of a probability distribution to observed flood data. When record length at a given site is relatively shorter and hard to apply the asymptotic theory, an alternative distribution to the generalized extreme value (GEV) distribution is often used. In this study, we consider the beta-P distribution (BPD) as an alternative to the GEV and other well-known distributions for modeling extreme events of small or moderate samples as well as highly skewed or heavy tailed data. The L-moments ratio diagram shows that special cases of the BPD include the generalized logistic, three-parameter log-normal, and GEV distributions. To estimate the parameters in the distribution, the method of moments, L-moments, and maximum likelihood estimation methods are considered. A Monte-Carlo study is then conducted to compare these three estimation methods. Our result suggests that the L-moments estimator works better than the other estimators for this model of small or moderate samples. Two applications to the annual maximum stream flow of Colorado and the rainfall data from cloud seeding experiments in Southern Florida are reported to show the usefulness of the BPD for modeling hydrologic events. In these examples, BPD turns out to work better than $beta-{\kappa}$, Gumbel, and GEV distributions.

Depositional Environment of Sandy Tidal Flat in Anmyeondo, Western Coast of Korea (서해안 안면도 사질 조석대의 퇴적환경)

  • Oh, Jae-Kyung;Han, Chang-Hee
    • Journal of the Korean earth science society
    • /
    • v.31 no.2
    • /
    • pp.139-150
    • /
    • 2010
  • This study investigated the depositional environment of the area of Anmyeondo in the mid-western coast of Korea. A total of 142 surface sediments were analyzed, and leveling survey was conducted as well in 12 transects at the sandy tidal flat in the area. The surface sediments are classified into several different types (slightly gravelly Sand, Sand, slightly gravelly sandy Mud, etc.) and they show fining-distribution toward the offshore. Textural parameters are characterized by being coarse, well sorted, positively skewed in the supratidal and intertidal flat, whereas being finer, poorly sorted, negatively skewed in the subtidal flat. According to the C-M diagram, the sediment transport modes are characterized by the mixed mode of bedload and graded suspension in supratidal and intertidal flat, and by graded suspension or uniform suspension in the subtidal flat. Histogram shows a bimodal distribution in the lower subtidal flat which is indicative of two sediment transport modes.

Revision of Ecological Score of Benthic Macroinvertebrates Community in Korea (한국의 저서성 대형무척추동물 생태점수 개선)

  • Kong, Dongsoo;Park, Youngjun;Jeon, Yong-Rak
    • Journal of Korean Society on Water Environment
    • /
    • v.34 no.3
    • /
    • pp.251-269
    • /
    • 2018
  • In 1997, the environmental score (Q) corresponding the tolerance value of Korean benthic macroinvertebrate species and the related biotic score, the ecological score of benthic macroinvertebrates (ESB) were proposed; ESB was similar to Beck's biotic index or Chandler's biotic score. This study was carried out to revise the Q values of individual species and the assessment scheme of ESB based on the taxonomic performance and ecological information accumulated since then. The original ESB was renamed as TESB (total ESB), and AESB (average ESB) was newly proposed. AESB is calculated by dividing the TESB (for a given station) by the number of species present in the sample. In this study, TESB showed a positively skewed distribution, while AESB showed a negatively skewed distribution. The correlation between TESB and the concentration of $BOD_5$ was a little stronger than that of the original ESB. TESB showed a very strong correlation (correlation coefficient r = 0.98) with Margalef's species richness, of which correlation coefficient was higher than that of AESB (r = 0.85). AESB showed a strong correlation (r = -0.79) with the concentration of $BOD_5$, while TESB showed a weaker correlation (r = -0.67). Applying TESB and AESB together in an assessment of the environment may be comprehensive because the physical and chemical states of the environment can be evaluated together. AESB is less dependent on the sample size, while TESB tends to increase as the sample size increases. In the evaluation of the environment using TESB, it is necessary to standardize the methods on monitoring.

A Physical Design Method of Storage Structures for MOLAP Systems of Data Warehouse (데이터 웨어하우스의 다차원 온라인 분석처리 시스템을 위한 저장구조의 물리적 설계기법)

  • Lee Jong-Hak
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.3
    • /
    • pp.297-312
    • /
    • 2005
  • Aggregation is an operation that plays a key role in multidimensional OLAP (MOLAP) systems of data warehouse. Existing aggregation operations in MOLAP have been proposed for file structures such as multidimensional arrays. These tile structures do not work well with skewed distributions. This paper presents a physical design methodology for storage structures ni MOLAP that use the multidimensional tile organizations adapting to a skewed distribution. In uniform data distribution, we first show that the performance of multidimensional analytical processing is highly affected by the similarity of the shapes between query regions and page regions in the domain space of the multidimensional file organizations. And than, in skewed distributions, we reflect the effect of data distributions on the design by using the shapes of the normalized query regions that are weighted with data density of those query regions. Finally, we demonstrate that the physical design methodology theoretically derived is indeed correct in real environments. In the two-dimensional file organizations, the results of experiments indicate that the performance of the proposed method is enhanced by more than seven times over the conventional method. We expect that the performance will be more enhanced when the dimensionality is more than two. The result confirms that the proposed physical design methodology is useful in a practical way.

  • PDF

Power Investigation of the Entropy-Based Test of Fit for Inverse Gaussian Distribution by the Information Discrimination Index

  • Choi, Byungjin
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.6
    • /
    • pp.837-847
    • /
    • 2012
  • Inverse Gaussian distribution is widely used in applications to analyze and model right-skewed data. To assess the appropriateness of the distribution prior to data analysis, Mudholkar and Tian (2002) proposed an entropy-based test of fit. The test is based on the entropy power fraction(EPF) index suggested by Gokhale (1983). The simulation results report that the power of the entropy-based test is superior compared to other goodness-of-fit tests; however, this observation is based on the small-scale simulation results on the standard exponential, Weibull W(1; 2) and lognormal LN(0:5; 1) distributions. A large-scale simulation should be performed against various alternative distributions to evaluate the power of the entropy-based test; however, the use of a theoretical method is more effective to investigate the powers. In this paper, utilizing the information discrimination(ID) index defined by Ehsan et al. (1995) as a mathematical tool, we scrutinize the power of the entropy-based test. The selected alternative distributions are the gamma, Weibull and lognormal distributions, which are widely used in data analysis as an alternative to inverse Gaussian distribution. The study results are provided and an illustrative example is analyzed.