• Title/Summary/Keyword: statistical probability models

Search Result 217, Processing Time 0.032 seconds

Balanced Simultaneous Confidence Intervals in Logistic Regression Models

  • Lee, Kee-Won
    • Journal of the Korean Statistical Society
    • /
    • v.21 no.2
    • /
    • pp.139-151
    • /
    • 1992
  • Simultaneous confidence intervals for the parameters in the logistic regression models with random regressors are considered. A method based on the bootstrap and its stochastic approximation will be developed. A key idea in using the bootstrap method to construct simultaneous confidence intervals is the concept of prepivoting which uses the transformation of a root by its estimated cumulative distribution function. Repeated use of prepivoting makes the overall coverage probability asymptotically correct and the coverage probabilities of the individual confidence statement asymptotically equal. This method is compared with ordinary asymptotic methods based on Scheffe's and Bonferroni's through Monte Carlo simulation.

  • PDF

Failure Probability Models of Concrete Subjected to Split Tension Repeated- Loads (쪼갬인장 반복하중을 받는 콘크리트의 파괴확률 모델)

  • 김동호;김경진;이봉학;윤경구
    • Proceedings of the Korea Concrete Institute Conference
    • /
    • 2003.05a
    • /
    • pp.311-314
    • /
    • 2003
  • Concrete structures such as bridge, pavement, airfield, and offshore structure are normally subjected to repeated load. This paper proposes a failure probability models of concrete subjected to split tension repeated-loads, based on experimental results. The fatigue tests were performed at the stress ratio of 0.1, the loading shape of sine, the frequency of 20Hz, and the stress levels of 90, 80 and 70%. The fatigue test specimen was 150mm in diameter and 75mm in thickness. The fatigue analysis did not include which exceeded 0.9 of statistical coefficient of determination values or did not failure at 2$\times$$10^6$ cycles. The graphical method, the moment method, and maximum likelihood estimation method were used to obtain Weibull distribution parameters. The goodness-of-fit test by Kolmogorov-Smirnov test was acceptable 5% level of significance. As a result, the proposed failure probability model based on the two-parameter($\alpha and \mu$) Weibull distribution was good enough to estimate accurately the fatigue life subjected to tension mode.

  • PDF

A study on the behavioral-structure of production activity through the statistical analysis models - focus on the probability distribution of PERT, Queueing theory - (통계적(統計的) 계량분석(計量分析)모델을 통한 생산활동(活動)의 행태구조(行態構造)에 관한 연구 -RERT와 Queueing theory의 확률분포를 중심으로-)

  • Kim, Hong Jae
    • Journal of Korean Society for Quality Management
    • /
    • v.19 no.2
    • /
    • pp.145-157
    • /
    • 1991
  • Thid study intends to pursue behavioral-structure of production behavior through statistical models which are using in PERT and Queueing theory. We can corprehand the orders of human production behavior's characteristics by several related attributes of probablity/statistics. These attributes are poisson, Beta, exponential distributions and P.S Laplace's natural probability. Human production behavior is related and regressed to these attributes in many divisions intermediately. Progressive numerical understanding in many essential human behavior acts on the application of practical behavior standard in production word and operation.

  • PDF

Laplace's Method for General Integrals with Applications to Statistical Mechanics

  • Park, Nae-Hyun;Jeon, Jong-Woo
    • Journal of the Korean Statistical Society
    • /
    • v.14 no.2
    • /
    • pp.87-94
    • /
    • 1985
  • This paper extends the results of Ellis and Rosen (1982 a) to some more general integrals and applies our main theorem to compute the specific free energy of some models in statistical mechanics. The general integrals of this paper mean the integrals with respect to the probability measures induced by the sample mean of n i.i.d. random variables taking values in a separable Banach space.

  • PDF

Truncation Parameter Selection in Binary Choice Models (이항 선택 모형에서의 절단 모수 선택)

  • Kim, Kwang-Rae;Cho, Kyu-Dong;Koo, Ja-Yong
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.6
    • /
    • pp.811-827
    • /
    • 2010
  • This paper deals with a density estimation method in binary choice models that can be regarded as a statistical inverse problem. We use an orthogonal basis to estimate density function and consider the choice of an appropriate truncation parameter to reflect the model complexity and the prediction accuracy. We propose a data-dependent rule to choose the truncation parameter in the context of binary choice models. A numerical simulation is provided to illustrate the performance of the proposed method.

Development of an Integer Algorithm for Computation of the Matching Probability in the Hidden Markov Model (I) (은닉마르코브 모델의 부합확률연산의 정수화 알고리즘 개발 (I))

  • 김진헌;김민기;박귀태
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.31B no.8
    • /
    • pp.11-19
    • /
    • 1994
  • The matching probability P(ο/$\lambda$), of the signal sequence(ο) observed for a finite time interval with a HMM (Hidden Markov Model $\lambda$) indicates the probability that signal comes from the given model. By utilizing the fact that the probability represents matching score of the observed signal with the model we can recognize an unknown signal pattern by comparing the magnitudes of the matching probabilities with respect to the known models. Because the algorithm however uses floating point variables during the computing process hardware implementation of the algorithm requires floating point units. This paper proposes an integer algorithm which uses positive integer numbers rather than float point ones to compute the matching probability so that we can economically realize the algorithm into hardware. The algorithm makes the model parameters integer numbers by multiplying positive constants and prevents from divergence of data through the normalization of variables at each step. The final equation of matching probability is composed of constant terms and a variable term which contains logarithm operations. A scheme to make the log conversion table smaller is also presented. To analyze the qualitive characteristics of the proposed algorithm we attatch simulation result performed on two groups of 10 hypothetic models respectively and inspect the statistical properties with repect to the model order the magnitude of scaling constants and the effect of the observation length.

  • PDF

Binary Forecast of Heavy Snow Using Statistical Models

  • Sohn, Keon-Tae
    • Communications for Statistical Applications and Methods
    • /
    • v.13 no.2
    • /
    • pp.369-378
    • /
    • 2006
  • This Study focuses on the binary forecast of occurrence of heavy snow in Honam area based on the MOS(model output statistic) method. For our study daily amount of snow cover at 17 stations during the cold season (November to March) in 2001 to 2005 and Corresponding 45 RDAPS outputs are used. Logistic regression model and neural networks are applied to predict the probability of occurrence of Heavy snow. Based on the distribution of estimated probabilities, optimal thresholds are determined via true shill score. According to the results of comparison the logistic regression model is recommended.

Variable selection and prediction performance of penalized two-part regression with community-based crime data application

  • Seong-Tae Kim;Man Sik Park
    • Communications for Statistical Applications and Methods
    • /
    • v.31 no.4
    • /
    • pp.441-457
    • /
    • 2024
  • Semicontinuous data are characterized by a mixture of a point probability mass at zero and a continuous distribution of positive values. This type of data is often modeled using a two-part model where the first part models the probability of dichotomous outcomes -zero or positive- and the second part models the distribution of positive values. Despite the two-part model's popularity, variable selection in this model has not been fully addressed, especially, in high dimensional data. The objective of this study is to investigate variable selection and prediction performance of penalized regression methods in two-part models. The performance of the selected techniques in the two-part model is evaluated via simulation studies. Our findings show that LASSO and ENET tend to select more predictors in the model than SCAD and MCP. Consequently, MCP and SCAD outperform LASSO and ENET for β-specificity, and LASSO and ENET perform better than MCP and SCAD with respect to the mean squared error. We find similar results when applying the penalized regression methods to the prediction of crime incidents using community-based data.

Prediction of stress intensity factor range for API 5L grade X65 steel by using GPR and MPMR

  • Murthy, A. Ramachandra;Vishnuvardhan, S.;Saravanan, M.;Gandhi, P.
    • Structural Engineering and Mechanics
    • /
    • v.81 no.5
    • /
    • pp.565-574
    • /
    • 2022
  • The infrastructures such as offshore, bridges, power plant, oil and gas piping and aircraft operate in a harsh environment during their service life. Structural integrity of engineering components used in these industries is paramount for the reliability and economics of operation. Two regression models based on the concept of Gaussian process regression (GPR) and Minimax probability machine regression (MPMR) were developed to predict stress intensity factor range (𝚫K). Both GPR and MPMR are in the frame work of probability distribution. Models were developed by using the fatigue crack growth data in MATLAB by appropriately modifying the tools. Fatigue crack growth experiments were carried out on Eccentrically-loaded Single Edge notch Tension (ESE(T)) specimens made of API 5L X65 Grade steel in inert and corrosive environments (2.0% and 3.5% NaCl). The experiments were carried out under constant amplitude cyclic loading with a stress ratio of 0.1 and 5.0 Hz frequency (inert environment), 0.5 Hz frequency (corrosive environment). Crack growth rate (da/dN) and stress intensity factor range (𝚫K) values were evaluated at incremental values of loading cycle and crack length. About 70 to 75% of the data has been used for training and the remaining for validation of the models. It is observed that the predicted SIF range is in good agreement with the corresponding experimental observations. Further, the performance of the models was assessed with several statistical parameters, namely, Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Coefficient of Efficiency (E), Root Mean Square Error to Observation's Standard Deviation Ratio (RSR), Normalized Mean Bias Error (NMBE), Performance Index (ρ) and Variance Account Factor (VAF).

Prediction of Future Sea Surface Temperature around the Korean Peninsular based on Statistical Downscaling (통계적 축소법을 이용한 한반도 인근해역의 미래 표층수온 추정)

  • Ham, Hee-Jung;Kim, Sang-Su;Yoon, Woo-Seok
    • Journal of Industrial Technology
    • /
    • v.31 no.B
    • /
    • pp.107-112
    • /
    • 2011
  • Recently, climate change around the world due to global warming has became an important issue and damages by climate change have a bad effect on human life. Changes of Sea Surface Temperature(SST) is associated with natural disaster such as Typhoon and El Nino. So we predicted daily future SST using Statistical Downscaling Method and CGCM 3.1 A1B scenario. 9 points of around Korea peninsular were selected to predict future SST and built up a regression model using Multiple Linear Regression. CGCM 3.1 was simulated with regression model, and that comparing Probability Density Function, Box-Plot, and statistical data to evaluate suitability of regression models, it was validated that regression models were built up properly.

  • PDF