• Title/Summary/Keyword: Anderson-Darling

Search Result 64, Processing Time 0.035 seconds

Future projections of extreme precipitation by using CMIP6 database at finer scales over South Korea (CMIP6 기후변화 자료를 이용한 국내 미래 극한강우의 예측)

  • Kim, Jongho;Van Doi, Manh
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2021.06a
    • /
    • pp.368-368
    • /
    • 2021
  • 기후 변화로 인한 극한사상의 크기와 빈도 변화를 예측하는 것은 수공 인프라 설계에 있어 주된 관심사 중 하나이다. 보통 극한사상에 대한 강도, 빈도, 지속시간에 대한 정보가 필요하며, 이는 일반적으로 IDF(Intensity-Duration-Frequency) 곡선으로부터 추출된다. 최근 CMIP(Coupled Model Intercomparison Project) 6단계에서 새로운 이산화탄소 배출 시나리오와 업데이트된 기후모델을 이용하여 미래의 기후에 대한 예측 시계열을 발표했으므로, 미래 기후 변화 시나리오를 기반으로 IDF 곡선을 새로 추정하고 미래 기간의 변화를 평가할 필요가 있다. 본 연구에서는 한국의 40개 지역에 대해 일단위 자료를 시단위로 축소(downscaling)한 후, 확률론적 일기생성기(stochastic weather generator)를 이용하여 30년 시단위 시계열을 100개의 앙상블로 생성하였다. 생성된 시계열로부터 연최대강수량 시계열을 재구성하여 GEV 분포와 gumbel 분포에 적용하였다. 적합도 검정(Anderson-Darling(AD) 검정 및 Kolmogorov-Smirnov(KS) 검정)을 수행하였으며, 과거 자료를 기반으로 생성된 IDF 곡선과 비교 검증하였다. CMIP5의 기후변화 자료를 사용한 결과와 CMIP6 기후변화의 결과를 비교하였으며, 본 연구의 주요 결과는 다음과 같다. (1) 향후 강우 강도는 증가할 것이며 강우 강도의 증가는 말기에 현저하게 관찰될 것이다. (2) 시간별 강우 강도의 미래 변화가 일단위 강우 강도보다 더 크다. (3) 강우 강도의 불확실성을 정량화하기 위해 앙상블을 사용해야 한다. (4) 강우 강도의 미래 변화에 대한 공간적인 경향이 확인된다. 시단위 시계열 앙상블을 생성하여 추정된 IDF 곡선에 대한 정보는 기후 변화의 영향을 평가하고 적절한 적응 및 대응 전략을 개발하는 데 도움이 될 것이다.

  • PDF

Application of deterministic models for obtaining groundwater level distributions through outlier analysis

  • Dae-Hong Min;Saheed Mayowa Taiwo;Junghee Park;Sewon Kim;Hyung-Koo Yoon
    • Geomechanics and Engineering
    • /
    • v.35 no.5
    • /
    • pp.499-509
    • /
    • 2023
  • The objective of this study is to perform outlier analysis to obtain the distribution of groundwater levels through the best model. The groundwater levels are measured in 10, 25 and 30 piezometers in Seoul, Daejeon and Suncheon in South Korea. Fifty-eight empirical distribution functions were applied to determine a suitable fit for the measured groundwater levels. The best fitted models based on the measured values are determined as the Generalized Pareto distribution, the Johnson SB distribution and the Normal distribution for Seoul, Daejeon and Suncheon, respectively; the reliability is estimated through the Anderson-Darling method. In this study, to choose the appropriate confidence interval, the relationship between the amount of outlier data and the confidence level is demonstrated, and then the 95% is selected at a reasonable confidence level. The best model shows a smaller error ratio than the GEV while the Mahalanobis distance and outlier labelling methods results are compared and validated. The outlier labelling and Mahalanobis distance based on median shown higher validated error ratios compared to their mean equivalent suggesting, the methods sensitivity to data structure.

Verification of Reliability by the Induced Voltage of a Downscaled and Simulated 22.9kV-Y Distribution Line (축소 모의된 22.9 kV-Y 배전선로의 유도 전압에 대한 신뢰성 검증)

  • Kim, Jeom-Sik;Choi, Chung-Seog
    • Journal of the Korean Society of Safety
    • /
    • v.30 no.4
    • /
    • pp.26-31
    • /
    • 2015
  • The purpose of this paper is to measure the induced voltage of the downscaled and simulated overhead ground wire of a 22.9kV-Y distribution line. This study performed a test of the downscaled and simulated distribution line according to whether it is grounded or not and the value of the ground resistance. In order to verify the reliability of the data measured by the test, the data was analyzed using the Minitab 17 program. It was found that the induced voltage of the downscaled and simulated distribution line is influenced by the value of the ground resistance. It was also found that the ground resistance obtained at a certain point is closely related to whether electric poles are grounded or not. The analysis results of the measured test data with a statistical method showed that the Anderson Darling (AD) was analyzed to be the smallest as 0.188 when the ground resistance of the electric poles had been maintained at $10{\Omega}$. In addition, the P value analyzed to be 0.894 which is in the proximity of the theoretical value of 1 and verified the reliability of the test data. It could be seen that the data measured by the downscaled simulation test forms a linear graph. It is thought that if a distribution line is installed in the same manner as the downscaled, simulated distribution line, the mean induced voltage will be reduced and reliability will be increased.

Statistical Probability Analysis of Storage Temperatures of Domestic Refrigerator as a Risk Factor of Foodborne Illness Outbreak (식중독 발생 위해인자로서 가정용 냉장고의 온도에 대한 확률분포 분석)

  • Bahk, Gyung-Jin
    • Korean Journal of Food Science and Technology
    • /
    • v.42 no.3
    • /
    • pp.373-376
    • /
    • 2010
  • The objective of this study was to present the proper probability distribution model based on the data obtained from surveys on domestic refrigerator food storage temperatures in home. Domestic refrigerator temperatures were determined as risk factors in foodborne disease outbreaks for microbial risk assessment (MRA). The temperature was measured by directly visiting 139 homes using a data logger from May to September of 2009. The overall mean temperature for all the refrigerators in the survey was $3.53{\pm}2.96^{\circ}C$, with 23.6% of the refrigerators measuring above $5^{\circ}C$. Probability distributions were also created using @RISK program based on the measured temperature data. Statistical ranking was determined by the goodness of fit (GOF, i.e., the Kolmogorov-Smirnov (KS) or Anderson-Darling (AD) test) to determine the proper probability distribution model. This result showed that the LogLogistic (-10.407, 13.616, 8.6107) distribution was found to be the most appropriate for the MRA model. The results of this study might be directly used as input variables in exposure evaluation for conducting MRA.

Estimation of sewer deterioration by Weibull distribution function (와이블 분포함수를 이용한 하수관로 노후도 추정)

  • Kang, Byongjun;Yoo, Soonyu;Park, Kyoohong
    • Journal of Korean Society of Water and Wastewater
    • /
    • v.34 no.4
    • /
    • pp.251-258
    • /
    • 2020
  • Sewer deterioration models are needed to forecast the remaining life expectancy of sewer networks by assessing their conditions. In this study, the serious defect (or condition state 3) occurrence probability, at which sewer rehabilitation program should be implemented, was evaluated using four probability distribution functions such as normal, lognormal, exponential, and Weibull distribution. A sample of 252 km of CCTV-inspected sewer pipe data in city Z was collected in the first place. Then the effective data (284 sewer sections of 8.15 km) with reliable information were extracted and classified into 3 groups considering the sub-catchment area, sewer material, and sewer pipe size. Anderson-Darling test was conducted to select the most fitted probability distribution of sewer defect occurrence as Weibull distribution. The shape parameters (β) and scale parameters (η) of Weibull distribution were estimated from the data set of 3 classified groups, including standard errors, 95% confidence intervals, and log-likelihood values. The plot of probability density function and cumulative distribution function were obtained using the estimated parameter values, which could be used to indicate the quantitative level of risk on occurrence of CS3. It was estimated that sewer data group 1, group 2, and group 3 has CS3 occurrence probability exceeding 50% at 13th-year, 11th-year, and 16th-year after the installation, respectively. For every data groups, the time exceeding the CS3 occurrence probability of 90% was also predicted to be 27th- to 30th-year after the installation.

Reference Intervals from Hospital-Based Data for Hematologic and Serum Chemistry Values in Dogs (병원자료에 근거한 혈액 및 혈액화학 검사항목의 참고구간 설정)

  • Kwon, Young-Wook;Pak, Son-Il
    • Journal of Veterinary Clinics
    • /
    • v.27 no.1
    • /
    • pp.66-70
    • /
    • 2010
  • Reference interval is critical for interpreting laboratory results, monitoring response to therapy and predicting the prognosis of the patients in clinical settings. The aim of the present study was to update established reference intervals for routine hematologic and serum chemistry values for a population of clinically healthy dogs (range, 1-8 years) seen in an animal hospital. Blood was obtained by venipuncture while animals were physically restrained, and samples were analyzed for 9 chemistries on MS9-5H (Melot Schloesing Lab, France) and 6 hematology on Vet Test 8008 (IDEXX, USA). Data from 105 dogs (52 males and 53 females) for hematology and 113 dogs (37 males and 76 females) for chemistry were used to determine reference intervals using the parametric, nonparametric and bootstrap methods. Prior to analysis, all parameters were tested for normal distribution using Anderson-Darling criterion. Of the 9 biochemical analytes, alkaline phosphatase, alanine aminotransferase, aspartate aminotransferase, creatinine, total protein, and glucose concentrations did not fit normal distribution for both original and transformed data. All but eosinophil count satisfied normal distribution for either original or transformed data. Parametric method can be used for original cholesterol concentrations, RBC, WBC, and neutrophil counts. This technique can also be used for power-transformed values of blood urea nitrogen concentrations and for logarithm of lymphocyte and monocyte counts. Non-parametric or bootstrap method was the preferred choice for the remaining 7 biochemical parameters and eosinophil count as they did not follow normal distributions. All three statistical techniques performed in similar reference intervals. When establishing reference intervals for clinical laboratory data, it is essential to assess the distribution of the original data to increase the accuracy of the interval, and non-parametric or bootstrap methods are of alternative for the data that do not fit normal distribution.

Review on the Effectiveness of Apartments According to Insulation Reinforcement of Energy Saving Design Standard - Energy Cost Analysis of Space Heating and Domestic Hot Water - (에너지절약설계기준의 단열 변화에 따른 공동주택 실효성 검토 - 난방 및 급탕에너지 사용금액 분석 -)

  • Kim, Ji-hyeon;Lim, Hee-won;Lee, Hyun-seung;Shin, U-cheul
    • Journal of the Architectural Institute of Korea Structure & Construction
    • /
    • v.36 no.1
    • /
    • pp.173-178
    • /
    • 2020
  • In this study, we analyzed space heating and domestic hot water(DHW) charges from 2017 to 2018 for 151,206 households in 202 apartment complexes built from 1997 to 2016, in order to evaluate the effectiveness of Korea's insulation standards reinforcement. The applied insulation standards were revised three times(in 1987, 2001 and 2010). We used the Anderson-Darling test to review the normality of the space heating and DHW charges. As a result, each p-value was greater than 0.05 and it followed the normal distribution. The annual average space heating charges per unit area of apartments with the 1987 insulation standards were 5,237₩/㎡a in 2017 and 5,328₩/㎡a in 2018. The heating charges with the 2001 standards were 4,827₩/㎡a in 2017 and 4,817₩/㎡a in 2018. Compared to previous standards(1987 standards), heating charges decreased by 7.8% and 9.6%. Also the heating charges with the 2010 standards were 3,683₩/㎡a in 2017 and 3,734₩/㎡a in 2018. Compared to previous standards(2001 standards), heating charges decreased by 24.3% and 21.5%. On the other hand, compared to each previous standards, DHW charges were decreased by 12.2%, 13.6% or increased by 4.6%, 6.9%, which means there's no correlation between insulation standards reinforcement and DHW charges.

A survey on the Minimum Time Scale by Southern Region of the Korean Peninsula for Daily SPI Application (일 단위 SPI 적용을 위한 한반도 남부지역별 최소 시간 척도 조사)

  • Chae Lim Lee;Ji Yu Seo;Jeong Eun Won;Sang Dan Kim
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.328-328
    • /
    • 2023
  • 표준강수지수(Standardized Precipitation Index, SPI)는 강수량 변동의 정도를 표준화하여 나타낸 지수로, 가뭄 평가에 적용되고 있다. 일반적으로 SPI를 산정할 때는 월 단위의 시간 척도를 적용하며, 장기간의 가뭄에 대해 평가한다. 그러나 시간 척도가 길어질수록 가뭄 발생 후 가뭄을 감지하는 데 걸리는 시간이 더 길어지기 때문에 대처가 더욱 어려워진다. 또한, 기후변화로 인해 가뭄 빈도가 증가하고, 그 정도가 더욱 심화되면서 일 단위의 적용이 필요해지고 있다. 본 연구는 한반도 남부지역을 대상으로 일 단위의 SPI 적용을 위한 최소 시간 척도를 조사하였다. 대상 지역을 강원권, 수도권, 부울경, 대경권, 호남권, 충청권의 총 6개 지역으로 분리하여, 각 지역별, 계절별 최소 시간 척도를 조사하였다. SPI 산정을 위해 후보 분포형으로 Gumbel, Gamma, GEV, Loglogistic, Lognormal, Weibull을 적용하였으며, 시간 척도는 5일부터 365일까지 총 10개로 설정하였다. 본 연구에선 크게 적합도 검정과 정규성 검정으로 진행하였다. 적합도 검정에서는 Chi-square test를 적용하였으며, 이때 일 단위의 짧은 시간 척도를 적용할 경우 누가 강수 시계열의 값이 0으로, 0값이 시계열에 포함되면 SPI의 정확도가 떨어지는 문제가 발생하는데, 이를 보완하기 위해 누가 강수 시계열의 0값을 고려하였다. 마지막으로 각 후보 분포형을 적용하여 산정된 SPI가 표준정규분포에 합당한지를 검증하기 위해 Anderson-Darling test를 수행하였다. 결과적으로 대부분의 지역에서는 봄과 여름의 경우 최소 15일 정도의 시간 척도까지는 적용할 수 있을 것으로 판단되며, 겨울의 경우는 최소 30일 정도의 시간 척도를 적용해야 함을 확인하였다. 지역별로 차이가 크진 않지만, 이러한 연구 결과를 참고하여 각 지역별로 더 나은 가뭄 대책을 마련할 수 있을 것으로 기대된다.

  • PDF

Development of drought frequency analysis program (가뭄빈도해석 프로그램 개발)

  • Lee, Jeong Ju;Kang, Shin Uk;Chun, Gun Il;Kim, Hyeon Sik
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2020.06a
    • /
    • pp.14-14
    • /
    • 2020
  • 일반적으로 수문빈도해석은 치수계획 수립에 이용되는 설계강수량, 계획홍수량 등을 산정하기 위해 연최대치계열 또는 연초과치계열 자료를 이용한 극치빈도해석을 수행하고, 확률분포의 우측꼬리(right tail) 부분을 이용하여 확장된 재현기간에 해당하는 확률수문량을 추정한다. 하지만 가뭄 관련 분석에서는 확률분포의 좌측꼬리(left tail) 부분은 이용해 확장된 재현기간별 확률수문량을 추정해야할 경우가 발생한다. 또한 물관리 실무에서 장 단기 운영계획 수립을 위해 이용하는 갈수빈도 유입량 산정 등에서도 평년보다 작은 수문량에 대한 빈도해석이 필요한 경우가 있다. 국가 가뭄정보분석센터에서는 기존에 K-water연구원에서 개발한 빈도해석 프로그램인 K-FAT의 분석모듈을 이용해 극소치계열 또는 갈수빈도 유입량 분석에 특화된 가뭄빈도해석 프로그램을 개발하였다. 본 프로그램은 GEV, Gumbel, Weibull 등 14개의 확률분포형을 포함하며, 모멘트법, 최우도법 및 L-모멘트법을 사용하여 매개변수를 추정한다. 적합도 검정의 경우 χ2, K-S, CVM, PPCC 및 수정 Anderson-Darling test를 이용하여 다각적인 검정을 할 수 있도록 하였다. 분석을 위한 입력 자료의 경우 사용자가 전처리를 통해 준비한 연최소치계열 등 연도별 시계열자료를 이용할 수 있으며, 일단위 및 월단위의 강수량 또는 댐 유입량 자료를 이용해 사용자가 원하는 기간의 누적강수량, 평균 유입량으로 변환할 수 있는 자료변환 기능을 추가하여 실무 활용성을 높였다. 또한 최적 확률분포 선정을 위해 참고할 수 있도록 AIC(Akaike information criteria)와 BIC(Bayesian information criteria) 분석이 포함되어 있으며, Bootstrap 기법 등을 이용한 불확실성 산정을 통해 추정 값의 신뢰구간을 표시하도록 하였다. 개발된 프로그램은 베타버전 시험배포를 거쳐 가뭄정보포털을 통해 배포할 예정이다.

  • PDF

Assessment of compressive strength of high-performance concrete using soft computing approaches

  • Chukwuemeka Daniel;Jitendra Khatti;Kamaldeep Singh Grover
    • Computers and Concrete
    • /
    • v.33 no.1
    • /
    • pp.55-75
    • /
    • 2024
  • The present study introduces an optimum performance soft computing model for predicting the compressive strength of high-performance concrete (HPC) by comparing models based on conventional (kernel-based, covariance function-based, and tree-based), advanced machine (least square support vector machine-LSSVM and minimax probability machine regressor-MPMR), and deep (artificial neural network-ANN) learning approaches using a common database for the first time. A compressive strength database, having results of 1030 concrete samples, has been compiled from the literature and preprocessed. For the purpose of training, testing, and validation of soft computing models, 803, 101, and 101 data points have been selected arbitrarily from preprocessed data points, i.e., 1005. Thirteen performance metrics, including three new metrics, i.e., a20-index, index of agreement, and index of scatter, have been implemented for each model. The performance comparison reveals that the SVM (kernel-based), ET (tree-based), MPMR (advanced), and ANN (deep) models have achieved higher performance in predicting the compressive strength of HPC. From the overall analysis of performance, accuracy, Taylor plot, accuracy metric, regression error characteristics curve, Anderson-Darling, Wilcoxon, Uncertainty, and reliability, it has been observed that model CS4 based on the ensemble tree has been recognized as an optimum performance model with higher performance, i.e., a correlation coefficient of 0.9352, root mean square error of 5.76 MPa, and mean absolute error of 4.1069 MPa. The present study also reveals that multicollinearity affects the prediction accuracy of Gaussian process regression, decision tree, multilinear regression, and adaptive boosting regressor models, novel research in compressive strength prediction of HPC. The cosine sensitivity analysis reveals that the prediction of compressive strength of HPC is highly affected by cement content, fine aggregate, coarse aggregate, and water content.