• Title/Summary/Keyword: statistical potential

Search Result 1,044, Processing Time 0.026 seconds

Outlier tests on potential outliers (잠재적 이상치군에 대한 검정)

  • Seo, Han Son
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.1
    • /
    • pp.159-167
    • /
    • 2017
  • Observations identified as potential outliers are usually tested for real outliers; however, some outlier detection methods skip a formal test or perform a test using simulated p-values. We introduce test procedures for outliers by testing subsets of potential outliers rather than by testing individual observations of potential outliers to avoid masking or swamping effects. Examples to illustrate methods and a Monte Carlo study to compare the power of the various methods are presented.

A Review of Statistical Methods in the Journal of Oriental Obstetrics & Gynecology (대한한방부인과학회지에서 사용된 통계방법에 관한 연구)

  • Kim, Yoon-Sang;Oh, Hyun-Sook;Lim, Eun-Mee
    • The Journal of Korean Obstetrics and Gynecology
    • /
    • v.25 no.1
    • /
    • pp.70-78
    • /
    • 2012
  • Objectives: The purpose of this article is not until to investigate the changes and types of statistical methods and to point out the statistical errors after analyzing the method of articles that improve the quality of the statistical analysis of papers published in the Journal of Oriental Obstetrics and Gynecology. Methods: Papers published in the Journal of Oriental Obstetrics and Gynecology from 2009 to 2011 were reviewed for methodological and statistical validity using a modified version of Ahn's checklist. A statistician reviewed individual papers and evaluated the list items in the checklist for each paper. To avoid the potential assessment error by the statistician who lacks expertise in the field of Oriental Obstetrics and Gynecology. Results: A total of 190 papers including 64 original articles, 40 reviews article, 58 case report and 28 brief communication were reviewed. Statistics methods used in 121 papers were composed of t-test(58.7%), ANOVA test(19.8%) and ${\chi}^2$- test (14.0%) et al. Whereas only 14.9% of papers were free of statistical errors, the number of omission errors was 58 and the number of commission errors was 149 each. Conclusions: A variety of statistical errors were encountered in papers published in the Journal of Oriental Obstetrics and Gynecology. Accordingly, researchers should be more careful when it comes to describing and applying statistical methods.

A Statistical Analysis of JERS L-band SAR Backscatter and Coherence Data for Forest Type Discrimination

  • Zhu Cheng;Myeong Soo-Jeong
    • Korean Journal of Remote Sensing
    • /
    • v.22 no.1
    • /
    • pp.25-40
    • /
    • 2006
  • Synthetic aperture radar (SAR) from satellites provides the opportunity to regularly incorporate microwave information into forest classification. Radar backscatter can improve classification accuracy, and SAR interferometry could provide improved thematic information through the use of coherence. This research examined the potential of using multi-temporal JERS-l SAR (L band) backscatter information and interferometry in distinguishing forest classes of mountainous areas in the Northeastern U.S. for future forest mapping and monitoring. Raw image data from a pair of images were processed to produce coherence and backscatter data. To improve the geometric characteristics of both the coherence and the backscatter images, this study used the interferometric techniques. It was necessary to radiometrically correct radar backscatter to account for the effect of topography. This study developed a simplified method of radiometric correction for SAR imagery over the hilly terrain, and compared the forest-type discriminatory powers of the radar backscatter, the multi-temporal backscatter, the coherence, and the backscatter combined with the coherence. Statistical analysis showed that the method of radiometric correction has a substantial potential in separating forest types, and the coherence produced from an interferometric pair of images also showed a potential for distinguishing forest classes even though heavily forested conditions and long time separation of the images had limitations in the ability to get a high quality coherence. The method of combining the backscatter images from two different dates and the coherence in a multivariate approach in identifying forest types showed some potential. However, multi-temporal analysis of the backscatter was inconclusive because leaves were not the primary scatterers of a forest canopy at the L-band wavelengths. Further research in forest classification is suggested using diverse band width SAR imagery and fusing with other imagery source.

Application of Statistical and Machine Learning Techniques for Habitat Potential Mapping of Siberian Roe Deer in South Korea

  • Lee, Saro;Rezaie, Fatemeh
    • Proceedings of the National Institute of Ecology of the Republic of Korea
    • /
    • v.2 no.1
    • /
    • pp.1-14
    • /
    • 2021
  • The study has been carried out with an objective to prepare Siberian roe deer habitat potential maps in South Korea based on three geographic information system-based models including frequency ratio (FR) as a bivariate statistical approach as well as convolutional neural network (CNN) and long short-term memory (LSTM) as machine learning algorithms. According to field observations, 741 locations were reported as roe deer's habitat preferences. The dataset were divided with a proportion of 70:30 for constructing models and validation purposes. Through FR model, a total of 10 influential factors were opted for the modelling process, namely altitude, valley depth, slope height, topographic position index (TPI), topographic wetness index (TWI), normalized difference water index, drainage density, road density, radar intensity, and morphological feature. The results of variable importance analysis determined that TPI, TWI, altitude and valley depth have higher impact on predicting. Furthermore, the area under the receiver operating characteristic (ROC) curve was applied to assess the prediction accuracies of three models. The results showed that all the models almost have similar performances, but LSTM model had relatively higher prediction ability in comparison to FR and CNN models with the accuracy of 76% and 73% during the training and validation process. The obtained map of LSTM model was categorized into five classes of potentiality including very low, low, moderate, high and very high with proportions of 19.70%, 19.81%, 19.31%, 19.86%, and 21.31%, respectively. The resultant potential maps may be valuable to monitor and preserve the Siberian roe deer habitats.

New approach to calculate Weibull parameters and comparison of wind potential of five cities of Pakistan

  • Ahmed Ali Rajput;Muhammad Daniyal;Muhammad Mustaqeem Zahid;Hasan Nafees;Misha Shafi;Zaheer Uddin
    • Advances in Energy Research
    • /
    • v.8 no.2
    • /
    • pp.95-110
    • /
    • 2022
  • Wind energy can be utilized for the generation of electricity, due to significant wind potential at different parts of the world, some countries have already been generating of electricity through wind. Pakistan is still well behind and has not yet made any appreciable effort for the same. The objective of this work was to add some new strategies to calculate Weibull parameters and assess wind energy potential. A new approach calculates Weibull parameters; we also developed an alternate formula to calculate shape parameters instead of the gamma function. We obtained k (shape parameter) and c (scale parameter) for two-parameter Weibull distribution using five statistical methods for five different cities in Pakistan. Maximum likelihood method, Modified Maximum likelihood Method, Method of Moment, Energy Pattern Method, Empirical Method, and have been to calculate and differentiate the values of (shape parameter) k and (scale parameter) c. The performance of these five methods is estimated using the Goodness-of-Fit Test, including root mean square error, mean absolute bias error, mean absolute percentage error, and chi-square error. The daily 10-minute average values of wind speed data (obtained from energydata.info) of different cities of Pakistan for the year 2016 are used to estimate the Weibull parameters. The study finds that Hyderabad city has the largest wind potential than Karachi, Quetta, Lahore, and Peshawar. Hyderabad and Karachi are two possible sites where wind turbines can produce reasonable electricity.

A Statistical-Mechanical Analysis of One-Dimensional Fluid of Rigid Rods (딱딱한 막대 모양 분자로 이루어진 1차원 유체의 통계 역학적 분석)

  • Lim, Kyung-Hee
    • Journal of the Korean Applied Science and Technology
    • /
    • v.26 no.1
    • /
    • pp.45-50
    • /
    • 2009
  • Three-dimensional, statistical-mechanical formulations of problems are usually untractable analytically, and therefore they are commonly solved numerically. However, their one-dimensional counterparts are always to be solved analytically. In general analytical solutions sheds more insights to the problems than numerical solutions. Hence, solutions of one-dimensional problems may provide key properties to the problems, when they are extended to three dimensions. In this article, thermodynamic properties of one-dimensional fluid comprising molecules of rigid rods are analyzed statistical-mechanically. Molecules of rigid rods are characterized with repulsive or excluded volume effect. It is observed that this feature is well reflected in thermodynamic functions such as Helmholtz free energy. volumetric equation of state. chemical potential, entropy, etc.

A Study on the Group Sequential Methods for Comparing Survival Distributions in Clinical Trials

  • Jae Won Lee
    • Communications for Statistical Applications and Methods
    • /
    • v.5 no.2
    • /
    • pp.459-475
    • /
    • 1998
  • In many clinical trials, we are interested in comparing the failure time distribution of different treatment groups. Because of ethical and economic reasons, clinical trials need to be monitored for early dramatic benefits or potential harmful effects. Prior knowledge, evolving knowledge, statistical considerations, medical judgment and ethical principles are all involved in the decision to terminate a trial early, and thus the monitoring is usually carried out by an independent scientific committee. This paper reviews the recently proposed group sequential testing procedures for clinical trials with survival data. Design considerations of such clinical trials are also discussed. This paper compares the characteristics of each of these methods and provides the biostatisticians with the guidelines for choosing the appropriate group sequential methods in a given situation.

  • PDF

Statistical Techniques for Automatic Indexing and Some Experiments with Korean Documents (자동색인의 통계적기법과 한국어 문헌의 실험)

  • Chung Young Mee;Lee Tae Young
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.9
    • /
    • pp.99-118
    • /
    • 1982
  • This paper first reviews various techniques proposed for automatic indexing with special emphasis placed on statistical techniques. Frequency-based statistical techniques are categorized into the following three approaches for further investigation on the basis of index term selection criteria: term frequency approach, document frequency approach, and probabilistic approach. In the experimental part of this study, Pao's technique based on the Goffman's transition region formula and Harter's 2-Poisson distribution model with a measure of the potential effectiveness of index term were tested. Experimental document collection consists of 30 agriculture-related documents written in Korean. Pao's technique did not yield good result presumably due to the difference in word usage between Korean and English. However, Harter's model holds some promise for Korean document indexing because the evaluation result from this experiment was similar to that of the Harter's.

  • PDF

Dynamic graphic features in S-PLUS and XLISP-STAT (S-PLUS와 XLISP-STAT의 다이나믹그래픽 기능)

  • 김철웅;서한손
    • The Korean Journal of Applied Statistics
    • /
    • v.6 no.1
    • /
    • pp.23-28
    • /
    • 1993
  • The increase in computing power and the decrease in price of computers has enabled statistical computer graphics to progress tremendously in recent years. Many people can now access to the newly developed computer graphical methods easily. The direct manipulation on screen and the symultaneous realization of the results are two main ingradients of dynamic graphics. We compare the dynamic graphical features in two relatively new packages; SPLUS and XLISP-STAT. XLISP-STAT is very lean packed with powerful dynamic graphical tools. The statistical computer graphics, being still in the state of infancy, has a lot of room to grow, and is a new research area with a great potential.

  • PDF

Generalized half-logistic Poisson distributions

  • Muhammad, Mustapha
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.4
    • /
    • pp.353-365
    • /
    • 2017
  • In this article, we proposed a new three-parameter distribution called generalized half-logistic Poisson distribution with a failure rate function that can be increasing, decreasing or upside-down bathtub-shaped depending on its parameters. The new model extends the half-logistic Poisson distribution and has exponentiated half-logistic as its limiting distribution. A comprehensive mathematical and statistical treatment of the new distribution is provided. We provide an explicit expression for the $r^{th}$ moment, moment generating function, Shannon entropy and $R{\acute{e}}nyi$ entropy. The model parameter estimation was conducted via a maximum likelihood method; in addition, the existence and uniqueness of maximum likelihood estimations are analyzed under potential conditions. Finally, an application of the new distribution to a real dataset shows the flexibility and potentiality of the proposed distribution.