• Title/Summary/Keyword: Chi Square Statistics

Search Result 639, Processing Time 0.026 seconds

Issue Word Extraction Using Chi-square Statistics (카이제곱 통계량을 이용한 이슈 단어 추출)

  • Shin, Junsoo
    • Annual Conference on Human and Language Technology
    • /
    • 2014.10a
    • /
    • pp.225-227
    • /
    • 2014
  • 최근 온라인 뉴스는 대중의 관심사 및 트렌드에 따라서 다양한 종류의 기사들이 작성된다. 이러한 관심사 및 트렌드는 시간의 흐름에 따라 계속 변한다. 본 논문에서는 온라인 뉴스의 기사 제목을 이용하여 시간에 따라 변하는 관심사 및 트렌드와 관련된 단어를 추출하는 방법을 제안한다. 특정 기간 별 출현하는 뉴스들을 하나의 카테고리로 가정하고 자질 선택 방법에서 널리 사용되는 카이제곱 통계량을 이용하여 각 카테고리의 주요 단어를 추출한다. 실험 결과 특정 기간 별 관심사 및 트렌드와 관련된 단어들이 출현하는 것을 확인하였다.

  • PDF

Comparing More than Two Agreement Measures Using Marginal Association

  • Oh, Myong-Sik
    • Communications for Statistical Applications and Methods
    • /
    • v.16 no.6
    • /
    • pp.1023-1029
    • /
    • 2009
  • Oh (2009) has proposed a likelihood ratio test for comparing two agreements for dependent observations based on the concept of marginal homogeneity and marginal stochastic ordering. In this paper we consider the comparison of more than two agreement measures. Simple ordering and simple tree ordering among agreement measures are investigated. Some test procedures, including likelihood ratio test, are discussed.

Statistical Inference Concerning Peakedness Ordering between Two Symmetric Distributions

  • Oh, Myong-Sik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.15 no.1
    • /
    • pp.201-210
    • /
    • 2004
  • The peakedness ordering is closely related to dispersive ordering. In this paper we consider the statistical inference concerning peakedness ordering between two arbitrary symmetric distributions. Nonparametric maximum likelihood estimates of two distribution functions under symmetry and peakedness ordering are given. The likelihood ratio test for equality of two symmetric discrete distributions in the sense of peakedness ordering is studied.

  • PDF

An approach to improving the Lindley estimator

  • Park, Tae-Ryoung;Baek, Hoh-Yoo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.6
    • /
    • pp.1251-1256
    • /
    • 2011
  • Consider a p-variate ($p{\geq}4$) normal distribution with mean ${\theta}$ and identity covariance matrix. Using a simple property of noncentral chi square distribution, the generalized Bayes estimators dominating the Lindley estimator under quadratic loss are given based on the methods of Brown, Brewster and Zidek for estimating a normal variance. This result can be extended the cases where covariance matrix is completely unknown or ${\Sigma}={\sigma}^2I$ for an unknown scalar ${\sigma}^2$.

The Comparison of Variables between Therapy Continuity Group and Therapy Discontinuity Group of Patients With Hypertension and Diabetes in Daegu Initiative (심뇌혈관질환 고위험군 중 치료연속자와 치료불연속자 간의 특성 비교)

  • Park, Jeong-Sook;Kwon, Young-Sook;Oh, Yun-Jung
    • Korean Journal of Health Education and Promotion
    • /
    • v.26 no.2
    • /
    • pp.135-148
    • /
    • 2009
  • Objectives: The purpose of this study is to identify the characteristics between therapy continuity group and therapy discontinuity group and to develop management program for Korean patients with hypertension and diabetes. Methods: The subject of the study were 109 therapy continuity and 66 therapy discontinuity of Korea hypertension diabetes Daegu initiative. The data collection was performed from December 5 to December 30, 2008. Analysis of data was done by using descriptive statistics, chi-square test, t-test and ANCOVA with SPSS program. Results: 1) The groups were significantly correlated with such variables systolic BP(F=4.518, p=0.035) and diastolic BP(F=17.793, p=0.000). 2) The groups with hypertensive were significantly correlated with such variables perceived susceptibility of disease($\chi^2$=25.053, p=0.000), perceived barrier of health behavior($\chi^2$=12.584, p=0.006), drinking($\chi^2$=27.545, p=0.000), diet($\chi^2$=8.645, p=0.013), regular taking medicine($\chi^2$=92.415, p=0.000) and regular measurement of BP($\chi^2$=6.045, p=0.049). 3) The groups with diabetic were significantly correlated with such variables perceived seriousness of disease($\chi^2$=6.128, p=0.047), perceived susceptibility of disease($\chi^2$=8.079, p=0.018), health knowledge and attitude(F=8.418, p=0.006), drinking($\chi^2$=6.276, p=0.043), diet($\chi^2$=7.275, p=0.026), regular taking medicine($\chi^2$=33.083, p=0.000) and regular measurement of glucose($\chi^2$=7.233, p=0.027). Conclusion: The above findings indicate that it is necessary to develop and apply special management programs according to the therapy discontinuity group.

Empirical Analysis on Rao-Scott First Order Adjustment for Two Population Homogeneity test Based on Stratified Three-Stage Cluster Sampling with PPS

  • Heo, Sunyeong
    • Journal of Integrative Natural Science
    • /
    • v.7 no.3
    • /
    • pp.208-213
    • /
    • 2014
  • National-wide and/or large scale sample surveys generally use complex sample design. Traditional Pearson chi-square test is not appropriate for the categorical complex sample data. Rao-Scott suggested an adjustment method for Pearson chi-square test, which uses the average of eigenvalues of design matrix of cell probabilities. This study is to compare the efficiency of Rao-Scott first order adjusted test to Wald test for homogeneity between two populations using 2009 Gyeongnam regional education offices's customer satisfaction survey (2009 GREOCSS) data. The 2009 GREOCSS data were collected based on stratified three-stage cluster sampling with probability proportional to size. The empirical results show that the Rao-Scott adjusted test statistic using only the variances of cell probabilities is very close to the Wald test statistic, which uses the covariance matrix of cell probabilities, under the 2009 GREOCSS data based. However it is necessary to be cautious to use the Rao-Scott first order adjusted test statistic in the place of Wald test because its efficiency is decreasing as the relative variance of eigenvalues of the design matrix of cell probabilities is increasing, specially more when the number of degrees of freedom is small.

The Relationship of Alcohol Use Disorders and Depression, Qualty of Life in the Eldery (노인의 알코올 사용장애에 따른 우울, 삶의 질과의 관계)

  • Oh, Chung-Uk;Kim, Seon-Rye
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.15 no.1
    • /
    • pp.196-201
    • /
    • 2014
  • This study intends to investigate alcohol use disorder in the elderly in rural area. The subjects were 212 elderly people. Alcohol use disorder was defined as a score of more than 10 points on the AUDIT-K. The collected data were analyzed descriptive statistics, chi-square test and t-test using SPSS 19.0 program. The alcohol use disorder in the elderly in rural area were 18.4%. The alcohol use disorder was statistically significant difference according to gender, age, inmate, scholarship, religion, job and smoking. The alcohol use disorder correlated positively with depression. To prepare the aging society, the government should make preparation prorgram for elderly alcoholics.

The Cosmetic Purchase Behavior of Women in Their 20s (I) - Focused on Consumption Value - (20대 여성의 화장품 구매행동에 관한 연구 (I) - 소비가치를 중심으로 -)

  • Park, Kwanghee
    • Journal of the Korean Society of Costume
    • /
    • v.67 no.3
    • /
    • pp.47-65
    • /
    • 2017
  • The purpose of this study was to divide respondents by consumption values, and to examine the differences in their cosmetic purchase behavior. Cosmetic purchase behavior consisted of variables such as purchase frequency, purchase amount, place of purchase, purchase reason, reason for using cosmetics, purchase propensity, degree of using information source, and selection criteria. A survey was conducted with 308 women between the ages 20 and 29 from December 5th to 10th 2016. Data collected from the respondents through an Internet survey were analyzed using descriptive statistics, factor analyses, cluster analysis, analyses of variance and chi-square tests. Four consumption value dimensions emerged that were termed emotional, differentiated individuality-pursuing, functional and social value. The respondents were classified into three groups(emotional consumer group, functional consumer group, active consumer group) by cluster analysis using four dimensions of consumption value. The results of the analyses of variance and chi-square tests showed significant differences in purchase frequency, place of purchase, purchase reason, reason for using cosmetics, degree of using information source and selection criteria among groups classified by consumption value. However, there were no differences in purchase amount and purchase propensity among them.

An Assessment of Statistical Validity of Articles Published in the Journal of Korean Acupuncture & Moxibusition Society - from 1984 to 2002 - (대한침구학회지 논문의 통계적 오류에 관한 연구)

  • Lee, Seung-deok
    • Journal of Acupuncture Research
    • /
    • v.21 no.1
    • /
    • pp.176-188
    • /
    • 2004
  • This study was carried out to investigate statistical validity of medical articles that used various statistical techniques such as t-test, analysis of variance, correlation analysis, regression analysis and chi-square test. For study 429 original articles using those statistical methods were selected from Journal of Korean Acupuncture & Moxibusition Society published from 1984 to 2002. 429 original articles were reviewed to analyzed the statistical procedures. Results are summarized as follows : 1. In this study 93 articles(21.68%) of 429 ones didn't report statement of statistical method in detail. 2. 53 articles(12.53%) didn't report p-value in correctly, and 245 articles(57.11 %) used mean${\pm}$standard error (Mean${\pm}$SEM.) and 109 articles used mean${\pm}$standard deviation(Mean${\pm}$SD.). All of 23 articles using nonparametric statistical techniques made an error to central tendency or dispersion. 3. 175 articles(59.93%) and 14 articles(4.79%) of 292 ones made an error to description of equal variances and normal distribution. 4. 99 articles(50%) of 185 ones misused t-test and 4 articles of 5 ones misused chi-square test. 5. 28 articles(73.68%) of 38 ones using discrete variable misused parametric technique such as t-test or ANOVA. 2 articles and 1 article of 125 ones choosing paired samples misused independent t-test and Mann-Whitney U test. 6. 20 articles using analysis of variance didn't use multiple comparison.

  • PDF

The cosmetic buying behavior of women in their 20s - Focused on differences by cosmetic involvement - (20대 여성의 화장품 구매행동에 관한 연구 - 화장품 관여도에 따른 차이를 중심으로 -)

  • Park, Kwanghee;Choi, Mi-Hwa
    • The Research Journal of the Costume Culture
    • /
    • v.27 no.6
    • /
    • pp.569-581
    • /
    • 2019
  • This study investigated differences in cosmetic buying behavior and personal characteristics between cosmetic involvement groups. Cosmetics buying behavior refers to reason for using cosmetics, use of information sources, selection criteria, place of purchase, use/non-use of cosmetics, purchase propensity, purchase frequency, purchase amount, and satisfaction with cosmetics. Personal characteristic contains pursuing image, age, residence area, job, and average household monthly income. Data was collected from 5-10 December 2016, from 308 females in their 20s using an internet survey. The analysis included descriptive statistics, t-tests, Mann-Whitney U tests, and chi-square tests. The respondents were divided into two groups (a high cosmetic involvement group and a low cosmetic involvement group) according to the degree of cosmetic involvement. The results of t-tests revealed significant differences between groups in terms of reasons for using cosmetics, use of information sources, selection criteria, purchase frequency, place of purchase, use/non-use of cosmetics, and satisfaction with cosmetics. The results of Mann-Whitney U tests highlighted a significant difference in purchase frequency between both groups. The results of chi-square tests indicated significant differences in purchase frequency, purchase amount, pursuing image, and average household monthly income. However, no significant differences were evident in terms of purchase propensity, age, job, and area of residence between groups.