• Title/Summary/Keyword: Chi-square statistics


Issue Word Extraction Using Chi-square Statistics

  • 신준수
    • 한국정보과학회 언어공학연구회:학술대회논문집(한글 및 한국어 정보처리)
    • /
    • 한국정보과학회언어공학연구회 2014년도 제26회 한글 및 한국어 정보처리 학술대회
    • /
    • pp.225-227
    • /
    • 2014
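  • Online news articles are written on a wide range of topics that follow public interests and trends, and these interests and trends keep changing over time. This paper proposes a method that uses online news headlines to extract words related to these time-varying interests and trends. The news articles appearing in a given period are treated as one category, and the chi-square statistic, which is widely used in feature selection, is used to extract the key words of each category. The experiments confirmed that the extracted words were related to the interests and trends of each period.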

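The period-as-category idea in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the common 2x2 chi-square formulation used in feature selection, and the function names (`chi_square_scores`, `top_words`) and the toy headlines are hypothetical.

```python
from collections import Counter


def chi_square_scores(docs_by_period):
    """Score (period, word) pairs with the 2x2 chi-square statistic.

    docs_by_period maps a period label to a list of tokenized titles.
    Each period is treated as one category (assumption of this sketch).
    """
    # Document frequency of each word within each period.
    df = {p: Counter(w for doc in docs for w in set(doc))
          for p, docs in docs_by_period.items()}
    n_docs = {p: len(docs) for p, docs in docs_by_period.items()}
    total = sum(n_docs.values())
    vocab = set().union(*df.values())

    scores = {}
    for p in docs_by_period:
        for w in vocab:
            a = df[p][w]                             # in period, contains word
            b = sum(df[q][w] for q in df if q != p)  # other periods, contains word
            c = n_docs[p] - a                        # in period, lacks word
            d = total - n_docs[p] - b                # other periods, lacks word
            denom = (a + c) * (b + d) * (a + b) * (c + d)
            chi2 = total * (a * d - c * b) ** 2 / denom if denom else 0.0
            # Keep only words that are over-represented in this period.
            if a * d > c * b:
                scores[(p, w)] = chi2
    return scores


def top_words(scores, period, k=3):
    """Return the k highest-scoring words for one period."""
    ranked = sorted(((s, w) for (p, w), s in scores.items() if p == period),
                    reverse=True)
    return [w for _, w in ranked[:k]]


# Hypothetical toy headlines, already tokenized.
titles = {
    "week1": [["election", "debate"], ["election", "poll"], ["weather", "rain"]],
    "week2": [["olympics", "gold"], ["olympics", "final"], ["weather", "sun"]],
}
scores = chi_square_scores(titles)
print(top_words(scores, "week1", k=1))  # ['election']
```

A word such as "weather", which appears equally often in both periods, gets no score for either period, which is the behaviour one would want from an issue-word extractor.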

Comparing More than Two Agreement Measures Using Marginal Association

  • Oh, Myong-Sik
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 16, No. 6
    • /
    • pp.1023-1029
    • /
    • 2009
  • Oh (2009) proposed a likelihood ratio test for comparing two agreement measures for dependent observations, based on the concepts of marginal homogeneity and marginal stochastic ordering. In this paper we consider the comparison of more than two agreement measures. Simple ordering and simple tree ordering among agreement measures are investigated. Some test procedures, including the likelihood ratio test, are discussed.

Statistical Inference Concerning Peakedness Ordering between Two Symmetric Distributions

  • Oh, Myong-Sik
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 15, No. 1
    • /
    • pp.201-210
    • /
    • 2004
  • The peakedness ordering is closely related to dispersive ordering. In this paper we consider the statistical inference concerning peakedness ordering between two arbitrary symmetric distributions. Nonparametric maximum likelihood estimates of two distribution functions under symmetry and peakedness ordering are given. The likelihood ratio test for equality of two symmetric discrete distributions in the sense of peakedness ordering is studied.


An approach to improving the Lindley estimator

  • Park, Tae-Ryoung;Baek, Hoh-Yoo
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 22, No. 6
    • /
    • pp.1251-1256
    • /
    • 2011
  • Consider a p-variate ($p{\geq}4$) normal distribution with mean ${\theta}$ and identity covariance matrix. Using a simple property of the noncentral chi-square distribution, generalized Bayes estimators that dominate the Lindley estimator under quadratic loss are given, based on the methods of Brown, Brewster and Zidek for estimating a normal variance. This result can be extended to the cases where the covariance matrix is completely unknown or ${\Sigma}={\sigma}^2I$ for an unknown scalar ${\sigma}^2$.

The Comparison of Variables between Therapy Continuity Group and Therapy Discontinuity Group of Patients With Hypertension and Diabetes in Daegu Initiative

  • 박정숙;권영숙;오윤정
    • 보건교육건강증진학회지
    • /
    • Vol. 26, No. 2
    • /
    • pp.135-148
    • /
    • 2009
  • Objectives: The purpose of this study is to compare the characteristics of the therapy continuity group and the therapy discontinuity group and to develop a management program for Korean patients with hypertension and diabetes. Methods: The subjects were 109 therapy continuity patients and 66 therapy discontinuity patients from the Korea hypertension and diabetes Daegu initiative. Data were collected from December 5 to December 30, 2008, and analyzed with descriptive statistics, the chi-square test, the t-test and ANCOVA using SPSS. Results: 1) Group membership was significantly associated with systolic BP (F=4.518, p=0.035) and diastolic BP (F=17.793, p=0.000). 2) Among hypertensive patients, group membership was significantly associated with perceived susceptibility to disease ($\chi^2$=25.053, p=0.000), perceived barriers to health behavior ($\chi^2$=12.584, p=0.006), drinking ($\chi^2$=27.545, p=0.000), diet ($\chi^2$=8.645, p=0.013), regularly taking medicine ($\chi^2$=92.415, p=0.000) and regular measurement of BP ($\chi^2$=6.045, p=0.049). 3) Among diabetic patients, group membership was significantly associated with perceived seriousness of disease ($\chi^2$=6.128, p=0.047), perceived susceptibility to disease ($\chi^2$=8.079, p=0.018), health knowledge and attitude (F=8.418, p=0.006), drinking ($\chi^2$=6.276, p=0.043), diet ($\chi^2$=7.275, p=0.026), regularly taking medicine ($\chi^2$=33.083, p=0.000) and regular measurement of glucose ($\chi^2$=7.233, p=0.027). Conclusion: These findings indicate that special management programs need to be developed and applied for the therapy discontinuity group.

Empirical Analysis on Rao-Scott First Order Adjustment for Two Population Homogeneity test Based on Stratified Three-Stage Cluster Sampling with PPS

  • Heo, Sunyeong
    • 통합자연과학논문집
    • /
    • Vol. 7, No. 3
    • /
    • pp.208-213
    • /
    • 2014
  • Nationwide and/or large-scale sample surveys generally use complex sample designs. The traditional Pearson chi-square test is not appropriate for categorical data from complex samples. Rao and Scott suggested an adjustment to the Pearson chi-square test that uses the average of the eigenvalues of the design matrix of cell probabilities. This study compares the efficiency of the Rao-Scott first-order adjusted test with that of the Wald test for homogeneity between two populations, using data from the 2009 Gyeongnam regional education offices' customer satisfaction survey (2009 GREOCSS). The 2009 GREOCSS data were collected by stratified three-stage cluster sampling with probability proportional to size. The empirical results show that, for the 2009 GREOCSS data, the Rao-Scott adjusted test statistic, which uses only the variances of the cell probabilities, is very close to the Wald test statistic, which uses the covariance matrix of the cell probabilities. However, caution is needed when using the Rao-Scott first-order adjusted test statistic in place of the Wald test, because its efficiency decreases as the relative variance of the eigenvalues of the design matrix of cell probabilities increases, especially when the number of degrees of freedom is small.
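The first-order adjustment described in the abstract above can be illustrated with a short sketch. It assumes the usual form of the correction, in which the Pearson statistic is divided by the mean eigenvalue; the function name `rao_scott_first_order` and the numeric inputs are hypothetical, and the paper's own estimation of the eigenvalues from survey data is not reproduced here.

```python
def rao_scott_first_order(pearson_chi2, eigenvalues):
    """Return the first-order adjusted statistic and the relative variance
    of the eigenvalues (squared coefficient of variation).

    Assumption of this sketch: the eigenvalues of the design matrix are
    already available as a plain list of positive numbers.
    """
    mean_eig = sum(eigenvalues) / len(eigenvalues)
    # First-order adjustment: scale the Pearson statistic by the mean eigenvalue.
    adjusted = pearson_chi2 / mean_eig
    # A large relative variance signals that the first-order adjustment
    # alone may be a poor substitute for the Wald test.
    rel_var = sum((e - mean_eig) ** 2 for e in eigenvalues) / (
        len(eigenvalues) * mean_eig ** 2
    )
    return adjusted, rel_var


# Hypothetical values, not taken from the 2009 GREOCSS data.
adjusted, rel_var = rao_scott_first_order(12.0, [1.5, 2.0, 2.5])
print(adjusted)  # 6.0
```

The adjusted statistic would then be referred to the same chi-square distribution as the unadjusted Pearson statistic. Reporting the relative variance alongside it mirrors the abstract's caution that the adjustment becomes less reliable as the eigenvalues spread out.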

The Relationship of Alcohol Use Disorders and Depression, Quality of Life in the Elderly

  • 오청욱;김선예
    • 한국산학기술학회논문지
    • /
    • Vol. 15, No. 1
    • /
    • pp.196-201
    • /
    • 2014
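  • This study aims to identify the level of alcohol use disorder and its related factors among community-dwelling older adults in rural areas, in order to provide baseline data for developing programs that reduce alcohol use disorder in this population. The data were analyzed with SPSS Version 19.0 using descriptive statistics, chi-square tests and t-tests. Alcohol use disorder among rural community-dwelling older adults differed significantly by sex, age, presence of a cohabitant, education, religion, occupation and smoking status. Alcohol use disorder was also significantly correlated with the level of depression. Alcohol use disorder among community-dwelling older adults should be recognized as a public health and social problem, and communities and the government should work to develop preventive programs and to encourage older adults to participate in them; this will lay the groundwork for an aging society.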

The Cosmetic Purchase Behavior of Women in Their 20s (I) - Focused on Consumption Value -

  • 박광희
    • 복식
    • /
    • Vol. 67, No. 3
    • /
    • pp.47-65
    • /
    • 2017
  • The purpose of this study was to divide respondents by consumption values and to examine the differences in their cosmetic purchase behavior. Cosmetic purchase behavior consisted of variables such as purchase frequency, purchase amount, place of purchase, purchase reason, reason for using cosmetics, purchase propensity, degree of using information sources, and selection criteria. A survey was conducted with 308 women between the ages of 20 and 29 from December 5 to 10, 2016. Data collected from the respondents through an Internet survey were analyzed using descriptive statistics, factor analyses, cluster analysis, analyses of variance and chi-square tests. Four consumption value dimensions emerged, which were termed emotional, differentiated individuality-pursuing, functional and social value. The respondents were classified into three groups (emotional consumer group, functional consumer group, active consumer group) by cluster analysis using the four dimensions of consumption value. The results of the analyses of variance and chi-square tests showed significant differences in purchase frequency, place of purchase, purchase reason, reason for using cosmetics, degree of using information sources and selection criteria among the groups classified by consumption value. However, there were no differences in purchase amount and purchase propensity among them.

An Assessment of Statistical Validity of Articles Published in the Journal of Korean Acupuncture & Moxibustion Society - from 1984 to 2002 -

  • 이승덕
    • Journal of Acupuncture Research
    • /
    • Vol. 21, No. 1
    • /
    • pp.176-188
    • /
    • 2004
  • This study investigated the statistical validity of medical articles that used statistical techniques such as the t-test, analysis of variance, correlation analysis, regression analysis and the chi-square test. A total of 429 original articles using those methods were selected from the Journal of Korean Acupuncture & Moxibustion Society, published from 1984 to 2002, and their statistical procedures were reviewed. The results are summarized as follows: 1. Of the 429 articles, 93 (21.68%) did not describe the statistical method in detail. 2. 53 articles (12.53%) did not report p-values correctly; 245 articles (57.11%) reported mean${\pm}$standard error (Mean${\pm}$SEM) and 109 articles reported mean${\pm}$standard deviation (Mean${\pm}$SD). All 23 articles using nonparametric techniques made an error in reporting central tendency or dispersion. 3. Of 292 articles, 175 (59.93%) and 14 (4.79%) made errors in describing equal variances and normal distribution, respectively. 4. 99 (50%) of 185 articles misused the t-test and 4 of 5 articles misused the chi-square test. 5. 28 (73.68%) of 38 articles with discrete variables misused parametric techniques such as the t-test or ANOVA. Of 125 articles with paired samples, 2 misused the independent t-test and 1 misused the Mann-Whitney U test. 6. 20 articles using analysis of variance did not use multiple comparisons.


The cosmetic buying behavior of women in their 20s - Focused on differences by cosmetic involvement -

  • 박광희;최미화
    • 복식문화연구
    • /
    • Vol. 27, No. 6
    • /
    • pp.569-581
    • /
    • 2019
  • This study investigated differences in cosmetic buying behavior and personal characteristics between cosmetic involvement groups. Cosmetic buying behavior refers to reason for using cosmetics, use of information sources, selection criteria, place of purchase, use/non-use of cosmetics, purchase propensity, purchase frequency, purchase amount, and satisfaction with cosmetics. Personal characteristics comprise pursued image, age, residence area, job, and average monthly household income. Data were collected from 5 to 10 December 2016 from 308 women in their 20s using an internet survey. The analysis included descriptive statistics, t-tests, Mann-Whitney U tests, and chi-square tests. The respondents were divided into two groups (a high cosmetic involvement group and a low cosmetic involvement group) according to their degree of cosmetic involvement. The results of the t-tests revealed significant differences between groups in terms of reasons for using cosmetics, use of information sources, selection criteria, purchase frequency, place of purchase, use/non-use of cosmetics, and satisfaction with cosmetics. The results of the Mann-Whitney U tests highlighted a significant difference in purchase frequency between the two groups. The results of the chi-square tests indicated significant differences in purchase frequency, purchase amount, pursued image, and average monthly household income. However, no significant differences were evident between groups in terms of purchase propensity, age, job, and area of residence.