• Title/Summary/Keyword: Chi-Square Test for Association

An Empirical Study of Qualities of Association Rules from a Statistical View Point

  • Dorn, Maryann;Hou, Wen-Chi;Che, Dunren;Jiang, Zhewei
    • Journal of Information Processing Systems / v.4 no.1 / pp.27-32 / 2008
  • Minimum support and confidence have been used as criteria for generating association rules in all association rule mining algorithms. These criteria have natural appeal, such as simplicity, and few researchers have questioned the quality of the generated rules. In this paper, we examine the rules from a more rigorous point of view by conducting statistical tests. Specifically, we use contingency tables and chi-square tests to analyze the data. Experimental results show that one third of the association rules derived from the support and confidence criteria are not significant; that is, the antecedent and consequent of the rules are not correlated. This indicates that minimum support and minimum confidence do not provide adequate discovery of meaningful associations. The chi-square test can be considered as an enhancement or an alternative solution.
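
The point made in this abstract can be illustrated with a minimal sketch. It assumes a single rule A -> B summarized as a 2x2 contingency table; the function name `chi_square_2x2` and all counts are hypothetical and are not taken from the paper. The example shows a rule with high support and confidence whose antecedent and consequent are nevertheless independent.

```python
import math

def chi_square_2x2(both, a_only, b_only, neither):
    """Pearson chi-square test of independence for a 2x2 table (df = 1)."""
    n = both + a_only + b_only + neither
    row_a, row_not_a = both + a_only, b_only + neither
    col_b, col_not_b = both + b_only, a_only + neither
    denom = row_a * row_not_a * col_b * col_not_b
    if denom == 0:
        raise ValueError("every row and column total must be positive")
    # Shortcut formula for 2x2 tables: n * (ad - bc)^2 / (product of margins).
    stat = n * (both * neither - a_only * b_only) ** 2 / denom
    # For df = 1 the chi-square upper tail equals erfc(sqrt(stat / 2)).
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value

# Hypothetical counts: 1000 transactions, A in 800, B in 800, both in 640.
both, a_only, b_only, neither = 640, 160, 160, 40
n = both + a_only + b_only + neither
support = both / n                    # share of transactions containing A and B
confidence = both / (both + a_only)   # share of A-transactions that contain B
stat, p_value = chi_square_2x2(both, a_only, b_only, neither)
print(f"support={support:.2f} confidence={confidence:.2f} "
      f"chi2={stat:.3f} p={p_value:.3f}")
```

Here the rule would pass typical support and confidence thresholds, yet the confidence equals the overall frequency of B, so the chi-square statistic gives no evidence of association.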

Clinical Analysis of Symptoms and Oriental Medical Prescriptions According to Elapsed Time of Stroke in Oriental Medical Hospital Inpatients

  • Yun, Hen-Ja;Sung, Kang-Keyng
    • Herbal Formula Science / v.20 no.1 / pp.133-147 / 2012
  • Objectives : This study was intended to characterize symptoms, oriental medicine prescriptions and laboratory test results according to the elapsed time of stroke. Methods : Using the medical records of 205 stroke inpatients admitted to an oriental medical hospital in 2010, we investigated manifested symptoms, administered oriental medicine prescriptions and clinical pathological examination results. Collected items were classified by stroke type: cerebral infarction and hemorrhage. We analyzed the association between elapsed time and the manifested symptoms, oriental medicine prescriptions, and laboratory test results of stroke patients. Chi-square tests were performed to determine the significance level of each association. Results : All symptoms, prescriptions and laboratory test results in cerebral infarction patients were associated with elapsed time. In particular, symptoms, prescriptions and pathological examination results showed very high statistical significance with elapsed time (symptoms: chi-square(df)=164.3(22), p<0.001; prescriptions: chi-square(df)=93.5(22), p<0.001; pathological examination results: chi-square(df)=164.3(22), p<0.0004). In the case of cerebral hemorrhage, however, there was no statistical significance. Conclusions : The elapsed time of stroke may be an essential factor in identifying symptoms and prescribing for stroke patients in oriental medical treatment.
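
As a rough check on the chi-square values quoted in this abstract, the upper-tail probability of a chi-square variable with an even number of degrees of freedom has a closed form, P(X > x) = exp(-x/2) * sum over k from 0 to df/2 - 1 of (x/2)^k / k!. The sketch below evaluates it for df = 22; the function name `chi2_sf_even_df` is a hypothetical helper, not something used by the authors.

```python
import math

def chi2_sf_even_df(x, df):
    """Upper-tail probability P(X > x) for a chi-square variable with even df."""
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form only covers positive even df")
    if x < 0:
        raise ValueError("x must be non-negative")
    half = x / 2.0
    term, total = 1.0, 1.0            # k = 0 term of the finite sum
    for k in range(1, df // 2):       # k = 1 .. df/2 - 1
        term *= half / k              # builds (x/2)^k / k! incrementally
        total += term
    return math.exp(-half) * total

# Statistics and degrees of freedom as quoted in the abstract.
p_symptoms = chi2_sf_even_df(164.3, 22)
p_prescriptions = chi2_sf_even_df(93.5, 22)
print(p_symptoms < 0.001, p_prescriptions < 0.001)
```

Both tail probabilities fall far below 0.001, which is consistent with the significance levels reported.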

The Analysis of Association between Learning Styles and a Model of IoT-based Education : Chi-Square Test for Association

  • Sayassatov, Dulan;Cho, Namjae
    • Journal of Information Technology Applications and Management / v.27 no.3 / pp.19-36 / 2020
  • The Internet of Things (IoT) is a system of interrelated computing devices, digital machines and physical objects that are provided with unique identifiers and the ability to transmit data to people or machines (M2M) without requiring human interaction. IoT devices can be used to monitor and control the electrical and electronic systems used in fields such as smart homes, smart cities and smart healthcare. In this study we introduce four imaginary IoT devices as learning support assistants matched to students' dominant learning styles as measured by the Honey and Mumford Learning Styles: Activists, Reflectors, Theorists and Pragmatists. This research focuses on the association between students' strong learning styles and their preference for IoT devices with specific characteristics. Moreover, the different levels of IoT device architecture are explained, and all the artificial devices are designed on the basis of this structure. The experimental data were analyzed with the chi-square test for association. The results showed the statistical significance of the estimated model and the impact of each category on the model, yielding accurate estimates for the research variables. This study revealed the importance of considering students' dominant learning styles before inventing a new IoT device.

Application of Random Forests to Association Studies Using Mitochondrial Single Nucleotide Polymorphisms

  • Kim, Yoon-Hee;Kim, Ho
    • Genomics & Informatics / v.5 no.4 / pp.168-173 / 2007
  • In previous nuclear genomic association studies, Random Forests (RF), one of several up-to-date machine learning methods, has been used successfully to generate evidence of association of genetic polymorphisms with diseases or other phenotypes. Compared with traditional statistical analytic methods, such as chi-square tests or logistic regression models, the RF method has advantages in handling large numbers of predictor variables and examining gene-gene interactions without a specific model. Here, we applied the RF method to find the association between mitochondrial single nucleotide polymorphisms (mtSNPs) and diabetes risk. The results from a chi-square test validated the use of RF for association studies using mtDNA. Variable importance indexes such as the Gini index and the mean decrease in accuracy index performed well compared with chi-square tests in finding mtSNPs associated with a real disease example, type 2 diabetes.

Empirical Comparisons of Disparity Measures for Three Dimensional Log-Linear Models

  • Park, Y.S.;Hong, C.S.;Jeong, D.B.
    • Journal of the Korean Data and Information Science Society / v.17 no.2 / pp.543-557 / 2006
  • This paper is concerned with the applicability of the chi-square approximation to six disparity statistics: the Pearson chi-square, the generalized likelihood ratio, the power divergence, the blended weight chi-square, the blended weight Hellinger distance, and the negative exponential disparity statistic. Three dimensional contingency tables of small and moderate sample sizes are generated and fitted to all possible hierarchical log-linear models: the completely independent model, the conditionally independent model, the partial association models, and the model with one variable independent of the other two. For models with direct solutions of expected cell counts, point estimates and confidence intervals of the 90 and 95 percentage points of the six statistics are explored. For models without direct solutions, the empirical significance levels and the empirical powers of the six statistics for testing the significance of the three factor interaction are computed and compared.
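
Three of the statistics named in this abstract belong to one family. A minimal sketch of the Cressie-Read power divergence statistic follows, assuming strictly positive observed and expected counts with equal totals; the function name `power_divergence` and the counts are hypothetical. Setting the index `lam` to 1 gives the Pearson chi-square and the limit `lam` -> 0 gives the likelihood ratio statistic. The blended weight and negative exponential disparities are not covered here.

```python
import math

def power_divergence(observed, expected, lam):
    """Cressie-Read power divergence statistic for positive counts."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    if any(o <= 0 for o in observed) or any(e <= 0 for e in expected):
        raise ValueError("this sketch assumes strictly positive counts")
    pairs = list(zip(observed, expected))
    if lam == 0:
        # Limit lam -> 0: likelihood ratio statistic G^2.
        return 2.0 * sum(o * math.log(o / e) for o, e in pairs)
    if lam == -1:
        # Limit lam -> -1: modified likelihood ratio statistic.
        return 2.0 * sum(e * math.log(e / o) for o, e in pairs)
    scale = 2.0 / (lam * (lam + 1.0))
    return scale * sum(o * ((o / e) ** lam - 1.0) for o, e in pairs)

# Hypothetical cell counts and fitted values with equal totals (100 each).
observed = [18, 22, 30, 30]
expected = [25.0, 25.0, 25.0, 25.0]
pearson = power_divergence(observed, expected, 1)
likelihood_ratio = power_divergence(observed, expected, 0)
cressie_read = power_divergence(observed, expected, 2.0 / 3.0)
print(pearson, likelihood_ratio, cressie_read)
```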

Dissolved Oxygen Trend in Sapgyo Stream Watershed (삽교천유역의 용존산소 추세)

  • Rim, Chang-Soo
    • Journal of Korea Water Resources Association / v.46 no.6 / pp.667-681 / 2013
  • In this study, monthly and seasonal dissolved oxygen trends at 19 water quality measurement stations in the Sapgyo stream watershed were analyzed using monthly dissolved oxygen (DO) data measured over 16 years (1995~2010). The Mann-Kendall trend test and Sen's slope estimator were used for trend analysis. Furthermore, the Sapgyo stream watershed was divided into four sections (Sapgyo stream, Muhan stream, Gykgyo stream, and Sapgyo lake), and a chi-square test of homogeneity for DO trend was carried out across the four sections. The results indicated that most water quality measurement stations showed an increasing or non-significant DO trend on a monthly and seasonal basis. The chi-square test of homogeneity for each water quality measurement station showed statistical homogeneity in the seasonal DO trend; however, the test results showed statistical non-homogeneity in the monthly DO trend for the stations located in the reservoir. Overall, the dissolved oxygen trend at each water quality measurement station showed different patterns depending on the location of the station and the season.
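
The two trend tools named in this abstract can be sketched in a few lines. This is a simplified version that assumes an evenly spaced series and applies no correction for tied values; the function names and the DO readings are hypothetical and are not data from the study.

```python
import math
from statistics import median

def mann_kendall(values):
    """Mann-Kendall S statistic, normal score z and two-sided p-value."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two observations")
    s = 0
    for i in range(n - 1):
        for j in range(i + 1, n):
            diff = values[j] - values[i]
            s += (diff > 0) - (diff < 0)      # sign of each pairwise change
    var_s = n * (n - 1) * (2 * n + 5) / 18.0  # variance without tie correction
    if s > 0:
        z = (s - 1) / math.sqrt(var_s)        # continuity correction
    elif s < 0:
        z = (s + 1) / math.sqrt(var_s)
    else:
        z = 0.0
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return s, z, p_value

def sens_slope(values):
    """Sen's slope: median of all pairwise slopes, in units per time step."""
    n = len(values)
    slopes = [(values[j] - values[i]) / (j - i)
              for i in range(n - 1) for j in range(i + 1, n)]
    return median(slopes)

# Hypothetical yearly mean DO readings (mg/L) for one station.
do_series = [8.1, 8.4, 8.3, 8.9, 9.0, 9.4]
s, z, p_value = mann_kendall(do_series)
slope = sens_slope(do_series)
print(s, round(z, 3), round(p_value, 4), round(slope, 3))
```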

A Study on Sales Training of Clothing Companies (의류 판매원 교육실태에 관한 연구)

  • 김미숙;김보경
    • The Research Journal of the Costume Culture / v.7 no.4 / pp.155-167 / 1999
  • The present study investigated and compared the sales training programs used by apparel companies in order to provide information for developing effective training programs for professional salespeople. Sixty-eight companies were studied and grouped into four categories based on brand characteristics: domestic national brand (DNB), casual brand (CB), foreign brand (FB) and domestic designer brand (DDB). Data were collected from the managers in charge of training salespeople through questionnaires and through personal and telephone interviews. Data were collected during July 1998 and analyzed using ANOVA, Duncan's multiple range test, and the chi-square test. Since the sample size was small, Yates' correction was used to maximize statistical validity in the non-parametric chi-square test. The main purposes of sales training indicated by the companies were to satisfy customers and to maximize profit. Significant differences were found among the groups in the importance attached to training contents such as knowledge and customer relations, and in training methods, place, and duration/frequency of training at the training center.
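
Yates' correction, mentioned in this abstract, applies to 2x2 tables and shrinks the statistic when counts are small. A minimal sketch follows, assuming a 2x2 table with cells a, b (first row) and c, d (second row); the function name and the counts are hypothetical and are not survey data from the study.

```python
def chi_square_2x2_yates(a, b, c, d, correction=True):
    """Chi-square statistic for a 2x2 table, optionally with Yates' correction."""
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        raise ValueError("every row and column total must be positive")
    diff = abs(a * d - b * c)
    if correction:
        # Yates: reduce |ad - bc| by n/2, never letting it go below zero.
        diff = max(0.0, diff - n / 2.0)
    return n * diff ** 2 / denom

# Hypothetical small sample: two brand groups by whether a training method is used.
uncorrected = chi_square_2x2_yates(8, 2, 3, 7, correction=False)
corrected = chi_square_2x2_yates(8, 2, 3, 7, correction=True)
print(round(uncorrected, 4), round(corrected, 4))
```

With these counts the uncorrected statistic exceeds the 5% critical value for one degree of freedom (3.841) while the corrected one does not, which shows how conservative the correction can be in small samples.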

On spectral and statistical characteristics of sea waves by the typhoons (태풍에 의한 파랑의 스펙트럼 및 통계적 특성)

  • Shim, Jae-Seol;Oh, Byung-Chul;Kim, Sang-Ik
    • Water for future / v.22 no.4 / pp.441-451 / 1989
  • Using wave data from typhoons LEE, VERA and THELMA, which caused great damage on the Korean peninsula, the significant waves obtained by the zero-up and zero-down crossing methods and the Tucker-Draper method are compared with those from the wave energy spectrum. Histograms of individual waves obtained by the zero-up crossing method are presented and compared with the Rayleigh, Weibull, Gluhovski, Ibrageemov and Goda distributions, and the chi-square goodness of fit test is applied to each theoretical distribution. The significant wave heights from the zero-up crossing method agree very well with those from the energy spectrum method. By the chi-square test, the wave heights are found to follow the Rayleigh and Goda distributions well.
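
The goodness of fit step described in this abstract can be sketched for the Rayleigh case. The sketch assumes the Rayleigh wave height distribution written in terms of the root-mean-square height, P(H <= h) = 1 - exp(-(h / H_rms)^2), and a histogram whose last bin is open-ended. The function names, bin edges, counts and `h_rms` value are hypothetical and are not the typhoon data.

```python
import math

def rayleigh_cdf(h, h_rms):
    """Rayleigh distribution of wave heights, parameterized by RMS height."""
    return 1.0 - math.exp(-((h / h_rms) ** 2))

def rayleigh_expected_counts(lower_edges, h_rms, n):
    """Expected counts per bin; the last bin runs from its lower edge to infinity."""
    cdf = [rayleigh_cdf(edge, h_rms) for edge in lower_edges]
    cdf.append(1.0)
    return [n * (cdf[i + 1] - cdf[i]) for i in range(len(lower_edges))]

def chi_square_gof(observed, expected):
    """Pearson goodness of fit statistic: sum of (O - E)^2 / E over bins."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

# Hypothetical histogram of individual wave heights (m), 200 waves in total.
lower_edges = [0.0, 0.5, 1.0, 1.5, 2.0, 2.5]
observed = [20, 52, 60, 40, 20, 8]
h_rms = 1.5                      # assumed to be estimated from the same record
expected = rayleigh_expected_counts(lower_edges, h_rms, sum(observed))
stat = chi_square_gof(observed, expected)
df = len(observed) - 1 - 1       # bins - 1 - one estimated parameter
critical_5pct = 9.488            # chi-square 5% critical value for df = 4
print(round(stat, 3), df, stat < critical_5pct)
```

A statistic below the critical value means the Rayleigh fit is not rejected at the 5% level for this made-up histogram.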

The Association between Seven Health Practices and Self Rated Health by Sasang Constitution (사상체질별 7대 건강행위와 주관적 건강상태의 연관성)

  • Jang, Eun-Su;Kim, Yun-Young;Baek, Young-Hwa;Lee, Si-Woo
    • Journal of Sasang Constitutional Medicine / v.30 no.1 / pp.32-42 / 2018
  • This study aimed to examine the association between seven health practices and self-rated health by Sasang constitution. We recruited 367 subjects aged 30 to 59. The KS 15 questionnaire was used to classify Sasang constitution, and a visual analogue scale was used to estimate self-rated health. A chi-square test was used to examine differences in occupation distribution by Sasang constitution. ANOVA, t-tests and chi-square tests were also used to analyze differences in self-rated health between the health practice group and the non-health group within each Sasang constitution. SPSS 21.0K was used, and the significance threshold was p<.05. Regular morning meals, non-snacking, good sleeping and sufficient exercise were associated with higher self-rated health scores (p<.05). Regular morning meals, good sleeping and sufficient exercise were associated with higher scores in Tae-eumin (p<.05). Good sleeping was associated with higher self-rated health scores in Soeumin and Soyangin (p<.05). These results suggest that health practices for health promotion may differ according to Sasang constitution.

Socio-Demographic and Behavioural Risk Factors for Cervical Cancer and Knowledge, Attitude and Practice in Rural and Urban Areas of North Bengal, India

  • Raychaudhuri, Sreejata;Mandal, Sukanta
    • Asian Pacific Journal of Cancer Prevention / v.13 no.4 / pp.1093-1096 / 2012
  • Background: Cervical cancer is common among women worldwide. A multitude of risk factors aggravate the disease. This study was conducted to: (1) determine the prevalence and (2) make a comparative analysis of the socio-demographic and behavioural risk factors of cervical cancer and knowledge, attitude and practice (KAP) between rural and urban women of North Bengal, India. Study Design: Community-based cross-sectional study. Methods: A survey (the first in North Bengal) was conducted among 133 women in a rural area (Kawakhali) and 88 women in an urban slum (Shaktigarh) using predesigned semi-structured questionnaires. The respondents were informed of the causes (including HPV), signs and symptoms, prevention of cervical cancer and treatment, and the procedure of the PAP test and HPV vaccination. Results: The prevalence of risk factors like multiparity, early age of marriage, use of cloth during menstruation, use of condom and OCP, early age of first intercourse was 37.2%, 82%, 83.3%, 5.4%, 15.8% and 65.6% respectively. Awareness about the cause, signs and symptoms, prevention of cervical cancer, PAP test and HPV vaccination was 3.6%, 6.3%, 3.6%, 9.5% and 14.5% respectively. Chi-square testing revealed that, in the study population, significant differences at the 5% level exist between rural and urban residents with respect to number of children, use of cloth/sanitary napkins, family history of cancer and awareness regarding causes of cervical cancer. Regarding KAP, again using chi-square tests, level of education was surprisingly found to be significant for each element of KAP in urban areas, in contrast to the complete absence of association between education and elements of KAP in rural areas. Conclusions: A large number of risk factors were present in both areas, the prevalence being higher in the rural areas. The level of awareness and the role of education appear to be insignificant determinants in rural compared to urban areas. This pilot study needs to be followed up by large scale programmes to re-orient awareness campaigns, especially in rural areas.