• Title/Summary/Keyword: Spearman correlation coefficients

Search Result 135, Processing Time 0.025 seconds

Secure Multi-Party Computation of Correlation Coefficients (상관계수의 안전한 다자간 계산)

  • Hong, Sun-Kyong;Kim, Sang-Pil;Lim, Hyo-Sang;Moon, Yang-Sae
    • Journal of KIISE
    • /
    • v.41 no.10
    • /
    • pp.799-809
    • /
    • 2014
  • In this paper, we address the problem of computing Pearson correlation coefficients and Spearman's rank correlation coefficients in a secure manner while data providers preserve privacy of their own data in distributed environment. For a data mining or data analysis in the distributed environment, data providers(data owners) need to share their original data with each other. However, the original data may often contain very sensitive information, and thus, data providers do not prefer to disclose their original data for preserving privacy. In this paper, we formally define the secure correlation computation, SCC in short, as the problem of computing correlation coefficients in the distributed computing environment while preserving the data privacy (i.e., not disclosing the sensitive data) of multiple data providers. We then present SCC solutions for Pearson and Spearman's correlation coefficients using secure scalar product. We show the correctness and secure property of the proposed solutions by presenting theorems and proving them formally. We also empirically show that the proposed solutions can be used for practical applications in the performance aspect.

Digital Convergence Teaching Strategy System using Spearman Correlation Coefficients (스피어만 상관계수를 이용한 디지털 융합 강의 전략 시스템)

  • Lee, Byung-Wook
    • Journal of Internet Computing and Services
    • /
    • v.11 no.6
    • /
    • pp.111-122
    • /
    • 2010
  • Since educating digital convergence is to unite various sciences and technologies with computer as the central figure, it has different range and methods of education. Therefore, it has problems with recommending limited conceptual information because of difficulties to standardize education plan and teaching strategies. In this paper, I propose education plan and teaching strategy system by using Spearman correlation coefficients. This system is to find a solution against disadvantage of recommending limited conceptual information by ranking relations of teaching strategies from the information based on the demand of industrial and academic fields, and then provides lists of teaching strategy information suitable for user's atmosphere and characteristics. Performance test is to compare effects of precision and recall with existing service systems. The test shows 90.4% of precision and 77.6% of recall.

Parametric and Non Parametric Measures for Text Similarity (텍스트 유사성을 위한 파라미터 및 비 파라미터 측정)

  • Mlyahilu, John;Kim, Jong-Nam
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.20 no.4
    • /
    • pp.193-198
    • /
    • 2019
  • The wide spread of genuine and fake information on internet has lead to various studies on text analysis. Copying and pasting others' work without acknowledgement, research results manipulation without proof has been trending for a while in the era of data science. Various tools have been developed to reduce, combat and possibly eradicate plagiarism in various research fields. Text similarity measurements can be manually done by using both parametric and non parametric methods of which this study implements cosine similarity and Pearson correlation as parametric while Spearman correlation as non parametric. Cosine similarity and Pearson correlation metrics have achieved highest coefficients of similarity while Spearman shown low similarity coefficients. We recommend the use of non parametric methods in measuring text similarity due to their non normality assumption as opposed to the parametric methods which relies on normality assumptions and biasness.

Reproducibility of a food frequency questionnaire: Korea Nurses' Health Study

  • Song, Sihan;Kim, Bohye;Pang, Yanghee;Kim, Oksoo;Lee, Jung Eun
    • Nutrition Research and Practice
    • /
    • v.16 no.1
    • /
    • pp.106-119
    • /
    • 2022
  • BACKGROUND/OBJECTIVES: This study aimed to examine the reproducibility of food frequency questionnaires (FFQs) designed for young female nurses in the Korea Nurses' Health Study. SUBJECTS/METHODS: The reproducibility of web-based, self-administered FFQs was evaluated among 243 Korean female nurses. The first FFQ (FFQ1) was administered from March 2014 to February 2019 and the second FFQ (FFQ2) from November 2019, with a mean interval of 2.8 years between the FFQs (range, 9 months-5.6 years). Pearson and Spearman correlation coefficients (r values) and quartile agreements between FFQ1 and FFQ2 were calculated for intakes of energy, nutrients, and foods. RESULTS: Pearson correlation coefficients ranged from 0.41 to 0.55 (median r = 0.51) for energy and raw nutrients and from 0.16 to 0.46 (median r = 0.36) for energy-adjusted nutrients. Spearman correlation coefficients ranged from 0.25 to 0.72 (median r = 0.41) for food items. The percentages of women who were classified into the same or adjacent quartile were 77% to 84% (median = 82%) for raw nutrients and 69% to 86% (median = 78%) for foods. CONCLUSIONS: The results indicated that the web-based FFQ used in the Korea Nurses' Health Study has acceptable reproducibility.

Improvement of User's Context Aware and Characteristic Process using spearman correlation coefficients (스피어만 장관계수를 이용한 사용자 상황 및 특성 처리 개선)

  • Ahn, Chan-Shik;Oh, Sang-Yeob
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.10
    • /
    • pp.1444-1452
    • /
    • 2010
  • There is very little information on mobile terminal service systems such as CRUMPET because the all the users have different situations and characteristic, and so it is also difficult to find correlations. Because of the difficulty of customizing and recommending information based on preference stemming from the users' various situations and characteristics, they usually provide limited, conceptual information. This paper will recommend a system that recommends information tailored to the user's situation and characteristics, using the Spearman correlation coefficients. It finds correlations from users' information and sequences information that is suitable to the user's situation and characteristics into a list, thereby solving the problem of limited, conceptual information. Performance tests have revealed when compared to existing service systems, this system is more effective in terms of precision and recall, with a 92.3% precision rate, and a 73.8% recall rate.

Correlation Analysis of General Parameters and Metals in the Lake Sediments of Geum River Basin

  • Lee, Jun-Bae;Cho, Yoon-Hae;Huh, In-Ae;Khan, Jong-Beom;Oh, Da-Yeon;Yang, Yoon-Mo;Gil, Gi-Beom;Lee, Soo-Hyung;Cheon, Se-Yeok;Lee, Bo-Mi
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.50 no.6
    • /
    • pp.684-696
    • /
    • 2017
  • An investigation of grain size, organic compounds and metal distribution in lakes from Geum river basin (Republic of Korea) was conducted in two years (2014 and 2015). The samples of sediment were collected from the 3 lakes (12 sites). The samples were analyzed the concentration of metals (Pb, Zn, Cu, Cr, Ni, As, Cd, Hg, Al, and Li) and general indices including grain size. Spearman correlation coefficients were determined using general indices and metal concentrations respectively. The organic qualities of sediments were improved in 2015 compared with 2014. The concentrations of metals were lower than Sediment Criteria of Lakes in Korea. The significant Spearman correlation coefficients were presented only sand-clay, clay-water content, COD-TOC, Cu-Ni, Cd-Li, Zn-Li, and Cr-Ni of general and metal parameters in 2014, 2015 and both of two years.

Improvement of the Semantic Information Retrieval using Ontology and Spearman Correlation Coefficients (온톨로지 기술과 스피어만 상관계수를 적용한 시맨틱 정보 검색 향상)

  • Lee, Byungwook
    • Journal of Digital Convergence
    • /
    • v.11 no.11
    • /
    • pp.351-357
    • /
    • 2013
  • Information retrieval by query keywords have some mismatching problems to fit user's requirement for the retrieved documents due to the varieties of users. These problems are originated from the different situations and characteristics of user's requirement. Also, it has a problem that general correlation coefficients did not display the information relations. In this thesis, it is to suggest knowledge retrieval system to verify feasibility of personnel selection procedure and results supporting selection rules after construction of personnel selection ontologies and rules composed of various concept and knowledge based on the semantic web technology. In the suggested system, it is to clear disadvantages of limited information retrieval providing the suitable information to satisfy user's different situations and characteristics using Spearman's coefficients. Experimental results by this semantic-based information retrieval show 90.3% of accuracy and 71.8% of recall compared with legacy keyword information retrieval.

A Study on Validity of a Semi-Quantitative Food Frequency Questionnaire for Korean Adults (성인의 식이섭취 조사를 위한 반정량 식품섭취빈도조사지의 타당도 연구 -건강증진센터 내원 성인을 대상으로 -)

  • Shim, Jee-Seon;Oh, Kyung-Won;Suh, Il;Kim, Mi-Yang;Sohn, Chun-Young;Lee, Eun-Joo;Nam, Chung-Mo
    • Korean Journal of Community Nutrition
    • /
    • v.7 no.4
    • /
    • pp.484-494
    • /
    • 2002
  • This study was conducted to validate the semi-quantitative food frequency questionnaire that was developed to assess the intakes of fatty acids, as well as energy, carbohydrates, fat, protein, minerals and vitamins in Korean adults. The validity of the semi-quantitative food frequency questionnaire was tested on 78 subjects (31 men,47 women) aged 34 to 66 years. The semi-quantitative food frequency questionnaire included 93 food items and was validated on two 3-day dietary records. The mean intakes and the Spearman Correlation Coefficients between the semi-quantitative food frequency questionnaire and the two 3-day dietary records were analyzed for each nutrient and food group level. The mean nutrient intakes obtained from the semi-quantitative food frequency questionnaire were estimated to be greater than those of the two 3-day dietary records. The Spearman Correlation Coefficients between the energy-adjusted nutrient intakes from the semi-quantitative food frequency questionnaire and the two 3-day dietary records ranged from 0.24 for polyunsaturated fatty acids to 0.55 for fat in men and from 0.29 for polyunsaturated fatty acids to 0.55 for saturated fatty acids in women, respectively. The Spearman Correlation Coefficients for food intake ranged from 0.11 for teas and beverages to 0.58 for grains and their products in men,-0.04 for potatoes and starches to 0.73 for milk and dairy products in women. Foods consumed regularly had lower intra-person variation and tended to have higher observed correlation coefficients. These results indicate that the semi-quantitative food frequency questionnaire is a useful tool for estimating nutrient intakes, particularly of total fat and saturated fatty acid intakes.

Estimation of the Exhaust Characteristics of Biodiesel Used in Diesel Engine (디젤엔진에서 바이오디젤의 배기가스 특성 평가)

  • Baek, Seok Heum;Yoon, Jeong Hwan;Jung, Woo Sung;Ha, Hyeong Soo;Chung, Sung Sik;Yeom, Jeong Kuk
    • Transactions of the Korean Society of Mechanical Engineers B
    • /
    • v.38 no.2
    • /
    • pp.129-137
    • /
    • 2014
  • In this study, the characteristics of exhaust gas as a function of the biodiesel mixing ratio were investigated. Diesel and waste oil were used for preparing mixed fuel, and the ratios of the mixed fuel were varied in the BD3~BD100 range. The injection pressures(${\Delta}p_{inj}$) was considered as an experimental variable and was set to 400 bar, 600 bar, 800 bar, 1000 bar, and 1200 bar. Furthermore, for quantitatively analyzing the characteristics of exhaust gas(NOx and Soot), the concepts of Pearson correlation coefficient and Spearman rank-order correlation coefficient based on statistics were introduced. Consequently, it was found that the correlation of the emission of NOx and Soot is linear, and the Pearson and Spearman coefficients are -0.732 and -0.724, respectively, under all analysis conditions. Especially, for the injection pressure of 800 bar, a simultaneous reduction in NOx and Soot emission is possible by controlling the biodiesel mixing ratio. This is because the correlation coefficients of NOx and Soot emissions were nearly 0, as the Pearson correlation coefficient was -0.089.

Validity and Reliability of a Dish-based, Semi-quantitative Food Frequency Questionnaire for Korean Diet and Cancer Research

  • Park, Min-Kyung;Noh, Hwa-Young;Song, Na-Yeun;Paik, Hee-Young;Park, So-Hee;Joung, Hyo-Jee;Song, Won-O;Kim, Jeong-Seon
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.13 no.2
    • /
    • pp.545-552
    • /
    • 2012
  • This study evaluated the validity and reliability of applying a newly developed dish-based, semi-quantitative food frequency questionnaire (FFQ) for Korean diet and cancer research. The subjects in the present study were 288 Korean adults over 30 years of age who had completed two FFQs and four 3-day diet records (DRs) from May 2008 to February 2009. Student's t-tests, Chi-square tests, and Spearman's rank correlation coefficients were used to estimate and compare intakes from different dietary assessment tools. Agreement in quintiles was calculated to validate agreement between the results of the second FFQ (FFQ-2) conducted in February 2009 and the DRs. Median Spearman's correlation coefficients between the intake of nutrients and foods assessed by the FFQ-1 and FFQ-2 were 0.59 and 0.57, respectively, and the coefficients between the intake of nutrients and foods assessed by the FFQ-2 and the DRs were 0.31 and 0.29, respectively. The quintile classifications of same or adjacent quintile for intake of nutrients and foods were 64% and 65%, respectively. Misclassification into opposite quintiles occurred in less than 5% for all dietary factors. Thus this newly-developed, Korean dish-based FFQ demonstrated moderate correspondence with the four 3-day DRs. Its reliability and validity are comparable to those reported in other studies.