• Title/Summary/Keyword: pearson correlation

Search Result 7,840, Processing Time 0.034 seconds

On the Study of Perfect Coverage for Recommender System

  • Lee, Hee-Choon;Lee, Seok-Jun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.4
    • /
    • pp.1151-1160
    • /
    • 2006
  • The similarity weight, the pearson's correlation coefficient, which is used in the recommender system has a weak point that it cannot predict all of the prediction value. The similarity weight, the vector similarity, has a weak point of the high MAE although the prediction coverage using the vector similarity is higher than that using the pearson's correlation coefficient. The purpose of this study is to suggest how to raise the prediction coverage. Also, the MAE using the suggested method in this study was compared both with the MAE using the pearson's correlation coefficient and with the MAE using the vector similarity, so was the prediction coverage. As a result, it was found that the low of the MAE in the case of using the suggested method was higher than that using the pearson's correlation coefficient. However, it was also shown that it was lower than that using the vector similarity. In terms of the prediction coverage, when the suggested method was compared with two similarity weights as I mentioned above, it was found that its prediction coverage was higher than that pearson's correlation coefficient as well as vector similarity.

  • PDF

A Study on the Maximizing Coverage for Recommender System

  • Lee, Hee-Choon;Lee, Seok-Jun;Park, Ji-Won;Kim, Chul-Seoung
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2006.11a
    • /
    • pp.119-128
    • /
    • 2006
  • The similarity weight, the pearson's correlation coefficient, which is used in the recommender system has a weak point that it cannot predict all of the prediction value. The similarity weight, the vector similarity, has a weak point of the high MAE although the prediction coverage using the vector similarity is higher than that using the pearson's correlation coefficient. The purpose of this study is to suggest how to raise the prediction coverage. Also, the MAE using the suggested method in this study was compared both with the MAE using the pearson's correlation coefficient and with the MAE using the vector similarity, so was the prediction coverage. As a result, it was found that the low of the MAE in the case of using the suggested method was higher than that using the pearson's correlation coefficient. However, it was also shown that it was lower than that using the vector similarity In terms of the prediction coverage, when the suggested method was compared with two similarity weights as I mentioned above, it was found that its prediction coverage was higher than that pearson's correlation coefficient as well as vector similarity.

  • PDF

Measure Correlation Analysis of Network Flow Based On Symmetric Uncertainty

  • Dong, Shi;Ding, Wei;Chen, Liang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.6 no.6
    • /
    • pp.1649-1667
    • /
    • 2012
  • In order to improve the accuracy and universality of the flow metric correlation analysis, this paper firstly analyzes the characteristics of Internet flow metrics as random variables, points out the disadvantages of Pearson Correlation Coefficient which is used to measure the correlation between two flow metrics by current researches. Then a method based on Symmetrical Uncertainty is proposed to measure the correlation between two flow metrics, and is extended to measure the correlation among multi-variables. Meanwhile, the simulation and polynomial fitting method are used to reveal the threshold value between different correlation degrees for SU method. The statistical analysis results on the common flow metrics using several traces show that Symmetrical Uncertainty can not only represent the correct aspects of Pearson Correlation Coefficient, but also make up for its shortcomings, thus achieve the purpose of measuring flow metric correlation quantitatively and accurately. On the other hand, reveal the actual relationship among fourteen common flow metrics.

On the Effect of Significance of Correlation Coefficient for Recommender System

  • Lee, Hee-Choon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.4
    • /
    • pp.1129-1139
    • /
    • 2006
  • Pearson's correlation coefficient and vector similarity are generally applied to The users' similarity weight of user based recommender system. This study is needed to find that the correlation coefficient of similarity weight is effected by the number of pair response and significance probability. From the classified correlation coefficient by the significance probability test on the correlation coefficient and pair of response, the change of MAE is studied by comparing the predicted precision of the two. The results are experimentally related with the change of MAE from the significant correlation coefficient and the number of pair response.

  • PDF

Secure Multi-Party Computation of Correlation Coefficients (상관계수의 안전한 다자간 계산)

  • Hong, Sun-Kyong;Kim, Sang-Pil;Lim, Hyo-Sang;Moon, Yang-Sae
    • Journal of KIISE
    • /
    • v.41 no.10
    • /
    • pp.799-809
    • /
    • 2014
  • In this paper, we address the problem of computing Pearson correlation coefficients and Spearman's rank correlation coefficients in a secure manner while data providers preserve privacy of their own data in distributed environment. For a data mining or data analysis in the distributed environment, data providers(data owners) need to share their original data with each other. However, the original data may often contain very sensitive information, and thus, data providers do not prefer to disclose their original data for preserving privacy. In this paper, we formally define the secure correlation computation, SCC in short, as the problem of computing correlation coefficients in the distributed computing environment while preserving the data privacy (i.e., not disclosing the sensitive data) of multiple data providers. We then present SCC solutions for Pearson and Spearman's correlation coefficients using secure scalar product. We show the correctness and secure property of the proposed solutions by presenting theorems and proving them formally. We also empirically show that the proposed solutions can be used for practical applications in the performance aspect.

Identifying Spatial Distribution Pattern of Water Quality in Masan Bay Using Spatial Autocorrelation Index and Pearson's r (공간자기상관 지수와 Pearson 상관계수를 이용한 마산만 수질의 공간분포 패턴 규명)

  • Choi, Hyun-Woo;Park, Jae-Moon;Kim, Hyun-Wook;Kim, Young-Ok
    • Ocean and Polar Research
    • /
    • v.29 no.4
    • /
    • pp.391-400
    • /
    • 2007
  • To identify the spatial distribution pattern of water quality in Masan Bay, Pearson's correlation as a common statistic method and Moran's I as a spatial autocorrelation statistics were applied to the hydrological data seasonally collected from Masan Bay for two years ($2004{\sim}2005$). Spatial distribution of salinity, DO and silicate among the hydrological parameters clustered strongly while chlorophyll a distribution displayed a weak clustering. When the similarity matrix of Moran's I was compared with correlation matrix of Pearson's r, only the relationships of temperature vs. salinity, temperature vs. silicate and silicate vs. total inorganic nitrogen showed significant correlation and similarity of spatial clustered pattern. Considering Pearson's correlation and the spatial autocorrelation results, water quality distribution patterns of Masan Bay were conceptually simplified into four types. Based on the simplified types, Moran's I and Pearson's r were compared respectively with spatial distribution maps on salinity and silicate with a strong clustered pattern, and with chlorophyll a having no clustered pattern. According to these test results, spatial distribution of the water quality in Masan Bay could be summed up in four patterns. This summation should be developed as spatial index to be linked with pollutant and ecological indicators for coastal health assessment.

Evaluation of Reliability and Validity of the Louisville Instrument for Transplantation (LIFT) in Korean Population (한글판 Louisville Instrument for Transplantation 설문지의 신뢰도 및 타당도 평가)

  • Kim, Hong-Min;Kim, Ji-Hoon;Hwang, Jae-Ha;Kim, Kwang-Seog;Lee, Sam-Yong
    • Archives of Plastic Surgery
    • /
    • v.38 no.3
    • /
    • pp.245-250
    • /
    • 2011
  • Purpose: Composite tissue allotransplantation has emerged as a new therapeutic modality to reconstruct major tissue defects of the head, neck and extremities. A questionnaire-based instrument, the Louisville Instrument for Transplantation (LIFT), has been developed to objectively assess the risk-versus-benefit ratio for composite tissue allotransplantation procedures. The objective of this study is to assess if the LIFT is a useful, reliable and valid tool to apply to the Korean population. Methods: Seventy-three medical students and 60 lay public completed the LIFT questionnaire (translated to Korean) over the period from February 2010 to April 2010. Internal consistency was assessed using Cronbach's alpha. Test-retest reliability was analyzed using Pearson's correlation coefficient. Construct validity was assessed by comparing Pearson's correlation coefficients between perceived improvements in quality of life and responses to risk tolerance questions concerning organ transplants. Results: Measurements of the test-retest reliability showed that Pearson's correlation coefficients ranged from 0.241 to 0.902, and Cronbach's alphas ranged from 0.52 to 0.80 for medical students and from 0.63 to 0.83 for the lay public. Pearson's correlation coefficients showed significant correlations between perceived improvements in quality of life and responses to risk tolerance questions concerning organ transplants. Hand transplant showed a significant correlation in medical students. Foot, hand, two hands, larynx, partial face transplants showed significant correlations for the lay public. Conclusion: The applicability of the LIFT to the Korean population was found to be reliable and valid. The LIFT may serve as a useful tool for clinical application in the Korean population.

Statistical Analysis of Experimental Results on Emission Characteristics of Biodiesel Blended Fuel (바이오디젤 혼합연료의 배기특성 실험결과에 대한 통계학적 해석)

  • Yeom, Jeong Kuk;Yoon, Jeong Hwan
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.39 no.12
    • /
    • pp.1199-1206
    • /
    • 2015
  • In this study, the exhaust gas of a diesel engine operating on biodiesel(BD) fuel(a mixture of diesel and soybean oil) was investigated for different fuel mixing ratios in the range of BD3 to BD100. The experiments were conducted using injection pressures of 400, 600, 800, 1000, and 1200 bar. The Pearson correlation coefficient and Spearman rank-order correlation coefficient were used to quantify the NOx and Soot emissions based on the fuel mixing ratio and injection pressure. Consequently, the Pearson correlation coefficient obtained for NOx and Soot emissions according to the mixing ratio and injection pressure was -0.811 and the corresponding Spearman rank-order correlation coefficient was -0.884, which indicated that the correlation of the NOx and Soot emissions was linear. Thus, the NOx and Soot have a trade-off relationship. Moreover, at each injection pressure, the Pearson correlation coefficient was a negative number, which indicated an inversely proportional relationship between NOx and Soot.

Assessment of the usefulness of the Machine Performance Check system that is an evaluation tools for the determination of daily beam output (일간 빔 출력 확인을 위한 평가도구인 Machine Performance Check의 유용성 평가)

  • Lee, Sang Hyeon;Ahn, Woo Sang;Lee, Woo Seok;Choi, Jin Hyeok;Kim, Seon Yeon
    • The Journal of Korean Society for Radiation Therapy
    • /
    • v.29 no.2
    • /
    • pp.65-73
    • /
    • 2017
  • Purpose: Machine Performance Check (MPC) is a self-checking software based on the Electronic Portal Imaging Device (EPID) to measure daily beam outputs without external installation. The purpose of this study is to verify the usefulness of MPC by comparing and correlating daily beam output of QA Beamchecker PLUS. Materials and Methods: Linear accelerator (Truebeam 2.5) was used to measure 10 energies which are composed of photon beams(6, 10, 15 MV and 6, 10 MV-FFF) and electron beams(6, 9, 12, 16 and 20 MeV). A total of 80 cycles of data was obtained by measuring beam output measurement before treatment over five months period. The Pearson correlation coefficient was used to evaluate the consistency of the beam output between the MPC and the QA Beamchecker PLUS. In this study, if the Pearson correlation coefficient is; (1) 0.8 or higher, the correlation is very strong (2) between 0.6 and 0.79, the correlation is strong (3) between 0.4 and 0.59, the correlation is moderate (4) between 0.2 and 0.39, the correlation is weak (5) lower than 0.2, the correlation is very weak. Results: Output variations observed between MPC and QA Beamchecker PLUS were within 2 % for photons and electrons. The beam outputs variations of MPC were $0.29{\pm}0.26%$ and $0.30{\pm}0.26%$ for photon and electron beams, respectively. QA Beamchecker PLUS beam outputs were $0.31{\pm}0.24%$ and $0.33{\pm}0.24%$ for photon and electron beams, respectively. The Pearson correlation coefficient between MPC and QA Beamchecker PLUS indicated that photon beams were very strong at 15 MV, and strong at 6 MV, 10 MV, 6 MV-FFF and 10 MV-FFF. For electron beams, the Pearson correlation coefficient were strong at 16 MeV and 20 MeV, moderate at 9 MeV and 12 MeV, and very weak at 6 MeV. Conclusion: MPC showed significantly strong correlation with QA Beamchecker PLUS when testing with photon beams and high-energy electron beams in the evaluation of daily beam output, but the correlation when testing with low-energy electron beams (6 MeV) appeared to be low. However, MPC and QA Beamchecker PLUS are considered to be suitable for checking daily beam output, as they performed within 2 % of beam output consistency during the observation. MPC which can perform faster than the conventional daily beam output measurement tool, is considered to be an effective method for users.

  • PDF