• Title/Abstract/Keywords: Pearson Correlation Coefficient

Exploratory data analysis for Chatterjee's ξ coefficient

  • 장대흥
    • 응용통계연구 / Vol. 35, No. 3 / pp. 421-434 / 2022
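  • Chatterjee (2021) proposed a new correlation coefficient, ξ. We carry out an exploratory data analysis of the ξ coefficient around two questions: (1) Can the ξ coefficient distinguish among the data sets in Anscombe's quartet? (2) How does the value of the ξ coefficient change with the number of data points across various types of scatterplots? We compare three measures: the ξ coefficient, the Pearson correlation coefficient, and mutual information.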
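
The abstract above does not state the formula, so the following is only a minimal sketch of the standard no-ties definition of Chatterjee's coefficient, ξ_n = 1 − 3 Σ|r_{i+1} − r_i| / (n² − 1), where r_i is the rank of the y value paired with the i-th smallest x. The function names and the example data are assumptions made for illustration and are not taken from the paper. On a noiseless but non-monotone relationship, Pearson's r stays near zero while ξ is close to 1.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def chatterjee_xi(x, y):
    """Chatterjee's xi_n, assuming no ties in x or in y."""
    n = len(x)
    # Reorder the y values by increasing x.
    y_by_x = [b for _, b in sorted(zip(x, y))]
    # Rank each reordered y value (1 = smallest).
    order = sorted(range(n), key=lambda i: y_by_x[i])
    ranks = [0] * n
    for rank, i in enumerate(order, start=1):
        ranks[i] = rank
    # Sum the absolute rank differences of consecutive points.
    total = sum(abs(ranks[i + 1] - ranks[i]) for i in range(n - 1))
    return 1 - 3 * total / (n ** 2 - 1)


# Hypothetical example data: a noiseless, non-monotone relationship.
xs = list(range(-50, 51))
ys = [(a + 0.25) ** 2 for a in xs]  # the 0.25 offset avoids tied y values

print("Pearson r:", round(pearson_r(xs, ys), 3))
print("Chatterjee xi:", round(chatterjee_xi(xs, ys), 3))
```

Data with ties need the more general form of the coefficient, which this sketch deliberately leaves out.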

Image Recognition by Using Hybrid Coefficient Measure of Correlation and Distance

  • 홍성준;조용현
    • 한국지능시스템학회논문지 / Vol. 20, No. 3 / pp. 343-347 / 2010
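  • This paper proposes an effective image recognition method based on a hybrid similarity measure that combines a correlation coefficient and a distance coefficient. The correlation coefficient, computed as the Pearson coefficient, measures statistical similarity; the distance coefficient, computed as the city-block distance, measures spatial similarity. The overall similarity between images is computed from the similarity between their features, which are extracted by PCA and by ICA. In experiments on 960 facial expression images of 40*50 pixels (30 persons * 4 expressions * 2 lighting conditions * 4 poses), the ICA-based hybrid measure achieved a higher recognition rate than the PCA-based hybrid measure and was also robust to surrounding conditions such as lighting.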

Correlation analysis of traffic and crack in concrete lining

  • 이규필
    • 한국터널지하공간학회 논문집 / Vol. 25, No. 5 / pp. 345-355 / 2023
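  • To analyze the relationship between two variables, traffic volume and cracking, this study performed covariance analysis and Pearson correlation analysis. For this purpose, the results of precision safety inspections and precision safety diagnoses, together with traffic volumes, were surveyed and analyzed for 216 national highway tunnels. The analysis showed that traffic volume is highly correlated with cracks occurring in the concrete lining. Preemptive maintenance such as crack repair should therefore be planned with traffic volume taken into account.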

Correlation Analysis of Atmospheric Pollutants and Meteorological Factors Based on Environmental Big Data

  • Chao, Chen;Min, Byung-Won
    • International Journal of Contents / Vol. 18, No. 1 / pp. 17-26 / 2022
  • With the acceleration of urbanization and industrialization, air pollution has become increasingly serious, and the pollution control situation is not optimistic. Climate change has become a major global challenge. To respond actively to climate change, China has proposed carbon peak and carbon neutrality goals. However, the atmospheric pollutants and meteorological factors that affect air quality are complex and changeable, and the relationships and correlations between them must be further clarified. This paper uses China's 2013-2018 high-resolution air pollution reanalysis open data set and the Pearson Correlation Coefficient (PCC) to compute and visualize correlations in environmental monitoring big data. The approach is intuitive and quickly shows the correlation between pollutants and meteorological factors across time and space. It makes it easier for environmental management departments to use routine air quality monitoring data for dynamic decision-making and helps promote global climate governance. The experimental results show that, apart from ozone, which is negatively correlated, the other pollutants are positively correlated; meteorological factors have a greater impact on pollutants: temperature is negatively correlated with pollutants, air pressure is positively correlated, and the correlation with humidity is insignificant. Wind speed has a significant negative correlation with the six pollutants and has a greater impact on the diffusion of pollutants.

The Analysis of Correlation Between COVID-19 and Seoul Small Business Commercial Districts

  • 김재호;김장영
    • 한국정보통신학회논문지 / Vol. 25, No. 3 / pp. 384-388 / 2021
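  • At present, both in Korea and abroad, many small business owners are suffering losses because of COVID-19, and many stores are closing. National disaster relief funds have been used to encourage consumer spending and limit the damage to some extent, but preventing small businesses from closing has proved difficult. As of September 2020 in Seoul, more than 20,000 stores had closed because of the COVID-19 crisis, and more people are reporting depression from the so-called "Corona blue." This is a problem not only for Seoul and Korea but for every region worldwide affected by COVID-19. As the number of COVID-19 patients increases, the number of stores steadily decreases. By analyzing the Pearson, Spearman, and Kendall correlation coefficients, we show that the number of COVID-19 patients and the number of stores are negatively correlated.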
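
As a rough illustration of how the three coefficients named in the abstract above differ, the sketch below computes Pearson's r, Spearman's rank correlation (Pearson's r applied to ranks), and Kendall's tau (concordant minus discordant pairs, divided by the number of pairs) for two short series. The numbers are invented for illustration and are not the paper's data; the function names are likewise assumptions, and ties are not handled.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation coefficient."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def ranks(values):
    """Ranks starting at 1, assuming no tied values."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, i in enumerate(order, start=1):
        result[i] = rank
    return result


def spearman(x, y):
    """Spearman's rank correlation: Pearson's r computed on the ranks."""
    return pearson(ranks(x), ranks(y))


def kendall_tau(x, y):
    """Kendall's tau for data without ties."""
    n = len(x)
    score = 0
    for i, j in combinations(range(n), 2):
        product = (x[i] - x[j]) * (y[i] - y[j])
        score += (product > 0) - (product < 0)  # +1 concordant, -1 discordant
    return score / (n * (n - 1) / 2)


# Invented monthly figures, for illustration only.
patients = [10, 25, 40, 80, 150, 300]
stores = [1000, 990, 985, 960, 900, 850]

print("Pearson:", round(pearson(patients, stores), 3))
print("Spearman:", round(spearman(patients, stores), 3))
print("Kendall:", round(kendall_tau(patients, stores), 3))
```

Because the invented series is strictly decreasing, both rank-based coefficients reach -1, whereas Pearson's r falls short of -1 because the decrease is not linear.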

A Method for Reduction of Categorical Variables Based on a Concept of Pseudo-Correlation Coefficient

  • 권철신;홍순욱
    • 산업공학 / Vol. 14, No. 1 / pp. 79-83 / 2001
  • In this paper, we propose a simple method for reducing a set of categorical variables to a smaller but still meaningful number, and we demonstrate how the method can be applied to the variable-reduction problem that empirical research often faces during data processing. To this end, we introduce the concept of a pseudo-correlation coefficient, which makes it possible to use factor analysis (FA) as a tool for reducing variables. The main idea is to treat measures of association between categorical variables in the same way as Pearson's correlation coefficient in order to meet the input requirement of FA. After examining existing measures that could serve as pseudo-correlation coefficients, we selected Cramér's V coefficient because it gave the best result among them. To show the detailed procedure of the proposed method, we present a demonstration using data from 329 R&D projects conducted in 18 private laboratories in the electrical and electronics industry.
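
To make the selected measure concrete, here is a minimal sketch of Cramér's V for a contingency table of two categorical variables, V = sqrt(χ² / (n · min(r − 1, c − 1))). The function name and the example table are assumptions for illustration; the sketch also assumes that every row and column total is positive and does not reproduce the paper's factor-analysis step.

```python
import math


def cramers_v(table):
    """Cramér's V for an r x c table of counts (list of rows)."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    # Pearson chi-squared statistic against the independence model.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))


# Hypothetical 2 x 3 table of counts for two categorical variables.
example = [[30, 10, 5],
           [10, 20, 25]]
print("Cramér's V:", round(cramers_v(example), 3))
```

V lies between 0 (independence) and 1 (perfect association). Unlike Pearson's coefficient it is never negative, so a matrix of pairwise V values carries strength of association but not direction.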

Reliability Study of the Pattern Identification Questionnaire Developed by Korean Institute of Oriental Medicine

  • 김범수;임정화;이민희;윤영주
    • 대한한의진단학회지 / Vol. 17, No. 1 / pp. 29-44 / 2013
  • Objectives: This study aimed to assess the reliability of the Pattern Identification Questionnaire (PIQ) developed by the Korea Institute of Oriental Medicine and to examine its validity by comparing the pattern identification scores of different groups. Methods: We surveyed 258 participants (79 teachers and 179 graduate students at one School of Korean Medicine) using a self-reported questionnaire, and all participants were retested. Test-retest reliability was assessed with the kappa coefficient (κ) and the Pearson correlation coefficient. We also compared pattern identification scores by sex, age, and occupation. Results: 1. Of the 116 questions, κ could not be calculated for one; 22 (18.97%) scored under 0.4 in κ; 90 (77.59%) ranged from 0.4 to 0.8 in κ; and three (3.58%) scored 0.8 or over in κ. 2. Pearson correlation coefficients between test and retest scores were 0.4 or over for all pattern identification items. 3. The mean pattern identification score was generally higher in women than in men, particularly for the patterns of blood-deficiency, blood-stasis, yang-deficiency, and kidney disease. 4. The mean pattern identification score was generally higher in the graduate student group than in the teacher group. Conclusion: In terms of test-retest reliability, the PIQ showed relatively high reliability. Mean pattern identification scores differed with respect to knowledge of Korean medicine. Future research is therefore needed to modify the questionnaire items and confirm the validity of the questionnaire.

An Approach to Credibility Enhancement of Automated Collaborative Filtering System through Accommodating User's Rating Behavior

  • Sung, Jang-Hwan;Park, Jong-Hun
    • 한국경영정보학회:학술대회논문집 / 한국경영정보학회 2007 International Conference / pp. 576-581 / 2007
  • The purpose of this paper is to strengthen trust in automated collaborative filtering systems. Automated collaborative filtering is quickly becoming a popular technique for recommender systems. The methodology helps reduce information overload, and its output serves as an index of users' preferences. It can also be applied across many industries and fields. Since collaborative filtering was first developed, much research has sought to enhance its credibility and apply it in various fields. Among these systems, collaborative filtering based on the Pearson correlation coefficient is the most common in the literature. In this paper, we propose a new process diagram for the collaborative filtering algorithm and new factors intended to improve the credibility of the system. Their effects and relationships are also tested.

A Pattern Consistency Index for Detecting Heterogeneous Time Series in Clustering Time Course Gene Expression Data

  • 손영숙;백장선
    • 응용통계연구 / Vol. 18, No. 2 / pp. 371-379 / 2005
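  • This paper proposes a pattern consistency index for detecting time series with heterogeneous patterns that deviate from the representative pattern of their cluster when time course gene expression data are clustered using the Pearson correlation coefficient. We apply the index to serum time course gene expression data obtained from microarray experiments and examine its usefulness.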

The Effect of Co-rating on the Recommender System of User Base

  • Lee, Hee-Choon;Lee, Seok-Jun;Chung, Young-Jun
    • Journal of the Korean Data and Information Science Society / Vol. 17, No. 3 / pp. 775-784 / 2006
  • This study investigates the effect of the number of co-rated users on the mean absolute error (MAE). User-based collaborative filtering algorithms generally use a similarity weight to quantify the relation between the active user and other users. The original GroupLens estimation algorithm used Pearson's correlation coefficient; other researchers soon adopted various other weightings. Pearson's correlation coefficient and vector similarity, which comes from the field of information retrieval, are commonly used in the estimation algorithm. We analyze the effect of the number of co-rated users on prediction in a user-based recommender system.
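
The user-based scheme described in the abstract above can be sketched as follows, under stated assumptions: the similarity weight is Pearson's correlation over co-rated items, and the prediction adds a weighted average of neighbours' mean-centred ratings to the active user's mean. This is one common textbook formulation, not necessarily the exact variant evaluated in the paper. The ratings, user names, and function names are hypothetical.

```python
import math


def pearson_weight(ratings_a, ratings_b):
    """Return (Pearson weight over co-rated items, number of co-rated items)."""
    common = sorted(set(ratings_a) & set(ratings_b))
    if len(common) < 2:
        return 0.0, len(common)
    a = [ratings_a[item] for item in common]
    b = [ratings_b[item] for item in common]
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    num = sum((p - mean_a) * (q - mean_b) for p, q in zip(a, b))
    den = math.sqrt(sum((p - mean_a) ** 2 for p in a)
                    * sum((q - mean_b) ** 2 for q in b))
    if den == 0:
        return 0.0, len(common)  # constant ratings: weight undefined, use 0
    return num / den, len(common)


def predict(ratings, active, item):
    """Predict the active user's rating of an item from other users' ratings."""
    own = ratings[active]
    mean_active = sum(own.values()) / len(own)
    num = 0.0
    den = 0.0
    for user, theirs in ratings.items():
        if user == active or item not in theirs:
            continue
        weight, _ = pearson_weight(own, theirs)
        mean_user = sum(theirs.values()) / len(theirs)
        num += weight * (theirs[item] - mean_user)
        den += abs(weight)
    if den == 0:
        return mean_active  # no usable neighbours: fall back to the mean
    return mean_active + num / den  # not clipped to the rating scale


# Hypothetical ratings on a 1-5 scale.
ratings = {
    "alice": {"A": 5, "B": 3, "C": 4},
    "bob": {"A": 4, "B": 2, "C": 3, "D": 5},
    "carol": {"A": 1, "B": 5, "C": 2, "D": 1},
}

weight, n_corated = pearson_weight(ratings["alice"], ratings["bob"])
print("weight:", round(weight, 3), "co-rated items:", n_corated)
print("prediction for alice on D:", round(predict(ratings, "alice", "D"), 3))
```

The second value returned by `pearson_weight` is the co-rating count, the quantity whose effect on prediction error the study examines. With only a handful of co-rated items the weight can be extreme, which is one reason that count matters.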