• 제목/요약/키워드: Exploratory data analysis

검색결과 1,339건 처리시간 0.02초

Correspondence analysis for studying association between geography and cancer

  • Song, Joon-Jin;Yu, Pingjian;Ren, Yuan;Chung, Ming-Hua
    • Journal of the Korean Data and Information Science Society
    • /
    • 제20권5호
    • /
    • pp.919-924
    • /
    • 2009
  • Geographical location carries information such as demography, local economy, environment, and life styles, which could be the sources of cancer occurrence. Analyzing geographical location associated with cancer occurrence can be instructive to physicians, patients, and health administrators regarding resource allocation, expenditures, prophylaxis and treatments. In this paper, we explored the correspondence relationship between geographical locations and mortality rates of the cancers using correspondence analysis and illustrated the approach with the mortality rates of the top 10 cancers in the 75 counties in Arkansas from 2001 to 2005. Geographical variations with respect to the mortality rates of cancers are evaluated across Arkansas counties. Based on the contingency table, correspondence analysis model is developed and the simple indices which indicate the degree to which the regions and the cancers affect each other are calculated. Quantitative results are visualized and mapped in two-dimensional graphs.

  • PDF

카페공간에 대한 수렴적 탐색상황에서의 주의집중 특성의 분석 방법에 관한 연구 - 선택적 주시데이터에 의한 뇌파 데이터 분석을 중심으로 - (A Study on the Attention Concentration Properties in Convergent Exploration Situations in Cafe Space - Focusing on Gaze and Brain wave Data Analysis -)

  • 김종하;김주연;김상희
    • 한국실내디자인학회논문집
    • /
    • 제25권2호
    • /
    • pp.30-40
    • /
    • 2016
  • This study analyzed the attention concentration tendencies of one(1) subject who showed convergent exploratory acts actively through the gaze-brainwave measurement experiment of cafe space images and our research findings are as follows. First, the areas of interest (AOIs) that the subject gazed visually by paying attention to it and concentrating on it at a cafe space include counter&menu area, sign area, partition area, image wall area, stairs area, and movable furniture area, and built-in furniture area: seven areas in total. Second, conscious gaze frequency appeared the highest in counter&menu area, and conscious gaze appeared more later than in initial times. Third, conscious gaze pattern was divided into the zone that explored various areas dispersely (distributed exploratory zone) and the zone that explored between particular areas concentratedly (intensive exploratory zone). Fourth, as a result of analyzing the brainwave attention concentration, it was found that the attention concentration in prefrontal lobe (Fp1, Fp2) and frontal lobe (F3, F4) rose to a higher level in the zone of 15 to 16 seconds and this time zone was considered to be a zone where gazing at counter&menu area was very active. In addition, the attention concentration appeared higher in the initial zone than in the later zone, among the entire experimental time zones. Finally, as a result of analyzing the changes in activation by brain portion of the SMR wave expressed when maintaining the arousal and attention concentration, it was found that the right prefrontal lobe and the frontal lobe became activated in the time zone when the intensive exploration of "counter&menu area" and "movable furniture${\leftrightarrow}$built-in furniture area" had occurred and the time zone when the intensive exploration of "image wall${\leftrightarrow}$partition area" and "counter&menu${\leftrightarrow}$sign area" had occurred.

Factors Affecting Logistics Capabilities for Logistics Service Providers: A Case Study in Vietnam

  • DANG, Dinh Dao;HA, Dieu Linh;TRAN, Van Bao;NGUYEN, Van Tuan;NGUYEN, Thi Lien Huong;DANG, Thuy Hong;LE, Thi Thai Ha
    • The Journal of Asian Finance, Economics and Business
    • /
    • 제8권5호
    • /
    • pp.81-89
    • /
    • 2021
  • This study aimed to investigate the factors affecting Logistics capabilities for Logistics Service Providers in Vietnam. Researchers inherited and developed based on previous research to focus on analyzing and evaluating dynamics, measuring Logistics capabilities, and the factors affecting Logistics capabilities for Logistics Service Providers. The logistics capabilities Model is used based on three factors: customer demand management capability, innovation capability, and information management capability. The empirical analysis used data from the survey data of l90 managers of Logistics Service Providers in Hai Phong, Ho Chi Minh City, Da Nang, Hue, Hanoi with reliable tools (SPSS 26.0 software). The data were analyzed by frequencies, percentages, means, Pearson's Linear Correlation Coefficient, exploratory factor analysis, and multi-linear regression model based on the survey data. The research results identified the following factors affecting Logistics capabilities for Logistics Service Providers: innovation capability has the strongest impact on Logistics capabilities; customer demand management capability has the following strong effects on Logistics capabilities; and finally, information management capability that affects Logistics capabilities. There is also a positive relationship between all factors and Logistics capabilities. Several recommendations are further suggested to enhance to improve Logistics capabilities for Logistics Service Providers in Vietnam.

다학제간 지식융합을 위한 '융합동기' 척도 개발 연구 (Development of the 'convergence motive' scale for interdisciplinary knowledge fusion)

  • 박성미;양황규
    • 수산해양교육연구
    • /
    • 제27권6호
    • /
    • pp.1880-1890
    • /
    • 2015
  • The purpose of this study was to development of the 'convergence motive' scale for interdisciplinary knowledge fusion. Based on results from literature review, this study clarifies a theoretical ground for 'convergence motive'. Initial items to measure this concept were verified by content analysis and then finalized. After a pilot test done with 568 college students, gathered data were analyzed by item selection and exploratory factor analysis to verify their validity. Next, the main test implemented with 1,211 college students was analyzed with exploratory factor analysis using the method for rotation based on maximum likelihood analysis and direct oblimin for validating the final items to measure 'convergence motive'. As a result, the scale for 'convergence motive' consists of 43 items to measure the following four factors: collaboration to identifying and solving problems, challenge of a new perspective, communication for convergence, cohesion for convergence. Construct validity and criterion-related validity were performed at last to check this scale's theoretical construct. In conclusion, this study concluded that the scales for convergence motive could be generalized and applicable to other samples.

한국어판 감성지능 측정도구의 신뢰도와 타당도 검증 (The Reliability and Validity of Korean Version of Wong and Law Emotional Intelligence Scale (K-WLEIS))

  • 정하림;최희정;박명숙
    • 대한간호학회지
    • /
    • 제50권4호
    • /
    • pp.611-620
    • /
    • 2020
  • Purpose: The aim of this study was to evaluate the reliability and validity of the Korean version of the Wong and Law Emotional Intelligence Scale (K-WLEIS). Methods: Data were collected from 360 nursing students using a self reported questionnaire. Exploratory and confirmatory factor analysis were used to test construct validity. Convergence validity was identified by correlation with communication competency. Item convergent and discriminant validity were also analyzed. Reliability was evaluated internal consistency and test-retest reliability. Results: The results of exploratory factor analysis showed that the eigen values ranged from 1.34 to 5.86 and 73.2% of the total explained variance. Confirmatory factor analysis showed adequate model fit indices (χ2/df 1.89, RMSEA .07, GFI .89, CFI .95, and TLI .93) and standardized factor loadings (.48 to .87). The average extracted variances (.71 to .79) and composite reliability (.80 to .87) validated convergence and discriminant validity of the items. Test-retest reliability of intra-class correlation coefficient was .90 and the Cronbach's alpha coefficient was .88. Conclusion: The K-WLEIS is an appropriate scale for measuring the emotional intelligence of Korean nursing students. Therefore, it is expected that the K-WLEIS will be used for nursing education programs to improve nursing students' emotional intelligence.

패션기업과 아티스트 간의 공동상품화를 위한 파트너쉽에 관한 연구 - 탐색적 요인분석, 구조방정식 모형분석을 중심으로 - (Partnerships for joint product development between fashion companies and artists - focusing on exploratory factor analysis and structure equation model analysis -)

  • 최소라;정성지;김동건
    • 한국의상디자인학회지
    • /
    • 제21권1호
    • /
    • pp.47-57
    • /
    • 2019
  • The purpose of the study was to explore effective satisfaction factors for continuous partnerships between fashion companies and artists. A questionnaire was developed by the researchers and results were collected from a total of 273 people who were working for a fashion company or working as an artist. Data was analyzed by the use of a frequency test, a reliability test, an exploratory factor analysis and a structure equation model analysis using AMOS 18.0. The results of the study were as follow. First, profitability and adequacy had significant effects, but awareness had no effect on confidence concerning the partnership. Second, awareness and profitability showed significant effects, but adequacy showed no effect on the flow among those in the partnership. Third, confidence had a significant effect on flow. Fourth, among the partnership factors, confidence and flow had significant effects on partnership satisfaction. Fifth, flow showed a significant effect on the intent for a continuous partnership, but confidence showed no effect.

A Comparative Study of Coffee Culture between Italy and South Korea: An Exploratory Study

  • Moretti, Raul
    • 아태비즈니스연구
    • /
    • 제8권2호
    • /
    • pp.41-55
    • /
    • 2017
  • This exploratory research compares two particular features of coffee culture, namely the reason why a particular coffee shop is frequented and the reason for going to the chosen coffee shop in Italy and South Korea. A survey was carried out targeted at current undergraduate university students in both countries with data being collected in the late spring and early summer of 2017. The main impetus for this research was to investigate the aforementioned areas given the fact that Italy has such a long standing coffee culture that dates back centuries and is still an industry dominated by independent coffee houses while the Korean coffee industry started developing in the early 1980s and taking off after the 1988 Olympic Games. The Korean coffee industry, in contrast, is driven by the franchise coffee shops such as Starbucks, $Caf{\acute{e}}$ Benne, and The Coffee Bean among others. While both countries have well developed coffee cultures, they developed along very different lines. Data collected from respondents are tabulated and presented followed by an analysis and interpretation of the data. Finally, some suggestions on how to conduct further research in order to better understanding the underpinnings and contributing factors in understanding consumer choice and coffee culture in both Italian and Korea are suggested.

  • PDF

확률 및 통계와 교원임용시험 (Probability and statistics in public secondary school teacher employment exam)

  • 오광식
    • Journal of the Korean Data and Information Science Society
    • /
    • 제28권6호
    • /
    • pp.1539-1545
    • /
    • 2017
  • 본 연구는 확률과 통계의 내용을 올바르게 지도할 수 있는 역량을 갖춘 수학 교사를 선발하기 위한 중등학교 수학교과 임용고사문제 중에서 확률과 통계 영역의 출세 경향을 분석하고, 앞으로의 출제 방향과 수준에 대하여 논의하고자 한다. 동시에 학교 수학 교사들에게 확률과 통계 단원을 지도하기 위해서 갖추어야 할 내용에 대한 일반적인 가이드를 제시하고자 한다. 첫째, 2015년 개정 고시된 교육과정의 수학 교과 중에서 확률과 통계 단원의 편제와 내용체계, 주요 변화 내용을 조사한다. 둘째, 중등교원임용시험에 15년간 출제 된 확률과 통계 단원의 문제들을 분석한다. 셋째, 기존의 출세 문제들이 중등학교의 확률과 통계를 올바르게 교육할 수 있는 자질을 갖춘 교사를 선발할 수 있는지에 대하여 검토한다. 마지막으로, 앞으로의 출제 내용, 범위, 수준, 그리고 방향에 대해서 논의한다. 결론적으로 4차 산업혁명시대를 맞이하여 빅 데이터의 중요성을 감안한다면 자료와 확률에 대한 통계적사고, 탐색적자료분석, 표본조사, 통계적 추론 그리고 공학적 도구의 활용 등의 출제가 더욱 필요하다고 본다.

Performance evaluation of principal component analysis for clustering problems

  • Kim, Jae-Hwan;Yang, Tae-Min;Kim, Jung-Tae
    • Journal of Advanced Marine Engineering and Technology
    • /
    • 제40권8호
    • /
    • pp.726-732
    • /
    • 2016
  • Clustering analysis is widely used in data mining to classify data into categories on the basis of their similarity. Through the decades, many clustering techniques have been developed, including hierarchical and non-hierarchical algorithms. In gene profiling problems, because of the large number of genes and the complexity of biological networks, dimensionality reduction techniques are critical exploratory tools for clustering analysis of gene expression data. Recently, clustering analysis of applying dimensionality reduction techniques was also proposed. PCA (principal component analysis) is a popular methd of dimensionality reduction techniques for clustering problems. However, previous studies analyzed the performance of PCA for only full data sets. In this paper, to specifically and robustly evaluate the performance of PCA for clustering analysis, we exploit an improved FCBF (fast correlation-based filter) of feature selection methods for supervised clustering data sets, and employ two well-known clustering algorithms: k-means and k-medoids. Computational results from supervised data sets show that the performance of PCA is very poor for large-scale features.

공급사슬성과와 정보기술역량 간의 관계에 관한 탐색적 분석 (Exploratory study on the relationship between supply chain performance and ICT capabilities)

  • 오수정;오광식
    • Journal of the Korean Data and Information Science Society
    • /
    • 제25권4호
    • /
    • pp.755-767
    • /
    • 2014
  • 최근 많은 기업들이 공급사슬에 정보통신기술 (information and communication technology; ICT)을 도입하고 있다. 그러나 기존의 연구들은 정보통신기술이 공급사슬에 미치는 영향과 관련하여 명확한 결론을 제시하지는 못하고 있다. 이에 본 연구는 기업에서 정보통신기술을 활용하는 역량의 관점을 제시하고 이를 네 가지 집단으로 분류하여 공급사슬성과에 미치는 영향을 살펴보고자 한다. ICT 역량을 구체적으로 협력과 변화 역량으로 구분하여 이를 토대로 집단을 네 가지 유형으로 분류하고, 공급사슬성과의 각 요인에 대하여 집단 간에 차이가 있는지 ANOVA분석과 사후검정을 실시하였다. 분석결과 정보통신기술의 역량이 모두 높은 집단이 공급사슬성과 중 특히 통합과 유연성 성과에서 가장 높은 수준인 것으로 나타났다. 통합과 유연성 변수의 세부문항에 대하여 집단 간 차이를 분석함으로써 기업 실무자에게 보다 정확하고 세세한 정보를 제공하고자 하였다.