• 제목/요약/키워드: Exploratory data analysis

검색결과 1,339건 처리시간 0.023초

경시적 자료를 이용한 아동 학업성취도 분석 (A longitudinal data analysis for child academic achievement with Korea welfare panel study data)

  • 이나은;허집
    • Journal of the Korean Data and Information Science Society
    • /
    • 제28권1호
    • /
    • pp.1-10
    • /
    • 2017
  • 경시적 자료를 이용한 아동 학업성취도에 영향을 주는 요인을 찾기 위한 기존의 분석들은 각 아동의 반복 측정된 자료들이 독립이라고 가정한 모형을 주로 이용하였다. 본 연구에서는 기존 연구들에서 고려한 아동 학업성취도에 영향을 주는 변수들을 선택하여 반복 측정된 경시적 자료의 종속성을 고려한 고정효과와 임의효과를 포함하는 선형혼합모형으로 분석하여 아동 학업성취도에 영향을 주는 변수들은 무엇인지, 각 아동의 특성들이 반영되는 임의절편과 임의기울기가 있는지를 파악하는 것이 연구의 목적이다. 본 연구에 사용된 자료는 한국복지패널 1, 4, 7차 부가조사 중에서 아동용 설문문항에 대한 자료이고, 국어, 영어와 수학의 학업성취도 점수의 합을 아동 학업성취도로 한다. 선형혼합모형을 이용한 분석 시에 다중공선성의 검토와 결측치의 특성을 파악하고 적절한 오차의 상관행렬을 선택한다.

Investigating the underlying structure of particulate matter concentrations: a functional exploratory data analysis study using California monitoring data

  • Montoya, Eduardo L.
    • Communications for Statistical Applications and Methods
    • /
    • 제25권6호
    • /
    • pp.619-631
    • /
    • 2018
  • Functional data analysis continues to attract interest because advances in technology across many fields have increasingly permitted measurements to be made from continuous processes on a discretized scale. Particulate matter is among the most harmful air pollutants affecting public health and the environment, and levels of PM10 (particles less than 10 micrometers in diameter) for regions of California remain among the highest in the United States. The relatively high frequency of particulate matter sampling enables us to regard the data as functional data. In this work, we investigate the dominant modes of variation of PM10 using functional data analysis methodologies. Our analysis provides insight into the underlying data structure of PM10, and it captures the size and temporal variation of this underlying data structure. In addition, our study shows that certain aspects of size and temporal variation of the underlying PM10 structure are associated with changes in large-scale climate indices that quantify variations of sea surface temperature and atmospheric circulation patterns.

기업경기실사지수 예측에 대한 탐색적 연구: 데이터 마이닝을 이용하여 (An Exploratory Study on the Prediction of Business Survey Index Using Data Mining)

  • 박경보;김미량
    • 한국IT서비스학회지
    • /
    • 제22권4호
    • /
    • pp.123-140
    • /
    • 2023
  • In recent times, the global economy has been subject to increasing volatility, which has made it considerably more difficult to accurately predict economic indicators compared to previous periods. In response to this challenge, the present study conducts an exploratory investigation that aims to predict the Business Survey Index (BSI) by leveraging data mining techniques on both structured and unstructured data sources. For the structured data, we have collected information regarding foreign, domestic, and industrial conditions, while the unstructured data consists of content extracted from newspaper articles. By employing an extensive set of 44 distinct data mining techniques, our research strives to enhance the BSI prediction accuracy and provide valuable insights. The results of our analysis demonstrate that the highest predictive power was attained when using data exclusively from the t-1 period. Interestingly, this suggests that previous timeframes play a vital role in forecasting the BSI effectively. The findings of this study hold significant implications for economic decision-makers, as they will not only facilitate better-informed decisions but also serve as a robust foundation for predicting a wide range of other economic indicators. By improving the prediction of crucial economic metrics, this study ultimately aims to contribute to the overall efficacy of economic policy-making and decision processes.

탐색적요인분석과 확인적요인분석의 비교에 과한 연구 (The Study on the comparative analysis of EFA and CFA)

  • 최창호;유연우
    • 디지털융복합연구
    • /
    • 제15권10호
    • /
    • pp.103-111
    • /
    • 2017
  • 본 연구는 탐색적 요인분석과 확인적 요인분석에 대한 특성과 그 차이점에 대하여 살펴보고, 동일한 데이터를 활용하여 탐색적 요인분석과 확인적 요인분석의 분석과정 및 결과를 비교분석함으로써 두 방법론의 올바른 이해와 적용에 대하여 알아보고자 한다. 한편, 실증분석 결과는 아래와 같다. 탐색적 요인분석에서는 판별타당도가 저해되는 p.1, p.3이 제거된 반면, 확인적 요인분에서는 집중타당도가 저해되는 p.3가 제거 되었다. 탐색적 요인분석의 경우 다수의 측정변수를 소수의 요인으로 축약하는 분석과정(다소 부족한 이론적배경)인 반면, 확인적 요인분석은 측정변수와 잠재변수들 간의 관계를 파악 및 확인하는 과정(강력한 이론적배경)으로 동일한 데이터를 활용한다 하더라도 두 방법론은 언제든지 다른 결과 값이 도출될 수 있는 바, 데이터의 성격 등에 따라 올바른 방법론의 활용이 요구된다는 시사점을 보여주고 있다.

Improvement of SOM using Stratification

  • Jun, Sung-Hae
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제9권1호
    • /
    • pp.36-41
    • /
    • 2009
  • Self organizing map(SOM) is one of the unsupervised methods based on the competitive learning. Many clustering works have been performed using SOM. It has offered the data visualization according to its result. The visualized result has been used for decision process of descriptive data mining as exploratory data analysis. In this paper we propose improvement of SOM using stratified sampling of statistics. The stratification leads to improve the performance of SOM. To verify improvement of our study, we make comparative experiments using the data sets form UCI machine learning repository and simulation data.

조직의 여유자원과 혁신간의 비선형관계에 관한 연구 : 네트워크 다양성 조절효과 (A Study on the Curvilinear Relationship Between Slack and Innovation : Focus on Moderating Effect of Network Diversity)

  • 강소라;한수진
    • Journal of Information Technology Applications and Management
    • /
    • 제27권6호
    • /
    • pp.181-196
    • /
    • 2020
  • Based on the resource-based perspective, this study seeks to understand the relationship between the organizational slack and innovation, and to demonstrate that there exists a difference in the influence of the organizational slack according to the type of innovation by dividing the types of innovation into exploratory and exploitative innovations. They also want to understand the role that network diversity plays in the relationship between organizational slack and innovation. For this purpose, hypothesis and research models were presented based on resource-based perspectives and empirical analysis was conducted on 171 companies. The analysis confirmed that the impact of organizational slack on exploitative innovation is linear, not non-linear, as expected. In other words, the more resources available, the more productive the enterprise is, and the more resources available to the organization have a positive impact on the innovation. On the other hand, exploratory innovation represented an inverse U-shaped relationship between organizational slack and nonlinearity as expected. The control effect of network diversity was only seen in the relationship between organizational slack and exploratory innovation. Through this study, it provides implications such as the importance of network diversity, which is a relationship between companies, and the difference in the utilization of organizational slack according to the type of innovation.

Exploratory Analysis of AI-based Policy Decision-making Implementation

  • SunYoung SHIN
    • International Journal of Internet, Broadcasting and Communication
    • /
    • 제16권1호
    • /
    • pp.203-214
    • /
    • 2024
  • This study seeks to provide implications for domestic-related policies through exploratory analysis research to support AI-based policy decision-making. The following should be considered when establishing an AI-based decision-making model in Korea. First, we need to understand the impact that the use of AI will have on policy and the service sector. The positive and negative impacts of AI use need to be better understood, guided by a public value perspective, and take into account the existence of different levels of governance and interests across public policy and service sectors. Second, reliability is essential for implementing innovative AI systems. In most organizations today, comprehensive AI model frameworks to enable and operationalize trust, accountability, and transparency are often insufficient or absent, with limited access to effective guidance, key practices, or government regulations. Third, the AI system is accountable. The OECD AI Principles set out five value-based principles for responsible management of trustworthy AI: inclusive growth, sustainable development and wellbeing, human-centered values and fairness values and fairness, transparency and explainability, robustness, security and safety, and accountability. Based on this, we need to build an AI-based decision-making system in Korea, and efforts should be made to build a system that can support policies by reflecting this. The limiting factor of this study is that it is an exploratory study of existing research data, and we would like to suggest future research plans by collecting opinions from experts in related fields. The expected effect of this study is analytical research on artificial intelligence-based decision-making systems, which will contribute to policy establishment and research in related fields.

Pay Per Click Marketing Strategies: A Review of Empirical Evidence

  • Bhandari, Ravneet Singh
    • 산경연구논집
    • /
    • 제8권6호
    • /
    • pp.7-16
    • /
    • 2017
  • Purpose - Today's world revolves around search engines which are the driving force behind any marketer. The thirst for marketing has led to the evolution of online 'Pay per click' over last few years and is the most widely used instrument. Research design, data, and methodology - Exploratory research design highlights many marketing variables getting affected by pay per click marketing. To analyze the said phenomenon, the data was gathered through questionnaire from the sample of 338 respondents which were selected by simple random sampling method mostly from the National Capital Region (NCR) of Delhi in India. The data collected from the respondents was loaded on SAS base for exploratory factor analysis and multiple regression analysis. Results - Pay per click as a marketing tool has significant impact on the consumers. The most prominent factors of pay per click marketing identified in the research are Ad quality, Competition, Targeting, Trend and Budget. Conclusions - Organic as well as inorganic ads, keeping in mind the end goal to gage the exchange of these two postings in the marked look territory. Additionally, here we dissected supported pursuit promotions in all. It would be beneficial to break down the impact of promotion position on the pay per click marketing.

탐색적 요인 분석을 이용한 기업의 ISMS 인증 시 장애요인에 관한 연구 (An Empirical Study on the Obstacle Factors of ISMS Certification Using Exploratory Factor Analysis)

  • 박경태;김세헌
    • 정보보호학회논문지
    • /
    • 제24권5호
    • /
    • pp.951-959
    • /
    • 2014
  • 최근 들어 세계적으로 정보자산 유출에 대한 문제가 대두되고 있다. 국가정보원에 따르면, 2003년부터 2013년까지 총 375건의 해외 기술 유출을 적발했으며, 특히 2013년 한 해에만 49건이 적발되는 등 시간이 갈수록 증가 추세를 보이고 있다. 이는 기업에서 정보보호 관리체계를 수립 운용하고 이를 인증 받아야 할 필요성이 있음을 뜻한다. 하지만 ISMS 인증을 받기 위해서는 아직까지 이미 드러난, 혹은 아직 드러나지 않은 장애요인들이 상당히 존재하며, 관련 연구 또한 아직 부족하다. 따라서 본 연구에서는 기업이 ISMS 인증 시에 어떤 장애요인을 가지고 있는지 탐색적 요인 분석 기법을 이용하여 실증적으로 분석하였다. 연구 결과, 심사 난이도 및 기간, 컨설팅 업체 관련 요인, 인증 선행 사례 및 컨설팅 인력 자질, 내부적 요인, 인증기관 신뢰도 및 심사 비용, 인증 혜택 관련 요인과 같이 총 여섯 개의 압축된 요인을 도출하였다.

지구통계기법과 GIS를 이용한 연안지역 해수침투 분포 파악 (Analysis of the Distribution Pattern of Seawater Intrusion in Coastal Area using the Geostatistics and GIS)

  • 최선영;고와라;윤왕중;황세호;강문경
    • Spatial Information Research
    • /
    • 제11권3호
    • /
    • pp.251-260
    • /
    • 2003
  • 본 연구에서는 지구통계기법과 GIS를 이용하여 Cl/sup -/ 농도 분포도를 작성하고 이를 통해 해수침투 분포 양상을 파악하였다. 분포도는 탐색적 공간자료 분석을 통해 자료의 분포 패턴을 파악한 후에 정규크리깅과 공동크리깅을 이용하여 작성하였다. 지구통계기법인 크리깅은 시ㆍ공간적인 자료의 분포특성과 상관관계를 이용하여 신뢰할 만한 결과를 제공한다. 공동크리깅의 이차변수는 상관분석을 통해 Cl/sup -/과의 상관성이 큰 TDS, Na/sup +/, Br/sup -/을 선정하였다. Cl/sup -/ 농도 분포도를 분석한 결과 공동크리깅에 의한 분포도가 정규크리깅의 분포도보다 더욱 정밀하게 나타났으며, 전반적으로 이민촌 일대와 해안가 지역에서 높은 농도 이상대를 보이고 있음을 확인할 수 있었다.

  • PDF