• Title/Summary/Keyword: Exploratory data analysis

Search Result 1,339, Processing Time 0.028 seconds

Long-Term Trend Analysis and Exploratory Data Analysis of Geumho River based on Seasonal Mann-Kendall Test (계절 맨-켄달 기법을 이용한 금호강 본류 BOD의 장기 경향 분석 및 탐색적 자료 분석)

  • Jung, Kang-Young;Lee, In Jung;Lee, Kyung-Lak;Cheon, Se-Uk;Hong, Jun Young;Ahn, Jung-Min
    • Journal of Environmental Science International
    • /
    • v.25 no.2
    • /
    • pp.217-229
    • /
    • 2016
  • The government has conducted a plan of total maximum daily loads(TMDL), which divides with unit watershed, for management of stable water quality target by setting the permitted total amount of the pollutant. In this study, BOD concentration trends over the last 10 years from 2005 to 2014 were analyzed in the Geumho river. Improvement effect of water quality throughout the implementation period of TMDL was evaluated using the seasonal Mann-Kendall test and a LOWESS(locally weighted scatter plot smoother) smooth. As a study result of the seasonal Mann-Kendall test and the LOWESS smooth, BOD concentration in the Geumho river appeared to have been reduced or held at a constant. As a result of quantitatively analysis for BOD concentration with exploratory data analysis(EDA), the mean and the median of BOD concentration appeared in the order of GH8 > GH7 > GH6 > GH5 > GH4 > GH3 > GH2 > GH1. The monthly average concentration of BOD appeared in the order of Apr > Mar > Feb >May > Jun > Jul > Jan > Aug > Sep > Dec > Nov > Oct. As a result of the outlier, its value was the most frequent in February, which is estimated 1.5 times more than July, and was smallest frequent in July. The outlier in terms of water quality management is necessary in order to establish a management plan for the contaminants in watershed.

The Relationship between Residential Distribution of Immigrants and Crime in South Korea

  • Park, Yoonhwan
    • Journal of Distribution Science
    • /
    • v.16 no.7
    • /
    • pp.47-56
    • /
    • 2018
  • Purpose - This study aims to not only investigate spatial pattern of immigrants' residence and crime occurrences in South Korea, but shed light on how geographic distribution of immigrants and immigrant segregation affect crime rates. Research design, data, and methodology - Th unit of analysis is Si-Gun-Gu municipal level entities of South Korea. The crime data was obtained by Korea National Police Agency and two major types(violence and property) of crime were measured. Most demographic, social, and economic variables were derived from Korean Census Data in 2015. In order to examine spatial patterns of immigrants' distribution and crime rates in South Korea, the present study utilized GIS mapping technique and Exploratory Spatial Data Analysis(ESDA) tools. The causal linkage was investigated by a series of regression models using STATA. Results - Spatial inequality between urban metropolitan vs rural areas was visualized by mapping. Assuming large Moran's I value, spatial autocorrelation appeared to be quite strong. Several neighborhood characteristics such as residential stability and economic prosperity were found to be important factors leading to crime rate change. Residential distribution and segregation for immigrants were negatively significant in the regression models. Conclusions - Unlike the traditional arguments of social disorganization theory, immigrant segregation appeared to reduce violent crime rate and the high proportion of immigrants also turned out to be a crime prevention factor.

Exploratory Factor Analysis of SME Internationalization: Factor Differences between AEO and Non-AEO Authorized Companies

  • Son, Sung-Kyun;Kim, Tae-Joong;Kim, So-Hyung
    • Journal of Distribution Science
    • /
    • v.12 no.7
    • /
    • pp.5-12
    • /
    • 2014
  • Purpose - This study identified internationalization factors forKorean SMEs and explored factor differences between AEO and non-AEO authorized companies. Research design, data, and methodology - The study was designed to assess internationalization factors for AEO authorization in Korea through a questionnaire survey and an empirical analysis. The questionnaires were conducted for AEO and Non-AEO authorized companies that were undergoing AEO authorization. The study was conducted through e-mail and AEO manager education classes. Ninety-five questionnaires were collected. We employed the exploratory factor analysis methodology to derive internationalization factors for KoreanSMEs, and explored the factor differences between AEO and Non-AEO authorized companies. Results - AEO authorized companies outperformed Non-AEO authorized companies in R&D and technology. This indicated that AEO authorized companies were recognized as reliable and safe companies by the Korea Customs Service and other Customs services in trade facilitation and customs clearance processes. Conclusions - This study has some implications for AEO authorization and internationalization processes, and involved the empirical analysis of SMEs and the exploratory factor analysis in the internationalization process.

An Exploratory Study on the Approaches to the Statistical Yield and Analysis of Family Data (가족 데이터의 통계적 산출 및 분석방법에 관한탐색적 고찰)

  • 유계숙
    • Journal of Families and Better Life
    • /
    • v.14 no.1
    • /
    • pp.11-20
    • /
    • 1996
  • When data collected from more than one family member are utilized family researchers must take the correlation of family member's perception behavior or attitude scores into account viewing the couple or family as a unit of interdependent members. This paper presents a framework for categorizing family data based on the unit of analysis and several alternatives for the statistical analysis of family variables using individual- dyadic- and family-level data.

  • PDF

Exploratory and Confirmatory Factor Analysis of the Korean version of the Penn State Worry Questionnaire (한글판 펜실베니아 걱정 질문지의 탐색적 및 확인적 요인 분석)

  • Jeon, Jun Won;Kim, Daeho;Kim, Eunkyung;Roh, Sungwon
    • Anxiety and mood
    • /
    • v.13 no.2
    • /
    • pp.86-92
    • /
    • 2017
  • Objective : This study evaluated the factor structure of a Korean version of the Penn State Worry Questionnaire (K-PSWQ) with exploratory factor analysis in healthy adult subjects, and confirmatory factor analysis of subjects who have received psychiatric treatment. Methods : Exploratory principal component analysis was conducted with data from 318 non-psychiatric subjects, and 118 psychiatric patients were subjected to confirmatory factor analysis (maximum likelihood estimation). Participants were voluntary visitors at the booth who agreed to undergo screening for anxiety disorder at 2013 & 2014 Korea Mental Health Exhibitions. Results : Exploratory analysis revealed a two factor structure of the scale with total variance of 56.3%. Factor 1 was considered 'Worry engagement', and factor 2 was considered 'Absence of worry'. However, the results of the confirmatory factor analysis supported that both one factor model with method factor and two factor model are fit to structure of the scale considering fit indices. Internal consistency of total questions was good (Cronbach's ${\alpha}=0.899$). Conclusion : Our results supported the previously suggested factor structure of the PSWQ, and proved factorial validity of the K-PSWQ in both populations.

Various types of analyses for two-dimensional data (2차원 데이터의 여러 가지 분석방법)

  • Baik, Jai-Wook
    • Journal of Applied Reliability
    • /
    • v.10 no.4
    • /
    • pp.251-263
    • /
    • 2010
  • Modelling for failures is important for reliability analysis since failures of products such as automobiles occur as both time and usage progress and the results from the proper analysis of the two-dimensional data can be used for establishing warranty assurance policy. Hence, in this paper general issues which concern modelling failures are discussed, and both one-dimensional approaches and two-dimensional approaches to two-dimensional data are investigated. Finally non-parametric approaches to two-dimensional data are presented as a means of exploratory data analyses.

The Analysis by Postretirement of baby boom generation

  • Kim, Pan-Jin
    • The Journal of Economics, Marketing and Management
    • /
    • v.5 no.2
    • /
    • pp.33-39
    • /
    • 2017
  • As the aging population geworsened by the a of the low fertility rate in the wake of the birth of the low birth rate, the rapid increase in the retirement age of the baby boomers in the wake of the birth of the Korean War is a significant indication of the separation of the aged and the role of the economically rich and the role of the role of the economically rich. Therefore, this study aims to address issues and countermeasures. The study aims to provide basic data for the future life of the baby boom generation by examining the problems and responses to the economic activity after the retirement activity of the baby boomers. The research suggests that the limit was limited to the retirement age of the baby boomer generation in order to boost the employment of the elderly. Due to the lack of exploration of the exploratory research, the lack of analysis of exploratory facts is the biggest limitation of the analysis. So, further analysis of this will lead to meaningful studies. Looking at the composition of this study, the introduction of the study included the necessity and purpose of the study. The focus on the point was on the concepts and characteristics of the baby boomer, and analyzed the characteristics of the economic activity and analyses and analyses of domestic and international cases. In conclusion, the issue was drawn up and the alternatives were sought.

The Relation between Narcissistic and Cosmetics Shopping Orientation of Consumers (소비자의 자기애(自己愛) 성향과 화장품 쇼핑성향과의 관계)

  • Hwang, Yeon-Soon
    • Fashion & Textile Research Journal
    • /
    • v.11 no.2
    • /
    • pp.326-336
    • /
    • 2009
  • The primary purpose of this study was to investigate the relation between narcissistic and cosmetics shopping orientation of female consumers. The data were collected in Busan, Daegu and Ulsan, and 301 data were used for analysis. The aforementioned were analyzed utilizing frequency, factor and multiple regression analysis using SPSS Win 12.0. The results showed as follows. First, the factors related to narcissistic orientation were entitlement, leadership/superiority, self-reliance, self-intoxication, achievable desire and self-absorption. Second, the factors related to cosmetics shopping orientation were impulsive, economic, self-confidence, exploratory, brand/store loyal, shopping convenience, traffic convenience, prudence, pleasure, famous brand inclination and independent. Third, narcissistic orientation and cosmetics shopping orientation were significantly differences impulsive, economic, self-confidence, exploratory, shopping convenience, pleasure and famous brand inclination orientation.

The Study of Failure Mode Data Development and Feature Parameter's Reliability Verification Using LSTM Algorithm for 2-Stroke Low Speed Engine for Ship's Propulsion (선박 추진용 2행정 저속엔진의 고장모드 데이터 개발 및 LSTM 알고리즘을 활용한 특성인자 신뢰성 검증연구)

  • Jae-Cheul Park;Hyuk-Chan Kwon;Chul-Hwan Kim;Hwa-Sup Jang
    • Journal of the Society of Naval Architects of Korea
    • /
    • v.60 no.2
    • /
    • pp.95-109
    • /
    • 2023
  • In the 4th industrial revolution, changes in the technological paradigm have had a direct impact on the maintenance system of ships. The 2-stroke low speed engine system integrates with the core equipment required for propulsive power. The Condition Based Management (CBM) is defined as a technology that predictive maintenance methods in existing calender-based or running time based maintenance systems by monitoring the condition of machinery and diagnosis/prognosis failures. In this study, we have established a framework for CBM technology development on our own, and are engaged in engineering-based failure analysis, data development and management, data feature analysis and pre-processing, and verified the reliability of failure mode DB using LSTM algorithms. We developed various simulated failure mode scenarios for 2-stroke low speed engine and researched to produce data on onshore basis test_beds. The analysis and pre-processing of normal and abnormal status data acquired through failure mode simulation experiment used various Exploratory Data Analysis (EDA) techniques to feature extract not only data on the performance and efficiency of 2-stroke low speed engine but also key feature data using multivariate statistical analysis. In addition, by developing an LSTM classification algorithm, we tried to verify the reliability of various failure mode data with time-series characteristics.

Results of Discriminant Analysis with Respect to Cluster Analyses Under Dimensional Reduction

  • Chae, Seong-San
    • Communications for Statistical Applications and Methods
    • /
    • v.9 no.2
    • /
    • pp.543-553
    • /
    • 2002
  • Principal component analysis is applied to reduce p-dimensions into q-dimensions ( $q {\leq} p$). Any partition of a collection of data points with p and q variables generated by the application of six hierarchical clustering methods is re-classified by discriminant analysis. From the application of discriminant analysis through each hierarchical clustering method, correct classification ratios are obtained. The results illustrate which method is more reasonable in exploratory data analysis.