• Title/Summary/Keyword: Exploratory data analysis

Search Result 1,351, Processing Time 0.024 seconds

Data-centric Smart Street Light Monitoring and Visualization Platform for Campus Management

  • Somrudee Deepaisarn;Paphana Yiwsiw;Chanon Tantiwattanapaibul;Suphachok Buaruk;Virach Sornlertlamvanich
    • Journal of information and communication convergence engineering
    • /
    • v.21 no.3
    • /
    • pp.216-224
    • /
    • 2023
  • Smart lighting systems have become increasingly popular in several public sectors because of trends toward urbanization and intelligent technologies. In this study, we designed and implemented a web application platform to explore and monitor data acquired from lighting devices at Thammasat University (Rangsit Campus, Thailand). The platform provides a convenient interface for administrative and operative staff to monitor, control, and collect data from sensors installed on campus in real time for creating geographically specific big data. Platform development focuses on both back- and front-end applications to allow a seamless process for recording and displaying data from interconnected devices. Responsible persons can interact with devices and acquire data effortlessly, minimizing workforce and human error. The collected data were analyzed using an exploratory data analysis process. Missing data behavior caused by system outages was also investigated.

Exploratory Spatial Data Analysis (ESDA) for Age-Specific Migration Characteristics : A Case Study on Daegu Metropolitan City (연령별 인구이동 특성에 대한 탐색적 공간 데이터 분석 (ESDA) : 대구시를 사례로)

  • Kim, Kam-Young
    • Journal of the Korean association of regional geographers
    • /
    • v.16 no.5
    • /
    • pp.590-609
    • /
    • 2010
  • The purpose of the study is to propose and evaluate Exploratory Spatial Data Analysis(ESDA) methods for examining age-specific population migration characteristics. First, population migration pyramid which is a pyramid-shaped graph designed with in-migration, out-migration, and net migration by age (or age group), was developed as a tool exploring age-specific migration propensities and structures. Second, various spatial statistics techniques based on local indicators of spatial association(LISA) such as Local Moran''s $I_i$, Getis-Ord ${G_i}^*$, and AMOEBA were suggested as ways to detect spatial dusters of age-specific net migration rate. These ESDA techniques were applied to age-specific population migration of Daegu Metropolitan City. Application results demonstrated that suggested ESDA methods can effectively detect new information and patterns such as contribution of age-specific migration propensities to population changes in a given region, relationship among different age groups, hot and cold spot of age-specific net migration rate, and similarity between age-specific spatial clusters.

  • PDF

2014 Korea Working Conditions Survey Data Analysis (2014년 근로환경조사 데이터 분석)

  • Kim, Youngsun;Lee, Jaehee;Baik, Jaiwook
    • Journal of Applied Reliability
    • /
    • v.15 no.3
    • /
    • pp.181-191
    • /
    • 2015
  • Change in labor time is affecting occupational safety and health. Recently reduction in labor time brought innovational operation method, investment in plant and equipment, and flexible labor time in some companies, thereby affecting working conditions for labourers. However, working conditions for some vulnerable social groups have deteriorated. As a result, they are becoming more exposed to risk factors such as injury and emotional stress. In this study we use 2014 KWCS data in order to find the social and demographic characteristics in Korean working conditions. To this end, we use exploratory data analysis approach to find the relationship between some of the important variables in the KWCS data. We also use Press-State-Response model to find which group of people are vulnerable to Press and State. We find that women, people over age 50 and business owners are more vulnerable to Press and State than men, people below age 50 and wage workers.

A Study on Road Characteristic Classification using Exploratory Factor Analysis (탐색적 요인분석을 이용한 도로특성분류에 관한 연구)

  • Cho, Jun-Han;Kim, Seong-Ho;Rho, Jeong-Hyun
    • Journal of Korean Society of Transportation
    • /
    • v.26 no.3
    • /
    • pp.53-66
    • /
    • 2008
  • This research is to the establishment of a conceptual framework that supports road characteristic classification from a new point of view in order to complement of the existing road functional classification and examine of traffic pattern. The road characteristic classification(RCC) is expected to use important performance criteria that produced a policy guidelines for transportation planning and operational management. For this study, the traffic data used the permanent traffic counters(PTCs) located within the national highway between 2002 and 2006. The research has described for a systematic review and assessment of how exploratory factor analysis should be applied from 12 explanatory variables. The optimal number of components and clusters are determined by interpretation of the factor analysis results. As a result, the scenario including all 12 explanatory variables is better than other scenarios. The four components is produced the optimal number of factors. This research made contributions to the understanding of the exploratory factor analysis for the road characteristic classification, further applying the objective input data for various analysis method, such as cluster analysis, regression analysis and discriminant analysis.

An Exploratory Methodology for Longitudinal Data Analysis Using SOM Clustering (자기조직화지도 클러스터링을 이용한 종단자료의 탐색적 분석방법론)

  • Cho, Yeong Bin
    • Journal of Convergence for Information Technology
    • /
    • v.12 no.5
    • /
    • pp.100-106
    • /
    • 2022
  • A longitudinal study refers to a research method based on longitudinal data repeatedly measured on the same object. Most of the longitudinal analysis methods are suitable for prediction or inference, and are often not suitable for use in exploratory study. In this study, an exploratory method to analyze longitudinal data is presented, which is to find the longitudinal trajectory after determining the best number of clusters by clustering longitudinal data using self-organizing map technique. The proposed methodology was applied to the longitudinal data of the Employment Information Service, and a total of 2,610 samples were analyzed. As a result of applying the methodology to the actual data applied, time-series clustering results were obtained for each panel. This indicates that it is more effective to cluster longitudinal data in advance and perform multilevel longitudinal analysis.

Analysis of Ammunition Inspection Record Data and Development of Ammunition Condition Code Classification Model (탄약검사기록 데이터 분석 및 탄약상태기호 분류 모델 개발)

  • Young-Jin Jung;Ji-Soo Hong;Sol-Ip Kim;Sung-Woo Kang
    • Journal of the Korea Safety Management & Science
    • /
    • v.26 no.2
    • /
    • pp.23-31
    • /
    • 2024
  • In the military, ammunition and explosives stored and managed can cause serious damage if mishandled, thus securing safety through the utilization of ammunition reliability data is necessary. In this study, exploratory data analysis of ammunition inspection records data is conducted to extract reliability information of stored ammunition and to predict the ammunition condition code, which represents the lifespan information of the ammunition. This study consists of three stages: ammunition inspection record data collection and preprocessing, exploratory data analysis, and classification of ammunition condition codes. For the classification of ammunition condition codes, five models based on boosting algorithms are employed (AdaBoost, GBM, XGBoost, LightGBM, CatBoost). The most superior model is selected based on the performance metrics of the model, including Accuracy, Precision, Recall, and F1-score. The ammunition in this study was primarily produced from the 1980s to the 1990s, with a trend of increased inspection volume in the early stages of production and around 30 years after production. Pre-issue inspections (PII) were predominantly conducted, and there was a tendency for the grade of ammunition condition codes to decrease as the storage period increased. The classification of ammunition condition codes showed that the CatBoost model exhibited the most superior performance, with an Accuracy of 93% and an F1-score of 93%. This study emphasizes the safety and reliability of ammunition and proposes a model for classifying ammunition condition codes by analyzing ammunition inspection record data. This model can serve as a tool to assist ammunition inspectors and is expected to enhance not only the safety of ammunition but also the efficiency of ammunition storage management.

Exploratory data analysis for Chatterjee's ξ coefficient (Chatterjee의 ξ 계수에 대한 탐색적자료분석)

  • Jang, Dae-Heung
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.3
    • /
    • pp.421-434
    • /
    • 2022
  • Chatterjee (2021) proposed a new correlation coefficient ξ. Focusing on two questions (1. Is ξ coefficient distinguishable for Anscombe's quartet data set?, 2. How does the ξ coefficient value change according to the number of data for various kinds of scatterplots?), an exploratory data analysis is attempted for ξ coefficient. We can compare three measures (ξ coefficient, Pearson's correlation coefficient and mutual information).

A study on rethinking EDA in digital transformation era (DX 전환 환경에서 EDA에 대한 재고찰)

  • Seoung-gon Ko
    • The Korean Journal of Applied Statistics
    • /
    • v.37 no.1
    • /
    • pp.87-102
    • /
    • 2024
  • Digital transformation refers to the process by which a company or organization changes or innovates its existing business model or sales activities using digital technology. This requires the use of various digital technologies - cloud computing, IoT, artificial intelligence, etc. - to strengthen competitiveness in the market, improve customer experience, and discover new businesses. In addition, in order to derive knowledge and insight about the market, customers, and production environment, it is necessary to select the right data, preprocess the data to an analyzable state, and establish the right process for systematic analysis suitable for the purpose. The usefulness of such digital data is determined by the importance of pre-processing and the correct application of exploratory data analysis (EDA), which is useful for information and hypothesis exploration and visualization of knowledge and insights. In this paper, we reexamine the philosophy and basic concepts of EDA and discuss key visualization information, information expression methods based on the grammar of graphics, and the ACCENT principle, which is the final visualization review standard, for effective visualization.

An Analysis on the Spatial Patterns of Heat Wave Vulnerable Areas and Adaptive Capacity Vulnerable Areas in Seoul (서울시 폭염 취약지역의 공간적 패턴 및 적응능력 취약지역 분석)

  • Choi, Ye Seul;Kim, Jae Won;Lim, Up
    • Journal of Korea Planning Association
    • /
    • v.53 no.7
    • /
    • pp.87-107
    • /
    • 2018
  • With more than 10 million inhabitants, in particular, Seoul, the capital of Korea, has already experienced a number of severe heat wave. To alleviate the potential impacts of heat wave and the vulnerability to heat wave, policy-makers have generally considered the option of heat wave strategies containing adaptation elements. From the perspective of sustainable planning for adaptation to heat wave, the objective of this study is to identify the elements of vulnerability and assess heat wave-vulnerability at the dong level. This study also performs an exploratory investigation of the spatial pattern of vulnerable areas in Seoul to heat wave by applying exploratory spatial data analysis. Then this study attempts to select areas with the relatively highest and lowest level of adaptive capacity to heat wave based on an framework of climate change vulnerability assessment. In our analysis, the adaptive capacity is the relatively highest for Seongsan-2-dong in Mapo and the relatively lowest for Changsin-3-dong in Jongno. This study sheds additional light on the spatial patterns of heat wave-vulnerability and the relationship between adaptive capacity and heat wave.

Effects of Work Environment on Job Satisfaction and Spontaneity Care Workers at Social Welfare Facilities

  • Kim, Moon-Jung
    • Journal of Distribution Science
    • /
    • v.13 no.8
    • /
    • pp.49-59
    • /
    • 2015
  • Purpose - This purpose of this research is to verify the influence of the care workers' environment on their job satisfaction and on their voluntary behavior. Research design, data, and methodology - Data were collected from care workers at elderly medical and home care facilities in Korea in Seoul and Kyung-ki. Of 367 total respondents, 285 responses were used. This study performed exploratory factor analysis in order to verify the validity and credibility of the data. Regression analysis was conducted to verify the influence of the working environment, which encompasses the worker's relationship with the agency and with the elderly, on job satisfaction. Results - The hypothesis results were: First, from analyzing the influence of the working environment on the worker's job satisfaction, both relationship with the agency (p<.001) and relationship with the elderly (p<.05) positively affect job satisfaction; second, the exploratory analysis verifies the influence or the working environment on job satisfaction. Conclusions - The results indicate that the relationship with the agency (p<.001) and relationship with the elderly (p<.001) both positively affect the voluntary behavior of the workers.