• Title/Summary/Keyword: data preprocessing

Search Result 997, Processing Time 0.029 seconds

An Object Detection and Tracking System using Fuzzy C-means and CONDENSATION (Fuzzy C-means와 CONDENSATION을 이용한 객체 검출 및 추적 시스템)

  • Kim, Jong-Ho;Kim, Sang-Kyoon;Hang, Goo-Seun;Ahn, Sang-Ho;Kang, Byoung-Doo
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.16 no.4
    • /
    • pp.87-98
    • /
    • 2011
  • Detecting a moving object from videos and tracking it are basic and necessary preprocessing steps in many video systems like object recognition, context aware, and intelligent visual surveillance. In this paper, we propose a method that is able to detect a moving object quickly and accurately in a condition that background and light change in a real time. Furthermore, our system detects strongly an object in a condition that the target object is covered with other objects. For effective detection, effective Eigen-space and FCM are combined and employed, and a CONDENSATION algorithm is used to trace a detected object strongly. First, training data collected from a background image are linear-transformed using Principal Component Analysis (PCA). Second, an Eigen-background is organized from selected principal components having excellent discrimination ability on an object and a background. Next, an object is detected with FCM that uses a convolution result of the Eigen-vector of previous steps and the input image. Finally, an object is tracked by using coordinates of an detected object as an input value of condensation algorithm. Images including various moving objects in a same time are collected and used as training data to realize our system that is able to be adapted to change of light and background in a fixed camera. The result of test shows that the proposed method detects an object strongly in a condition having a change of light and a background, and partial movement of an object.

Smart Emotion Management System based on multi-biosignal Analysis using Artificial Intelligence (인공지능을 활용한 다중 생체신호 분석 기반 스마트 감정 관리 시스템)

  • Noh, Ayoung;Kim, Youngjoon;Kim, Hyeong-Su;Kim, Won-Tae
    • Journal of IKEEE
    • /
    • v.21 no.4
    • /
    • pp.397-403
    • /
    • 2017
  • In the modern society, psychological diseases and impulsive crimes due to stress are occurring. In order to reduce the stress, the existing treatment methods consisted of continuous visit counseling to determine the psychological state and prescribe medication or psychotherapy. Although this face-to-face counseling method is effective, it takes much time to determine the state of the patient, and there is a problem of treatment efficiency that is difficult to be continuously managed depending on the individual situation. In this paper, we propose an artificial intelligence emotion management system that emotions of user monitor in real time and induced to a table state. The system measures multiple bio-signals based on the PPG and the GSR sensors, preprocesses the data into appropriate data types, and classifies four typical emotional states such as pleasure, relax, sadness, and horror through the SVM algorithm. We verify that the emotion of the user is guided to a stable state by providing a real-time emotion management service when the classification result is judged to be a negative state such as sadness or fear through experiments.

Fusion of Aerosol Optical Depth from the GOCI and the AHI Observations (GOCI와 AHI 자료를 활용한 에어로졸 광학두께 합성장 산출 연구)

  • Kang, Hyeongwoo;Choi, Wonei;Park, Jeonghyun;Kim, Serin;Lee, Hanlim
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.5_1
    • /
    • pp.861-870
    • /
    • 2021
  • In this study, fused Aerosol Optical Depth (AOD) data were produced using AOD products from the Geostationary Ocean Color Imager (GOCI) onboard Communication, Oceanography and Meteorology Satellite (COMS)satellite and the Advanced Himawari Imager (AHI) onboard Himawari-8. Since the spatial resolution and the coordinate system between the satellite sensors are different, a preprocessing was first preceded. After that, using the level 1.5 AOD dataset of AErosol RObotic NETwork (AERONET), which is ground-based observation, correlations and trends between each satellite AOD and AERONET AOD were utilized to produce more accurate satellite AOD data than the originalsatellite AODs. The fused AOD were found to be more accurate than the originalsatellite AODs. Root Mean Square Error (RMSE) and mean bias of the fused AODs were calculated to be 0.13 and 0.05, respectively. We also compared errors of the fused AODs against those of the original GOCI AOD (RMSE: 0.15, mean bias: 0.11) and the original AHI AOD (RMSE: 0.15, mean bias: 0.05). It was confirmed that the fused AODs have betterspatial coverage than the original AODsin areas where there are no observations due to the presence of cloud from a single satellite.

An Empirical Study on Predictive Modeling to enhance the Product-Technical Roadmap (제품-기술로드맵 개발을 강화하기 위한 예측모델링에 관한 실증 연구)

  • Park, Kigon;Kim, YoungJun
    • Journal of Technology Innovation
    • /
    • v.29 no.4
    • /
    • pp.1-30
    • /
    • 2021
  • Due to the recent development of system semiconductors, technical innovation for the electric devices of the automobile industry is rapidly progressing. In particular, the electric device of automobiles is accelerating technology development competition among automobile parts makers, and the development cycle is also changing rapidly. Due to these changes, the importance of strategic planning for R&D is further strengthened. Due to the paradigm shift in the automobile industry, the Product-Technical Roadmap (P/TRM), one of the R&D strategies, analyzes technology forecasting, technology level evaluation, and technology acquisition method (Make/Collaborate/Buy) at the planning stage. The product-technical roadmap is a tool that identifies customer needs of products and technologies, selects technologies and sets development directions. However, most companies are developing the product-technical roadmap through a qualitative method that mainly relies on the technical papers, patent analysis, and expert Delphi method. In this study, empirical research was conducted through simulations that can supplement and strengthen the product-technical roadmap centered on the automobile industry by fusing Gartner's hype cycle, cumulative moving average-based data preprocessing, and deep learning (LSTM) time series analysis techniques. The empirical study presented in this paper can be used not only in the automobile industry but also in other manufacturing fields in general. In addition, from the corporate point of view, it is considered that it will become a foundation for moving forward as a leading company by providing products to the market in a timely manner through a more accurate product-technical roadmap, breaking away from the roadmap preparation method that has relied on qualitative methods.

Estimation of stream flow discharge using the satellite synthetic aperture radar images at the mid to small size streams (합성개구레이더 인공위성 영상을 활용한 중소규모 하천에서의 유량 추정)

  • Seo, Minji;Kim, Dongkyun;Ahmad, Waqas;Cha, Jun-Ho
    • Journal of Korea Water Resources Association
    • /
    • v.51 no.12
    • /
    • pp.1181-1194
    • /
    • 2018
  • This study suggests a novel approach of estimating stream flow discharge using the Synthetic Aperture Radar (SAR) images taken from 2015 to 2017 by European Space Agency Sentinel-1 satellite. Fifteen small to medium sized rivers in the Han River basin were selected as study area, and the SAR satellite images and flow data from water level and flow observation system operated by the Korea Institute of Hydrological Survey were used for model construction. First, we apply the histogram matching technique to 12 SAR images that have undergone various preprocessing processes for error correction to make the brightness distribution of the images the same. Then, the flow estimation model was constructed by deriving the relationship between the area of the stream water body extracted using the threshold classification method and the in-situ flow data. As a result, we could construct a power function type flow estimation model at the fourteen study areas except for one station. The minimum, the mean, and the maximum coefficient of determination ($R^2$) of the models of at fourteen study areas were 0.30, 0.80, and 0.99, respectively.

S-wave Velocity Structure and Radial Anisotropy of Saudi Arabia from Surface Wave Tomography (표면파 토모그래피를 이용한 사우디아라비아의 S파 속도구조 및 이방성 연구)

  • Kim, Rinhui;Chang, Sung-Joon;Mai, Martin;Zahran, Hani
    • Geophysics and Geophysical Exploration
    • /
    • v.22 no.1
    • /
    • pp.21-28
    • /
    • 2019
  • We perform a 3D tomographic inversion using surface wave dispersion curves to obtain S-velocity model and radial anisotropy beneath Saudi Arabia. The Arabian Peninsula is geologically and topographically divided into a shield and a platform. We used event data with magnitudes larger than 5.5 and epicentral distances shorter than $40^{\circ}$ during 2008 ~ 2014 from the Saudi Geological Survey. We obtained dispersion curves by using the multiple filtering technique after preprocessing the event data. We constructed SH- and SV-velocity models and consequently radial anisotropy model at 10 ~ 60 km depths by inverting Love and Rayleigh group velocity dispersion curves with period ranges of 5 ~ 140 s, respectively. We observe high-velocity anomalies beneath the Arabian shield at 10 ~ 30 km depths and low-velocity anomalies beneath the Arabian platform at 10 km depth in the SV-velocity model. This discrepancy may be caused by the difference between the Arabian shield and the Arabian platform, that is, the Arabian shield was formed in Proterozoic thereby old and cold, while the Arabian platform is covered by predominant Paleozoic, Mesozoic, and Cenozoic sedimentary layers. Also we obtained radial anisotropy by estimating the differences between SH- and SV-velocity models. Positive anisotropy is observed, which may be generated by lateral tension due to the slab pull of subducting slabs along the Zagros belt.

A Method for Prediction of Quality Defects in Manufacturing Using Natural Language Processing and Machine Learning (자연어 처리 및 기계학습을 활용한 제조업 현장의 품질 불량 예측 방법론)

  • Roh, Jeong-Min;Kim, Yongsung
    • Journal of Platform Technology
    • /
    • v.9 no.3
    • /
    • pp.52-62
    • /
    • 2021
  • Quality control is critical at manufacturing sites and is key to predicting the risk of quality defect before manufacturing. However, the reliability of manual quality control methods is affected by human and physical limitations because manufacturing processes vary across industries. These limitations become particularly obvious in domain areas with numerous manufacturing processes, such as the manufacture of major nuclear equipment. This study proposed a novel method for predicting the risk of quality defects by using natural language processing and machine learning. In this study, production data collected over 6 years at a factory that manufactures main equipment that is installed in nuclear power plants were used. In the preprocessing stage of text data, a mapping method was applied to the word dictionary so that domain knowledge could be appropriately reflected, and a hybrid algorithm, which combined n-gram, Term Frequency-Inverse Document Frequency, and Singular Value Decomposition, was constructed for sentence vectorization. Next, in the experiment to classify the risky processes resulting in poor quality, k-fold cross-validation was applied to categorize cases from Unigram to cumulative Trigram. Furthermore, for achieving objective experimental results, Naive Bayes and Support Vector Machine were used as classification algorithms and the maximum accuracy and F1-score of 0.7685 and 0.8641, respectively, were achieved. Thus, the proposed method is effective. The performance of the proposed method were compared and with votes of field engineers, and the results revealed that the proposed method outperformed field engineers. Thus, the method can be implemented for quality control at manufacturing sites.

Major Factors Influencing Landslide Occurrence along a Forest Road Determined Using Structural Equation Model Analysis and Logistic Regression Analysis (구조방정식과 로지스틱 회귀분석을 이용한 임도비탈면 산사태의 주요 영향인자 선정)

  • Kim, Hyeong-Sin;Moon, Seong-Woo;Seo, Yong-Seok
    • The Journal of Engineering Geology
    • /
    • v.32 no.4
    • /
    • pp.585-596
    • /
    • 2022
  • This study determined major factors influencing landslide occurrence along a forest road near Sangsan village, Sancheok-myeon, Chungju-si, Chungcheongbuk-do, South Korea. Within a 2 km radius of the study area, landslides occur intensively during periods of heavy rainfall (August 2020). This makes study of the area advantageous, as it allows examination of the influence of only geological and tomographic factors while excluding the effects of rainfall and vegetation. Data for 82 locations (37 experiencing landslides and 45 not) were obtained from geological surveys, laboratory tests, and geo-spatial analysis. After some data preprocessing (e.g., error filtering, minimum-maximum normalization, and multicollinearity), structural equation model (SEM) and logistic regression (LR) analyses were conducted. These showed the regolith thickness, porosity, and saturated unit weight to be the factors most influential of landslide risk in the study area. The sums of the influence magnitudes of these factors are 71% in SEM and 83% in LR.

Development of Scaffolding Strategies Model by Information Search Process (ISP) (정보탐색과정(ISP)에 의한 스캐폴딩 전략 모형 개발)

  • Jeong-Hoon Lim
    • Journal of Korean Library and Information Science Society
    • /
    • v.54 no.1
    • /
    • pp.143-165
    • /
    • 2023
  • This study aims to propose a scaffolding strategy that can be applied to the information search process by using Kuhlthau's ISP model, which presented a design and implementation strategy for the mediation role in the learning process. To this end, the relevant literature was reviewed to categorize scaffolding strategies, and impressions were collected from the students surveys after providing 150 middle school students in the Daejeon area with the project class to which the scaffolding strategy based on the ISP model was applied. The collected data were processed into a form suitable for analysis through data preprocessing for word frequencies to be extracted, and topic analysis was performed using STM (Structural Topic Modeling). First, after determining the optimal number of topics and extracting topics for each stage of the ISP model, the extracted topics were classified into three types: cognitive domain-macro perspective, cognitive domain-micro perspective, and emotional domain perspective. In this process, we focused on cognitive verbs and emotional verbs among words extracted through text mining, and presented a scaffolding strategy model related to each topic by reviewing representative document cases. Based on the results of this study, if an appropriate scaffolding strategy is provided at the ISP model stage, a positive effect on learners' self-directed task solving can be expected.

A Classification Model for Customs Clearance Inspection Results of Imported Aquatic Products Using Machine Learning Techniques (머신러닝 기법을 활용한 수입 수산물 통관검사결과 분류 모델)

  • Ji Seong Eom;Lee Kyung Hee;Wan-Sup Cho
    • The Journal of Bigdata
    • /
    • v.8 no.1
    • /
    • pp.157-165
    • /
    • 2023
  • Seafood is a major source of protein in many countries and its consumption is increasing. In Korea, consumption of seafood is increasing, but self-sufficiency rate is decreasing, and the importance of safety management is increasing as the amount of imported seafood increases. There are hundreds of species of aquatic products imported into Korea from over 110 countries, and there is a limit to relying only on the experience of inspectors for safety management of imported aquatic products. Based on the data, a model that can predict the customs inspection results of imported aquatic products is developed, and a machine learning classification model that determines the non-conformity of aquatic products when an import declaration is submitted is created. As a result of customs inspection of imported marine products, the nonconformity rate is less than 1%, which is very low imbalanced data. Therefore, a sampling method that can complement these characteristics was comparatively studied, and a preprocessing method that can interpret the classification result was applied. Among various machine learning-based classification models, Random Forest and XGBoost showed good performance. The model that predicts both compliance and non-conformance well as a result of the clearance inspection is the basic random forest model to which ADASYN and one-hot encoding are applied, and has an accuracy of 99.88%, precision of 99.87%, recall of 99.89%, and AUC of 99.88%. XGBoost is the most stable model with all indicators exceeding 90% regardless of oversampling and encoding type.