• Title/Summary/Keyword: Data Preprocessing

Search Result 948, Processing Time 0.024 seconds

Design of ASM-based Face Recognition System Using (2D)2 Hybird Preprocessing Algorithm (ASM기반 (2D)2 하이브리드 전처리 알고리즘을 이용한 얼굴인식 시스템 설계)

  • Kim, Hyun-Ki;Jin, Yong-Tak;Oh, Sung-Kwun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.24 no.2
    • /
    • pp.173-178
    • /
    • 2014
  • In this study, we introduce ASM-based face recognition classifier and its design methodology with the aid of 2-dimensional 2-directional hybird preprocessing algorithm. Since the image of face recognition is easily affected by external environments, ASM(active shape model) as image preprocessing algorithm is used to resolve such problem. In particular, ASM is used widely for the purpose of feature extraction for human face. After extracting face image area by using ASM, the dimensionality of the extracted face image data is reduced by using $(2D)^2$hybrid preprocessing algorithm based on LDA and PCA. Face image data through preprocessing algorithm is used as input data for the design of the proposed polynomials based radial basis function neural network. Unlike as the case in existing neural networks, the proposed pattern classifier has the characteristics of a robust neural network and it is also superior from the view point of predictive ability as well as ability to resolve the problem of multi-dimensionality. The essential design parameters (the number of row eigenvectors, column eigenvectors, and clusters, and fuzzification coefficient) of the classifier are optimized by means of ABC(artificial bee colony) algorithm. The performance of the proposed classifier is quantified through yale and AT&T dataset widely used in the face recognition.

Proposed Pre-Processing Method for Improving Pothole Dataset Performance in Deep Learning Model and Verification by YOLO Model (딥러닝 모델에서 포트홀 데이터셋의 성능 향상을 위한 전처리 방법 제안과 YOLO 모델을 통한 검증)

  • Han-Jin Lee;Ji-Woong Yang;Ellen J. Hong
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.23 no.4
    • /
    • pp.249-255
    • /
    • 2022
  • Potholes are an important clue to the structural defects of asphalt pavement and cause many casualties and property damage. Therefore, accurate pothole detection is an important task in road surface maintenance. Many machine learning technologies are being introduced for pothole detection, and data preprocessing is required to increase the efficiency of deep learning models. In this paper, we propose a preprocessing method that emphasizes important textures and shapes in pothole datasets. The proposed preprocessing method uses intensity transformation to reduce unnecessary elements of the road and emphasize the texture and shape of the pothole. In addition, the feature of the porthole is detected using Superpixel and Sobel edge detection. Through performance comparison between the proposed preprocessing method and the existing preprocessing method, it is shown that the proposed preprocessing method is a more effective method than the existing method in detecting potholes.

Opponent Move Prediction of a Real-time Strategy Game Using a Multi-label Classification Based on Machine Learning (기계학습 기반 다중 레이블 분류를 이용한 실시간 전략 게임에서의 상대 행동 예측)

  • Shin, Seung-Soo;Cho, Dong-Hee;Kim, Yong-Hyuk
    • Journal of the Korea Convergence Society
    • /
    • v.11 no.10
    • /
    • pp.45-51
    • /
    • 2020
  • Recently, many games provide data related to the users' game play, and there have been a few studies that predict opponent move by combining machine learning methods. This study predicts opponent move using match data of a real-time strategy game named ClashRoyale and a multi-label classification based on machine learning. In the initial experiment, binary card properties, binary card coordinates, and normalized time information are input, and card type and card coordinates are predicted using random forest and multi-layer perceptron. Subsequently, experiments were conducted sequentially using the next three data preprocessing methods. First, some property information of the input data were transformed. Next, input data were converted to nested form considering the consecutive card input system. Finally, input data were predicted by dividing into the early and the latter according to the normalized time information. As a result, the best preprocessing step was shown about 2.6% improvement in card type and about 1.8% improvement in card coordinates when nested data divided into the early.

Empirical Study on Analyzing Training Data for CNN-based Product Classification Deep Learning Model (CNN기반 상품분류 딥러닝모델을 위한 학습데이터 영향 실증 분석)

  • Lee, Nakyong;Kim, Jooyeon;Shim, Junho
    • The Journal of Society for e-Business Studies
    • /
    • v.26 no.1
    • /
    • pp.107-126
    • /
    • 2021
  • In e-commerce, rapid and accurate automatic product classification according to product information is important. Recent developments in deep learning technology have been actively applied to automatic product classification. In order to develop a deep learning model with good performance, the quality of training data and data preprocessing suitable for the model are crucial. In this study, when categories are inferred based on text product data using a deep learning model, both effects of the data preprocessing and of the selection of training data are extensively compared and analyzed. We employ our CNN model as an example of deep learning model. In the experimental analysis, we use a real e-commerce data to ensure the verification of the study results. The empirical analysis and results shown in this study may be meaningful as a reference study for improving performance when developing a deep learning product classification model.

WiFi CSI Data Preprocessing and Augmentation Techniques in Indoor People Counting using Deep Learning (딥러닝을 활용한 실내 사람 수 추정을 위한 WiFi CSI 데이터 전처리와 증강 기법)

  • Kim, Yeon-Ju;Kim, Seungku
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.12
    • /
    • pp.1890-1897
    • /
    • 2021
  • People counting is an important technology to provide application services such as smart home, smart building, smart car, etc. Due to the social distancing of COVID-19, the people counting technology attracted public attention. People counting system can be implemented in various ways such as camera, sensor, wireless, etc. according to service requirements. People counting system using WiFi AP uses WiFi CSI data that reflects multipath information. This technology is an effective solution implementing indoor with low cost. The conventional WiFi CSI-based people counting technologies have low accuracy that obstructs the high quality service. This paper proposes a deep learning people counting system based on WiFi CSI data. Data preprocessing using auto-encoder, data augmentation that transform WiFi CSI data, and a proposed deep learning model improve the accuracy of people counting. In the experimental result, the proposed approach shows 89.29% accuracy in 6 subjects.

Learning data preprocessing technique for improving indoor positioning performance based on machine learning (기계학습 기반의 실내 측위 성능 향상을 위한 학습 데이터 전처리 기법)

  • Kim, Dae-Jin;Hwang, Chi-Gon;Yoon, Chang-Pyo
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.24 no.11
    • /
    • pp.1528-1533
    • /
    • 2020
  • Recently, indoor location recognition technology using Wi-Fi fingerprints has been applied and operated in various industrial fields and public services. Along with the interest in machine learning technology, location recognition technology based on machine learning using wireless signal data around a terminal is rapidly developing. At this time, in the process of collecting radio signal data required for machine learning, the accuracy of location recognition is lowered due to distorted or unsuitable data for learning. In addition, when location recognition is performed based on data collected at a specific location, a problem occurs in location recognition at surrounding locations that are not included in the learning. In this paper, we propose a learning data preprocessing technique to obtain an improved position recognition result through the preprocessing of the collected learning data.

A Robust Sequential Preprocessing Scheme for Efficient Lossless Image Compression (영상의 효율적인 무손실 압축을 위한 강인한 순차적 전처리 기법)

  • Kim, Nam-Yee;You, Kang-Soo;Kwak, Hoon-Sung
    • Journal of Internet Computing and Services
    • /
    • v.10 no.1
    • /
    • pp.75-82
    • /
    • 2009
  • In this paper, we propose a robust preprocessing scheme for entropy coding in gray-level image. The issue of this paper is to reduce additional information needed when bit stream is transmitted. The proposed scheme uses the preprocessing method of co-occurrence count about gray-levels in neighboring pixels. That is, gray-levels are substituted by their ranked numbers without additional information. From the results of computer simulation, it is verified that the proposed scheme could be reduced the compression bit rate by up to 44.1%, 37.5% comparing to the entropy coding and conventional preprocessing scheme respectively. So our scheme can be successfully applied to the application areas that require of losslessness and data compaction.

  • PDF

Preprocessing Methods for Effective Modulo Scheduling on High Performance DSPs (고성능 디지털 신호 처리 프로세서상에서 효율적인 모듈로 스케쥴링을 위한 전처리 기법)

  • Cho, Doo-San;Paek, Yun-Heung
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.5
    • /
    • pp.487-501
    • /
    • 2007
  • To achieve high resource utilization for multi-issue DSPs, production compiler commonly includes variants of iterative modulo scheduling algorithm. However, excessive cyclic data dependences, which exist in communication and media processing loops, unduly restrict modulo scheduling freedom. As a result, replicated functional units in multi-issue DSPs are often under-utilized. To address this resource under-utilization problem, our paper describes a novel compiler preprocessing strategy for effective modulo scheduling. The preprocessing strategy proposed capitalizes on two new transformations, which are referred to as cloning and dismantling. Our preprocessing strategy has been validated by an implementation for StarCore SC140 DSP compiler.

Study on Prediction of Internal Quality of Cherry Tomato using Vis/NIR Spectroscopy (가시광 및 근적외선 분광기법을 이용한 방울토마토의 내부품질 예측에 관한 연구)

  • Kim, Dae-Yong;Cho, Byoung-Kwan;Mo, Chang-Yeun;Kim, Young-Sik
    • Journal of Biosystems Engineering
    • /
    • v.35 no.6
    • /
    • pp.450-457
    • /
    • 2010
  • Although cherry tomato is one of major vegetables consumed in fresh vegetable market, the quality grading method is mostly dependant on size measurement using drum shape sorting machines. Using Visible/Near-infrared spectroscopy, apparatus to be able to acquire transmittance spectrum data was made and used to estimate firmness, sugar content, and acidity of cherry tomatoes grown at hydroponic and soil culture. Partial least square (PLS) models were performed to predict firmness, sugar content, and acidity for the acquired transmittance spectra. To enhance accuracy of the PLS models, several preprocessing methods were carried out, such as normalization, multiplicative scatter correction (MSC), standard normal variate (SNV), and derivatives, etc. The coefficient of determination ($R^2_p$) and standard error of prediction (SEP) for the prediction of firmness, sugar, and acidity of cherry tomatoes from green to red ripening stages were 0.859 and 1.899 kgf, with a preprocessing of normalization, 0.790 and $0.434^{\circ}Brix$ with a preprocessing of the 1st derivative of Savitzky Golay, and 0.518 and 0.229% with a preprocessing normalization, respectively.

A multi-modal neural network using Chebyschev polynomials

  • Ikuo Yoshihara;Tomoyuki Nakagawa;Moritoshi Yasunaga;Abe, Ken-ichi
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1998.10a
    • /
    • pp.250-253
    • /
    • 1998
  • This paper presents a multi-modal neural network composed of a preprocessing module and a multi-layer neural network module in order to enhance the nonlinear characteristics of neural network. The former module is based on spectral method using Chebyschev polynomials and transforms input data into spectra. The latter module identifies the system using the spectra generated by the preprocessing module. The omnibus numerical experiments show that the method is applicable to many a nonlinear dynamic system in the real world, and that preprocessing using Chebyschev polynomials reduces the number of neurons required for the multi-layer neural network.

  • PDF