Search | Korea Science

Finding the best suited autoencoder for reducing model complexity

Ngoc, Kien Mai;Hwang, Myunggwon
- Smart Media Journal, v.10 no.3, pp.9-22, 2021
Machine learning models use input data to produce results. Sometimes, the input data is too complicated for the models to learn useful patterns. Therefore, feature engineering is a crucial data preprocessing step for constructing a proper feature set to improve the performance of such models. One of the most efficient methods for automating feature engineering is the autoencoder, which transforms the data from its original space into a latent space. However, certain factors, including the datasets, the machine learning models, and the number of dimensions of the latent space (denoted by k), should be carefully considered when using the autoencoder. In this study, we design a framework to compare two data preprocessing approaches, with and without an autoencoder, and to observe the impact of these factors on the autoencoder. We then conduct experiments using autoencoders with classifiers on popular datasets. The empirical results provide a perspective on the best-suited autoencoder for these factors.
https://doi.org/10.30693/SMJ.2021.10.3.9 KSCI

Design of ASM-based Face Recognition System Using (2D)² Hybird Preprocessing Algorithm (ASM기반 (2D)² 하이브리드 전처리 알고리즘을 이용한 얼굴인식 시스템 설계)

Kim, Hyun-Ki;Jin, Yong-Tak;Oh, Sung-Kwun
- Journal of the Korean Institute of Intelligent Systems, v.24 no.2, pp.173-178, 2014
In this study, we introduce an ASM-based face recognition classifier and its design methodology with the aid of a 2-dimensional 2-directional hybrid preprocessing algorithm. Since face images are easily affected by the external environment, ASM (active shape model) is used as an image preprocessing algorithm to resolve this problem. In particular, ASM is widely used for feature extraction from human faces. After the face image area is extracted using ASM, the dimensionality of the extracted face image data is reduced using the $(2D)^2$ hybrid preprocessing algorithm based on LDA and PCA. The preprocessed face image data is used as input for the design of the proposed polynomial-based radial basis function neural network. Unlike existing neural networks, the proposed pattern classifier has the characteristics of a robust neural network and is also superior in predictive ability as well as in its ability to resolve the problem of multi-dimensionality. The essential design parameters of the classifier (the number of row eigenvectors, column eigenvectors, and clusters, and the fuzzification coefficient) are optimized by means of the ABC (artificial bee colony) algorithm. The performance of the proposed classifier is quantified using the Yale and AT&T datasets, which are widely used in face recognition.
https://doi.org/10.5391/JKIIS.2014.24.2.173 KSCI

Proposed Pre-Processing Method for Improving Pothole Dataset Performance in Deep Learning Model and Verification by YOLO Model (딥러닝 모델에서 포트홀 데이터셋의 성능 향상을 위한 전처리 방법 제안과 YOLO 모델을 통한 검증)

Han-Jin Lee;Ji-Woong Yang;Ellen J. Hong
- Journal of the Institute of Convergence Signal Processing, v.23 no.4, pp.249-255, 2022
Potholes are an important clue to structural defects in asphalt pavement and cause many casualties and much property damage. Therefore, accurate pothole detection is an important task in road surface maintenance. Many machine learning technologies are being introduced for pothole detection, and data preprocessing is required to increase the efficiency of deep learning models. In this paper, we propose a preprocessing method that emphasizes important textures and shapes in pothole datasets. The proposed preprocessing method uses intensity transformation to reduce unnecessary elements of the road and emphasize the texture and shape of the pothole. In addition, the features of the pothole are detected using Superpixel and Sobel edge detection. A performance comparison between the proposed preprocessing method and the existing preprocessing method shows that the proposed method is more effective at detecting potholes.
https://doi.org/10.23087/jkicsp.2022.23.4.010 KSCI
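
The abstract above names Sobel edge detection as one preprocessing step but gives no implementation details. As a rough illustration only, the following sketch applies the standard 3x3 Sobel kernels to a grayscale image stored as a list of rows. The function name `sobel_magnitude` and the choice to leave border pixels at zero are assumptions made for this example, not details taken from the paper.

```python
import math

# Standard 3x3 Sobel kernels for horizontal and vertical gradients.
KX = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
KY = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]

def sobel_magnitude(image):
    """Return the Sobel gradient magnitude of a 2D grayscale image.

    `image` is a list of equal-length rows of numbers. Border pixels are
    left at 0.0 (an assumption of this sketch; other border policies exist).
    """
    rows, cols = len(image), len(image[0])
    out = [[0.0] * cols for _ in range(rows)]
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            gx = gy = 0
            for i in range(3):
                for j in range(3):
                    p = image[r + i - 1][c + j - 1]
                    gx += KX[i][j] * p
                    gy += KY[i][j] * p
            out[r][c] = math.sqrt(gx * gx + gy * gy)
    return out

# A vertical step edge: the interior pixels next to the step respond strongly.
step = [[0, 0, 10, 10] for _ in range(4)]
edges = sobel_magnitude(step)
```

In a pipeline like the one the abstract describes, such an edge map would presumably be combined with the intensity transformation and superpixel steps, but how the paper combines them is not stated here.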

Opponent Move Prediction of a Real-time Strategy Game Using a Multi-label Classification Based on Machine Learning (기계학습 기반 다중 레이블 분류를 이용한 실시간 전략 게임에서의 상대 행동 예측)

Shin, Seung-Soo;Cho, Dong-Hee;Kim, Yong-Hyuk
- Journal of the Korea Convergence Society, v.11 no.10, pp.45-51, 2020
Recently, many games have begun providing data related to users' game play, and a few studies have predicted opponent moves by combining machine learning methods. This study predicts opponent moves using match data from a real-time strategy game named ClashRoyale and multi-label classification based on machine learning. In the initial experiment, binary card properties, binary card coordinates, and normalized time information are used as input, and card type and card coordinates are predicted using random forest and multi-layer perceptron. Subsequently, experiments were conducted sequentially using the following three data preprocessing methods. First, some property information of the input data was transformed. Next, the input data were converted to nested form to account for the consecutive card input system. Finally, predictions were made after dividing the input data into early and late phases according to the normalized time information. As a result, the best preprocessing step, nested data restricted to the early phase, showed about 2.6% improvement in card type and about 1.8% improvement in card coordinates.
https://doi.org/10.15207/JKCS.2020.11.10.045 KSCI

Empirical Study on Analyzing Training Data for CNN-based Product Classification Deep Learning Model (CNN기반 상품분류 딥러닝모델을 위한 학습데이터 영향 실증 분석)

Lee, Nakyong;Kim, Jooyeon;Shim, Junho
- The Journal of Society for e-Business Studies, v.26 no.1, pp.107-126, 2021
In e-commerce, rapid and accurate automatic product classification according to product information is important. Recent developments in deep learning technology have been actively applied to automatic product classification. To develop a deep learning model with good performance, the quality of the training data and data preprocessing suitable for the model are crucial. In this study, for the case where categories are inferred from text product data using a deep learning model, the effects of both data preprocessing and training data selection are extensively compared and analyzed. We employ our CNN model as an example of a deep learning model. In the experimental analysis, we use real e-commerce data to ensure the validity of the study results. The empirical analysis and results shown in this study may serve as a reference for improving performance when developing a deep learning product classification model.
https://doi.org/10.7838/jsebs.2021.26.1.107 KSCI

WiFi CSI Data Preprocessing and Augmentation Techniques in Indoor People Counting using Deep Learning (딥러닝을 활용한 실내 사람 수 추정을 위한 WiFi CSI 데이터 전처리와 증강 기법)

Kim, Yeon-Ju;Kim, Seungku
- Journal of the Korea Institute of Information and Communication Engineering, v.25 no.12, pp.1890-1897, 2021
People counting is an important technology for providing application services such as smart home, smart building, and smart car. Due to social distancing during COVID-19, people counting technology has attracted public attention. A people counting system can be implemented in various ways, such as with cameras, sensors, or wireless signals, according to service requirements. A people counting system using a WiFi AP uses WiFi CSI data that reflects multipath information. This technology is an effective, low-cost solution for indoor deployment. Conventional WiFi CSI-based people counting technologies have low accuracy, which hinders high-quality service. This paper proposes a deep learning people counting system based on WiFi CSI data. Data preprocessing using an auto-encoder, data augmentation that transforms the WiFi CSI data, and the proposed deep learning model improve the accuracy of people counting. In the experimental results, the proposed approach shows 89.29% accuracy with 6 subjects.
https://doi.org/10.6109/jkiice.2021.25.12.1890 KSCI

Learning data preprocessing technique for improving indoor positioning performance based on machine learning (기계학습 기반의 실내 측위 성능 향상을 위한 학습 데이터 전처리 기법)

Kim, Dae-Jin;Hwang, Chi-Gon;Yoon, Chang-Pyo
- Journal of the Korea Institute of Information and Communication Engineering, v.24 no.11, pp.1528-1533, 2020
Recently, indoor location recognition technology using Wi-Fi fingerprints has been applied and operated in various industrial fields and public services. Along with the growing interest in machine learning, location recognition technology based on machine learning using wireless signal data around a terminal is developing rapidly. However, in the process of collecting the radio signal data required for machine learning, data that is distorted or unsuitable for learning lowers the accuracy of location recognition. In addition, when location recognition is performed based on data collected at a specific location, problems occur in recognizing surrounding locations that were not included in the learning. In this paper, we propose a learning data preprocessing technique that obtains an improved location recognition result by preprocessing the collected learning data.
https://doi.org/10.6109/jkiice.2020.24.11.1528 KSCI

A Robust Sequential Preprocessing Scheme for Efficient Lossless Image Compression (영상의 효율적인 무손실 압축을 위한 강인한 순차적 전처리 기법)

Kim, Nam-Yee;You, Kang-Soo;Kwak, Hoon-Sung
- Journal of Internet Computing and Services, v.10 no.1, pp.75-82, 2009
In this paper, we propose a robust preprocessing scheme for entropy coding of gray-level images. The aim is to reduce the additional information needed when the bit stream is transmitted. The proposed scheme uses a preprocessing method based on co-occurrence counts of gray levels in neighboring pixels. That is, gray levels are substituted by their ranked numbers without additional information. Computer simulation results verify that the proposed scheme reduces the compression bit rate by up to 44.1% and 37.5% compared to entropy coding and the conventional preprocessing scheme, respectively. Our scheme can therefore be successfully applied to application areas that require losslessness and data compaction.
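
The abstract above says gray levels are replaced by ranks derived from co-occurrence counts with neighboring pixels, without side information, but it does not spell out the procedure. The sketch below is one assumed interpretation, not the authors' algorithm: pixels are visited in sequence, the context is the previous pixel, and the counts are updated adaptively so that a decoder can rebuild the same tables on its own. The names `encode_ranks` and `decode_ranks`, the initial context of 0, and the tie-breaking rule are all hypothetical choices for this example.

```python
from collections import defaultdict

def _ordering(counts):
    # Most frequent successor first; ties are broken by gray level so the
    # encoder and decoder always derive the same order.
    return sorted(range(len(counts)), key=lambda g: (-counts[g], g))

def encode_ranks(pixels, levels=256):
    """Replace each gray level by its rank among successors of the previous pixel."""
    counts = defaultdict(lambda: [0] * levels)
    prev, ranks = 0, []  # assumed initial context: gray level 0
    for p in pixels:
        ranks.append(_ordering(counts[prev]).index(p))
        counts[prev][p] += 1  # adaptive update, mirrored by the decoder
        prev = p
    return ranks

def decode_ranks(ranks, levels=256):
    """Invert encode_ranks by rebuilding the same co-occurrence counts."""
    counts = defaultdict(lambda: [0] * levels)
    prev, pixels = 0, []
    for r in ranks:
        p = _ordering(counts[prev])[r]
        pixels.append(p)
        counts[prev][p] += 1
        prev = p
    return pixels

row = [3, 3, 3, 5, 3, 3, 5, 3]
ranks = encode_ranks(row, levels=8)
restored = decode_ranks(ranks, levels=8)  # equals row
```

As the counts accumulate, frequent neighbor pairs map to small ranks, which skews the symbol distribution toward zero and is what would make a subsequent entropy coder more effective. Sorting on every pixel is slow and is used here only for clarity.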

Preprocessing Methods for Effective Modulo Scheduling on High Performance DSPs (고성능 디지털 신호 처리 프로세서상에서 효율적인 모듈로 스케쥴링을 위한 전처리 기법)

Cho, Doo-San;Paek, Yun-Heung
- Journal of KIISE:Software and Applications, v.34 no.5, pp.487-501, 2007
To achieve high resource utilization on multi-issue DSPs, production compilers commonly include variants of the iterative modulo scheduling algorithm. However, excessive cyclic data dependences, which exist in communication and media processing loops, unduly restrict modulo scheduling freedom. As a result, replicated functional units in multi-issue DSPs are often under-utilized. To address this resource under-utilization problem, our paper describes a novel compiler preprocessing strategy for effective modulo scheduling. The proposed preprocessing strategy capitalizes on two new transformations, referred to as cloning and dismantling. Our preprocessing strategy has been validated by an implementation for the StarCore SC140 DSP compiler.
KSCI

Study on Prediction of Internal Quality of Cherry Tomato using Vis/NIR Spectroscopy (가시광 및 근적외선 분광기법을 이용한 방울토마토의 내부품질 예측에 관한 연구)

Kim, Dae-Yong;Cho, Byoung-Kwan;Mo, Chang-Yeun;Kim, Young-Sik
- Journal of Biosystems Engineering, v.35 no.6, pp.450-457, 2010
Although cherry tomato is one of the major vegetables consumed in the fresh vegetable market, quality grading mostly depends on size measurement using drum-shaped sorting machines. Using visible/near-infrared spectroscopy, an apparatus able to acquire transmittance spectrum data was built and used to estimate the firmness, sugar content, and acidity of cherry tomatoes grown in hydroponic and soil culture. Partial least squares (PLS) models were developed to predict firmness, sugar content, and acidity from the acquired transmittance spectra. To enhance the accuracy of the PLS models, several preprocessing methods were applied, such as normalization, multiplicative scatter correction (MSC), standard normal variate (SNV), and derivatives. For cherry tomatoes from the green to the red ripening stage, the coefficient of determination ($R^2_p$) and standard error of prediction (SEP) were 0.859 and 1.899 kgf for firmness with normalization preprocessing, 0.790 and $0.434\,^{\circ}\mathrm{Brix}$ for sugar content with Savitzky-Golay 1st derivative preprocessing, and 0.518 and 0.229% for acidity with normalization preprocessing.
https://doi.org/10.5307/JBE.2010.35.6.450 KSCI
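
Of the spectral preprocessing methods listed in the abstract above, standard normal variate (SNV) is simple enough to show in a few lines: each spectrum is centered on its own mean and divided by its own standard deviation. The sketch below is illustrative only. The function name `snv` is hypothetical, and the use of the sample standard deviation (n - 1 denominator) is an assumption, since the abstract does not say which variant the authors used.

```python
from statistics import mean, stdev

def snv(spectrum):
    """Standard normal variate: center and scale one spectrum by its own statistics."""
    m = mean(spectrum)
    s = stdev(spectrum)  # sample standard deviation (n - 1), an assumption here
    if s == 0:
        raise ValueError("SNV is undefined for a constant spectrum")
    return [(x - m) / s for x in spectrum]

# Two spectra that differ only by a multiplicative factor map to the same
# SNV result, which is why SNV is used to suppress scatter effects.
a = snv([1.0, 2.0, 3.0])
b = snv([10.0, 20.0, 30.0])
```

Because the correction is computed per spectrum, it needs no reference spectrum, unlike MSC.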
