Search | Korea Science

Influence of Data Preprocessing

Zhu, Changming;Gao, Daqi
- Journal of Computing Science and Engineering / v.10 no.2 / pp.51-57 / 2016
This paper studies how data preprocessing influences classification. Different preprocessing methods lead to different classification performance, and not every preprocessing method is necessary; we give a criterion for deciding which preprocessing steps are necessary and which are effective. Experiments on several real-world data sets confirm that different preprocessing methods have different effects, and experiments that pair several algorithms with different preprocessing methods confirm that preprocessing strongly influences classifier performance.
https://doi.org/10.5626/JCSE.2016.10.2.51
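As a toy illustration of the claim that the choice of preprocessing can change a classifier's output, the sketch below runs a 1-nearest-neighbour classifier on the same points with and without min-max scaling. The data, labels, and function names are invented for this example and are not taken from the paper, whose criterion and experiments are not reproduced here.

```python
import math

def fit_min_max(rows):
    """Return per-feature (min, max) pairs computed from the training rows."""
    columns = list(zip(*rows))
    return [(min(col), max(col)) for col in columns]

def apply_min_max(row, ranges):
    """Scale one row to [0, 1] per feature; constant features map to 0."""
    return [
        (value - lo) / (hi - lo) if hi > lo else 0.0
        for value, (lo, hi) in zip(row, ranges)
    ]

def nearest_label(train_rows, train_labels, query):
    """1-nearest-neighbour prediction with Euclidean distance."""
    distances = [math.dist(row, query) for row in train_rows]
    return train_labels[distances.index(min(distances))]

# Hypothetical data: the second feature has a much larger numeric range.
train_rows = [[1.0, 1000.0], [1.2, 1002.0], [2.0, 1010.0], [1.8, 1009.0]]
train_labels = ["a", "a", "b", "b"]
query = [1.1, 1008.0]

# Without scaling, the large-range feature dominates the distance.
raw_prediction = nearest_label(train_rows, train_labels, query)

# With min-max scaling (fitted on the training rows only), both features count.
ranges = fit_min_max(train_rows)
scaled_rows = [apply_min_max(row, ranges) for row in train_rows]
scaled_prediction = nearest_label(
    scaled_rows, train_labels, apply_min_max(query, ranges)
)
```

On this toy data the two predictions differ, which is the only point the sketch makes; it says nothing about which preprocessing is better in general.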

An Implementation of Preprocessing for Interior Point Methods for Linear Programming (내부점 방법을 위한 사전처리의 구현)

성명기;임성묵;박순달
- Journal of the Korean Operations Research and Management Science Society / v.24 no.1 / pp.1-11 / 1999
We classify preprocessing methods into (1) analytic methods, (2) methods for removing implied free variables, (3) methods using pivot or elementary row operations, (4) methods for removing linearly dependent rows and columns, and (5) methods for dense columns. We note points that need attention when preprocessing methods are applied to interior point methods for linear programming, propose an efficient ordering of the preprocessing methods together with data structures, and describe the recovery process for dual solutions. We implemented the proposed preprocessing methods, tested the implementation on 28 large-scale NETLIB problems, and compared the results with those of the preprocessing routines of HOPDM, BPDPM, and CPLEX.
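The simplest case of category (4), removing linearly dependent rows, is a row that is a scalar multiple of an earlier row. The sketch below handles only that case for equality constraints A x = b, using exact fractions; the function names are hypothetical and this is not the paper's implementation, which covers general dependence and the other four categories.

```python
from fractions import Fraction

def normalize_row(coeffs, rhs):
    """Scale a row so its first nonzero coefficient is 1; None for a zero row."""
    pivot = next((c for c in coeffs if c != 0), None)
    if pivot is None:
        return None
    pivot = Fraction(pivot)
    return tuple(Fraction(c) / pivot for c in coeffs), Fraction(rhs) / pivot

def drop_proportional_rows(rows, rhs):
    """Remove equality rows that are scalar multiples of an earlier row.

    Returns (kept_rows, kept_rhs). Raises ValueError when two proportional
    rows require different right-hand sides, which makes A x = b infeasible.
    """
    seen = {}
    kept_rows, kept_rhs = [], []
    for coeffs, b in zip(rows, rhs):
        key = normalize_row(coeffs, b)
        if key is not None:
            norm_coeffs, norm_b = key
            if norm_coeffs in seen:
                if seen[norm_coeffs] != norm_b:
                    raise ValueError("proportional rows with inconsistent rhs")
                continue  # redundant copy of an earlier row
            seen[norm_coeffs] = norm_b
        # All-zero rows are passed through unchanged in this sketch.
        kept_rows.append(coeffs)
        kept_rhs.append(b)
    return kept_rows, kept_rhs
```

For example, `drop_proportional_rows([[1, 2, 0], [2, 4, 0], [0, 1, 1]], [3, 6, 2])` keeps the first and third rows, since the second is twice the first.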

Comparison of Performance According to Preprocessing Methods in Estimating %IMF of Hanwoo Using CNN in Ultrasound Images

Kim, Sang Hyun
- International journal of advanced smart convergence / v.11 no.2 / pp.185-193 / 2022
Various studies in Korea have aimed to develop a %IMF (intramuscular fat percentage) estimation method suitable for Hanwoo. Recently, %IMF estimation using a convolutional neural network (CNN), a deep learning method, has been studied. In this study, we compare the performance of CNN-based %IMF estimation from ultrasound images when various preprocessing methods are applied. The preprocessing methods are normalization, histogram equalization, edge enhancement, and a combination of normalization and edge enhancement. With the conventional method, which applies no preprocessing, the accuracy in the experiment was 98.2%. In contrast, the accuracy improved to 99.5% with histogram equalization alone or with normalization combined with edge enhancement.
https://doi.org/10.7236/IJASC.2022.11.2.185
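Of the preprocessing methods listed in this abstract, histogram equalization is the easiest to show in a few lines. The sketch below is the textbook cumulative-histogram mapping for a non-empty 8-bit grayscale image stored as a list of rows; it is an assumption that the paper uses this standard form, and the function name is made up for illustration.

```python
def equalize_histogram(image, levels=256):
    """Histogram-equalize a non-empty grayscale image (list of rows of ints)."""
    pixels = [value for row in image for value in row]
    counts = [0] * levels
    for value in pixels:
        counts[value] += 1

    # Cumulative distribution of intensities.
    cdf = []
    running = 0
    for count in counts:
        running += count
        cdf.append(running)

    total = len(pixels)
    cdf_min = next(c for c in cdf if c > 0)
    if total == cdf_min:
        # Constant image: there is no contrast to spread out.
        return [row[:] for row in image]

    # Map each intensity so the output histogram covers the full range.
    scale = (levels - 1) / (total - cdf_min)
    lookup = [round((c - cdf_min) * scale) if c > 0 else 0 for c in cdf]
    return [[lookup[value] for value in row] for row in image]
```

A low-contrast input such as `[[100, 100, 110], [110, 120, 130]]` is stretched so that its darkest pixels map to 0 and its brightest to 255, while the ordering of intensities is preserved.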

Preprocessing performance of convolutional neural networks according to characteristic of underwater targets (수중 표적 분류를 위한 합성곱 신경망의 전처리 성능 비교)

Kyung-Min, Park;Dooyoung, Kim
- The Journal of the Acoustical Society of Korea / v.41 no.6 / pp.629-636 / 2022
We present a preprocessing method for an underwater target detection model based on a convolutional neural network. The acoustic characteristics of a ship are expressed ambiguously because signal power is concentrated at low frequencies. To solve this problem, we combine various feature scaling methods with spectrogram methods. We define a simple convolutional neural network model and train it to measure preprocessing performance. In our experiments, the combination of the log Mel-spectrogram with standardization and robust scaling gave the best classification performance.
https://doi.org/10.7776/ASK.2022.41.6.629
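The two scaling methods named in the abstract can be sketched on a single vector of band powers. The code below applies log compression, then standardization (zero mean, unit variance) and robust scaling (median and interquartile range). It does not compute a Mel-spectrogram; the `eps` offset, the function names, and the quartile convention of Python's `statistics.quantiles` are assumptions of this sketch and may differ from the paper's pipeline.

```python
import math
import statistics

def log_compress(powers, eps=1e-10):
    """Log-scale band powers; eps (an assumed value) avoids log(0)."""
    return [math.log(p + eps) for p in powers]

def standardize(values):
    """Subtract the mean and divide by the population standard deviation."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    return [(v - mean) / std if std > 0 else 0.0 for v in values]

def robust_scale(values):
    """Subtract the median and divide by the interquartile range.

    Less sensitive to a few very strong bins than standardization.
    Requires at least two values.
    """
    median = statistics.median(values)
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    return [(v - median) / iqr if iqr > 0 else 0.0 for v in values]

# Hypothetical band powers with a dominant low-frequency bin.
powers = [1000.0, 10.0, 1.0, 0.1, 0.01]
log_powers = log_compress(powers)
z_scores = standardize(log_powers)
robust = robust_scale(log_powers)
```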

Preprocessing Technique for Lane Detection Using Image Clustering and HSV Color Model (영상 클러스터링과 HSV 컬러 모델을 이용한 차선 검출 전처리 기법)

Choi, Na-Rae;Choi, Sang-Il
- Journal of Korea Multimedia Society / v.20 no.2 / pp.144-152 / 2017
Among the technologies for implementing autonomous vehicles, the advanced driver assistance system is a key technology for supporting safe driving. Vision sensors are widely used for this purpose, and various preprocessing methods are applied before feature extraction for lane detection. With existing methods, however, objects in the road area such as cars, lawns, and road separators are extracted as lane candidates and become false positives. In addition, lane candidates sometimes cannot be extracted at all: in areas under an overpass, for lanes in dark shadow, for yellow center lanes, and for faint lanes. In this paper, we propose an efficient preprocessing method that uses k-means clustering for image division together with the HSV color model. When the proposed method is applied, the true positive regions are preserved as far as possible during lane detection and many false positive regions are removed.
https://doi.org/10.9717/kmms.2017.20.2.144
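The two ingredients named in the abstract, the HSV color model and k-means clustering, can be combined in a very reduced form: convert pixels to HSV and cluster them by brightness (the V channel) into a dark group and a bright group. The choice of clustering on V alone, the deterministic initialization, and the function names are assumptions of this sketch, not the paper's procedure.

```python
import colorsys

def rgb_to_hsv_pixels(pixels):
    """Convert 8-bit (r, g, b) tuples to (h, s, v) tuples in [0, 1]."""
    return [colorsys.rgb_to_hsv(r / 255, g / 255, b / 255) for r, g, b in pixels]

def kmeans_1d(values, k=2, iterations=20):
    """Plain k-means on scalars (k >= 2), centers spread evenly at the start."""
    lo, hi = min(values), max(values)
    centers = [lo + (hi - lo) * i / (k - 1) for i in range(k)]

    def nearest(value):
        return min(range(k), key=lambda i: abs(value - centers[i]))

    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for value in values:
            groups[nearest(value)].append(value)
        # Keep the old center when a cluster ends up empty.
        centers = [
            sum(group) / len(group) if group else centers[i]
            for i, group in enumerate(groups)
        ]
    return [nearest(value) for value in values], centers

# Hypothetical pixels: dark asphalt, a white marking, a yellow center line.
pixels = [(40, 40, 40), (50, 50, 50), (255, 255, 255), (255, 200, 0)]
brightness = [v for _, _, v in rgb_to_hsv_pixels(pixels)]
labels, centers = kmeans_1d(brightness)
```

Here the white and yellow pixels fall into the bright cluster because both have V = 1 in HSV, which hints at why a color model that separates brightness from hue can help with yellow lanes.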

Big Data Preprocessing for Predicting Box Office Success (영화 흥행 실적 예측을 위한 빅데이터 전처리)

Jun, Hee-Gook;Hyun, Geun-Soo;Lim, Kyung-Bin;Lee, Woo-Hyun;Kim, Hyoung-Joo
- KIISE Transactions on Computing Practices / v.20 no.12 / pp.615-622 / 2014
The Korean film market has rapidly achieved an international scale, and this has led to a need for decision-making based on analytical methods that are more precise and appropriate. In this modern era, a highly advanced information environment can provide an overwhelming amount of data that is generated in real time, and this data must be properly handled and analyzed in order to extract useful information. In particular, the preprocessing of large data, which is the most time-consuming step, should be done in a reasonable amount of time. In this paper, we investigated a big data preprocessing method for predicting movie box office success. We analyzed the movie data characteristics for specialized preprocessing methods, and used the Hadoop MapReduce framework. The experimental results showed that the preprocessing methods using big data techniques are more effective than existing methods.
https://doi.org/10.5626/KTCP.2014.20.12.615
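The abstract names the Hadoop MapReduce framework but gives no record layout, so the following is only a single-process sketch of the map, shuffle, and reduce pattern applied to a cleaning step. The `title,date,audience` line format and all function names are hypothetical; a real Hadoop job would distribute these phases across machines.

```python
from collections import defaultdict

def map_record(line):
    """Map step: parse 'title,date,audience'; emit nothing for malformed rows."""
    parts = [part.strip() for part in line.split(",")]
    if len(parts) != 3:
        return []
    title, _date, audience = parts
    try:
        count = int(audience)
    except ValueError:
        return []
    # Normalize the key so the same title in different casing is merged.
    return [(title.lower(), count)]

def shuffle(pairs):
    """Group mapped values by key, as the framework would between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_group(key, values):
    """Reduce step: total audience per title."""
    return key, sum(values)

def run_job(lines):
    pairs = [pair for line in lines for pair in map_record(line)]
    return dict(reduce_group(key, values) for key, values in shuffle(pairs).items())
```

Because `map_record` handles one line at a time and `reduce_group` handles one key at a time, both steps can run in parallel, which is what makes the pattern attractive for the time-consuming preprocessing stage the abstract describes.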

A Study on the Preprocessing Method Using Construction of Watershed for Character Image Segmentation

Nam Sang Yep;Choi Young Kyoo;Kwon Yun Jung;Lee Sung Chang
- Proceedings of the IEEK Conference / 2004.08c / pp.814-818 / 2004
Off-line handwritten character recognition suffers from incomplete preprocessing: it has no dynamic or timing information, handwriting varies widely, consonants and vowels overlap heavily, and stroke images contain many errors. It therefore calls for study of preprocessing methods such as binarization and thinning. This paper considers the running time of the watershed algorithm and the quality of the resulting image as preprocessing for off-line handwritten Korean character recognition. It proposes an effective application of the watershed algorithm for separating the character region from the background region in a gray-level character image, and a segmentation function that binarizes the image using the extracted watershed image. It also proposes a thinning method that extracts the skeleton effectively through a conditional test mask, taking running time and skeleton quality into account, and it evaluates existing methods and the proposed methods by running time and quality. The watershed image conversion uses the Prewitt operator to compute the gradient image and extracts local minima over the 8-neighborhood of each pixel. The region-merging step uses the difference of mean values. Applying the segmentation function to the converted watershed image separates the character region from the background region effectively. Average execution time was 2.16 seconds for the previous method and 1.72 seconds for the proposed method. We show that, compared with the previous method, the proposed method removes noise effectively in images with overlapping strokes.
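The first two steps of the watershed conversion described above, a Prewitt gradient image and local minima over the 8-neighborhood, can be sketched as follows. Border handling (gradient set to zero), the use of "not larger than any neighbour" as the minimum test, and the function names are assumptions; the flooding, region merging, and thinning stages of the paper are not shown.

```python
import math

def prewitt_gradient(image):
    """Gradient magnitude with the 3x3 Prewitt operator; borders are left at 0."""
    height, width = len(image), len(image[0])
    grad = [[0.0] * width for _ in range(height)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            # Horizontal response: right column minus left column.
            gx = sum(image[y + dy][x + 1] - image[y + dy][x - 1] for dy in (-1, 0, 1))
            # Vertical response: bottom row minus top row.
            gy = sum(image[y + 1][x + dx] - image[y - 1][x + dx] for dx in (-1, 0, 1))
            grad[y][x] = math.hypot(gx, gy)
    return grad

def local_minima(surface):
    """Return (row, col) positions not larger than any of their 8 neighbours."""
    height, width = len(surface), len(surface[0])
    minima = []
    for y in range(height):
        for x in range(width):
            neighbours = [
                surface[ny][nx]
                for ny in range(max(0, y - 1), min(height, y + 2))
                for nx in range(max(0, x - 1), min(width, x + 2))
                if (ny, nx) != (y, x)
            ]
            if all(surface[y][x] <= n for n in neighbours):
                minima.append((y, x))
    return minima
```

In a watershed segmentation the minima of the gradient image serve as the seeds from which regions grow, so on a flat gradient every pixel qualifies and a real implementation would group such plateaus into single seeds.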

Preprocessing in large scale linear programming problems (대형선형계획문제의 사전처리)

성명기;박순달
- Proceedings of the Korean Operations and Management Science Society Conference / 1996.10a / pp.285-288 / 1996
MPS, the format standardized by IBM, is the usual input format for large-scale linear programming problems, and a problem given in it may contain unnecessary variables or constraints. These can be discarded by preprocessing. Because preprocessing reduces the size of a problem, it also reduces the running time. Moreover, the infeasibility of a problem may be detected before a solution method is run. On NETLIB problems, the preprocessing implemented in this paper removes 21% of the variables and 15% of the constraints as unnecessary. With the interior point method, preprocessing reduces running time by 21% on average. Preprocessing detects 10 of the 30 infeasible NETLIB problems.
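Two elementary checks show how preprocessing can both shrink a problem and detect infeasibility before a solver runs. The sketch below assumes constraints of the form A x = b with x >= 0, stored as dense rows. It drops empty rows, fixes variables from singleton rows, and reports infeasibility when a row cannot be satisfied. The problem form and function name are assumptions; the paper's actual rules are not listed in the abstract.

```python
def presolve_rows(rows, rhs):
    """One pass over the rows of A x = b with x >= 0.

    Returns (status, kept_row_indices, fixed), where status is "ok" or
    "infeasible" and fixed maps a column index to its forced value.
    Fixed values are not substituted into the remaining rows here.
    """
    kept, fixed = [], {}
    for i, (coeffs, b) in enumerate(zip(rows, rhs)):
        nonzero = [(j, a) for j, a in enumerate(coeffs) if a != 0]
        if not nonzero:
            if b != 0:
                return "infeasible", [], {}  # 0 = b with b != 0
            continue  # 0 = 0 is redundant
        if len(nonzero) == 1:
            j, a = nonzero[0]
            value = b / a
            # A singleton row forces x_j; it must be nonnegative and consistent.
            if value < 0 or (j in fixed and fixed[j] != value):
                return "infeasible", [], {}
            fixed[j] = value
            continue
        kept.append(i)
    return "ok", kept, fixed
```

For instance, `presolve_rows([[0, 0, 0], [2, 0, 0], [1, 1, 1]], [0, 4, 5])` drops the empty first row, fixes the first variable at 2.0 from the second row, and keeps only the third row for the solver.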

A Study on Image Preprocessing Methods for Automatic Detection of Ship Corrosion Based on Deep Learning (딥러닝 기반 선박 부식 자동 검출을 위한 이미지 전처리 방안 연구)

Yun, Gwang-ho;Oh, Sang-jin;Shin, Sung-chul
- Journal of the Korean Society of Industry Convergence / v.25 no.4_2 / pp.573-586 / 2022
Corrosion can cause dangerous and expensive damage and failures of ship hulls and equipment, so vessels must be maintained through periodic corrosion inspections. During visual inspection, many corrosion locations are inaccessible for various reasons, especially from a safety point of view. Another issue with visual inspection is that it involves the inspectors' subjective judgment. Many studies have attempted to automate visual inspection. In this study, we propose image preprocessing methods based on image patch segmentation and thresholding. YOLOv5 was used as the object detection model after the image preprocessing. The evaluation showed that the proposed method improved corrosion detection performance in terms of mean average precision.
https://doi.org/10.21289/KSIC.2022.25.4.573
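The two preprocessing operations named in the abstract, patch segmentation and thresholding, can be sketched on a grayscale image stored as a list of rows. Non-overlapping square patches, a single global cutoff, and the function names are assumptions of this sketch; the paper's patch size, threshold rule, and the YOLOv5 stage are not reproduced.

```python
def split_into_patches(image, patch_size):
    """Cut an image (list of rows) into non-overlapping square patches.

    Returns ((top, left), patch) pairs. Patches on the right and bottom
    edges are smaller when the image size is not a multiple of patch_size.
    """
    height, width = len(image), len(image[0])
    patches = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            patch = [
                row[left:left + patch_size]
                for row in image[top:top + patch_size]
            ]
            patches.append(((top, left), patch))
    return patches

def threshold(patch, cutoff):
    """Binarize a patch: 1 where the pixel is at or above cutoff, else 0."""
    return [[1 if value >= cutoff else 0 for value in row] for row in patch]
```

Keeping the `(top, left)` offset with each patch lets detections made on a patch be mapped back to coordinates in the full image.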

A Concordance Study of the Preprocessing Orders in Microarray Data (마이크로어레이 자료의 사전 처리 순서에 따른 검색의 일치도 분석)

Kim, Sang-Cheol;Lee, Jae-Hwi;Kim, Byung-Soo
- The Korean Journal of Applied Statistics / v.22 no.3 / pp.585-594 / 2009
Researchers in microarray experiments convert the processed images of raw data into data suitable for statistical analysis; this conversion is preprocessing. Microarray preprocessing consists of image filtering, imputation, and normalization. Several normalization and imputation methods have been studied, but the order in which the two procedures should be applied has not. This study examines how the order of the preprocessing steps affects the identification of differentially expressed genes (DEG), using two-dye cDNA microarray data from colon cancer and gastric cancer. That is, we compare which combinations of imputation and normalization steps detect the DEG. We used two imputation methods (K-nearest neighbor, Bayesian principal component analysis) and three normalization methods (global, within-print tip group, variance stabilization), which gives 12 preprocessing procedures. We computed a concordance measure of DEG across the datasets to which the 12 different preprocessing orders were applied. When variance stabilization was used as the normalization method, DEG detection varied only slightly.
https://doi.org/10.5351/KJAS.2009.22.3.585
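One reading of the count of 12 is two imputation methods times three normalization methods times two possible orders (2 × 3 × 2 = 12); the abstract does not spell this out, so treat it as an assumption. The sketch below enumerates the procedures under that reading. The abstract also does not define its concordance measure, so the Jaccard overlap shown here is a stand-in chosen for illustration, and all identifiers are hypothetical.

```python
from itertools import product

IMPUTATION = ["knn", "bpca"]
NORMALIZATION = ["global", "within_print_tip", "variance_stabilization"]
ORDERS = ["impute_then_normalize", "normalize_then_impute"]

def enumerate_pipelines():
    """All (imputation, normalization, order) combinations."""
    return list(product(IMPUTATION, NORMALIZATION, ORDERS))

def jaccard(genes_a, genes_b):
    """Overlap of two DEG lists: |A and B| / |A or B| (1.0 if both are empty)."""
    a, b = set(genes_a), set(genes_b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)

pipelines = enumerate_pipelines()
```

Comparing the DEG list from each pipeline against every other with such an overlap score gives a 12 × 12 table from which order effects can be read off.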

