• Title/Summary/Keyword: UCI

Search Result 194, Processing Time 0.023 seconds

Active Selection of Label Data for Semi-Supervised Learning Algorithm (준감독 학습 알고리즘을 위한 능동적 레이블 데이터 선택)

  • Han, Ji-Ho;Park, Eun-Ae;Park, Dong-Chul;Lee, Yunsik;Min, Soo-Young
    • Journal of IKEEE
    • /
    • v.17 no.3
    • /
    • pp.254-259
    • /
    • 2013
  • The choice of labeled data in semi-supervised learning algorithm can result in effects on the performance of the resultant classifier. In order to select labeled data required for the training of a semi-supervised learning algorithm, VCNN(Vector Centroid Neural Network) is proposed in this paper. The proposed selection method of label data is evaluated on UCI dataset and caltech dataset. Experiments and results show that the proposed selection method outperforms conventional methods in terms of classification accuracy and minimum error rate.

Design of Gas Classifier Based On Artificial Neural Network (인공신경망 기반 가스 분류기의 설계)

  • Jeong, Woojae;Kim, Minwoo;Cho, Jaechan;Jung, Yunho
    • Journal of IKEEE
    • /
    • v.22 no.3
    • /
    • pp.700-705
    • /
    • 2018
  • In this paper, we propose the gas classifier based on restricted column energy neural network (RCE-NN) and present its hardware implementation results for real-time learning and classification. Since RCE-NN has a flexible network architecture with real-time learning process, it is suitable for gas classification applications. The proposed gas classifier showed 99.2% classification accuracy for the UCI gas dataset and was implemented with 26,702 logic elements with Intel-Altera cyclone IV FPGA. In addition, it was verified with FPGA test system at an operating frequency of 63MHz.

Experimental Performance Evaluation of Complex Behavior Connector by Scaled Model (축소모형에 의한 복합거동 연결체의 실험적 성능 평가)

  • Kim, Kisung;Kang, Hyounhoi;Park, Jeongjun
    • Journal of the Society of Disaster Information
    • /
    • v.13 no.2
    • /
    • pp.130-138
    • /
    • 2017
  • The connector of the complex behavior is to connect the individual piles of the pile to the lower foundation of the oil sand plant where the floating foundation is used. In this study, to verify the shape of a connector of the complex behavior for applying the advantages of existing group pile and piled raft foundation to an oil sand plant, a scaled model was constructed to measure the behavior of the load.

Support Vector Machine Algorithm for Imbalanced Data Learning (불균형 데이터 학습을 위한 지지벡터기계 알고리즘)

  • Kim, Kwang-Seong;Hwang, Doo-Sung
    • Journal of the Korea Society of Computer and Information
    • /
    • v.15 no.7
    • /
    • pp.11-17
    • /
    • 2010
  • This paper proposes an improved SMO solving a quadratic optmization problem for class imbalanced learning. The SMO algorithm is aproporiate for solving the optimization problem of a support vector machine that assigns the different regularization values to the two classes, and the prosoposed SMO learning algorithm iterates the learning steps to find the current optimal solutions of only two Lagrange variables selected per class. The proposed algorithm is tested with the UCI benchmarking problems and compared to the experimental results of the SMO algorithm with the g-mean measure that considers class imbalanced distribution for gerneralization performance. In comparison to the SMO algorithm, the proposed algorithm is effective to improve the prediction rate of the minority class data and could shorthen the training time.

Naive Bayes Learning Algorithm based on Map-Reduce Programming Model (Map-Reduce 프로그래밍 모델 기반의 나이브 베이스 학습 알고리즘)

  • Kang, Dae-Ki
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2011.10a
    • /
    • pp.208-209
    • /
    • 2011
  • In this paper, we introduce a Naive Bayes learning algorithm for learning and reasoning in Map-Reduce model based environment. For this purpose, we use Apache Mahout to execute Distributed Naive Bayes on University of California, Irvine (UCI) benchmark data sets. From the experimental results, we see that Apache Mahout' s Distributed Naive Bayes algorithm is comparable to WEKA' s Naive Bayes algorithm in terms of performance. These results indicates that in the future Big Data environment, Map-Reduce model based systems such as Apache Mahout can be promising for machine learning usage.

  • PDF

A Differential Evolution based Support Vector Clustering (차분진화 기반의 Support Vector Clustering)

  • Jun, Sung-Hae
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.17 no.5
    • /
    • pp.679-683
    • /
    • 2007
  • Statistical learning theory by Vapnik consists of support vector machine(SVM), support vector regression(SVR), and support vector clustering(SVC) for classification, regression, and clustering respectively. In this algorithms, SVC is good clustering algorithm using support vectors based on Gaussian kernel function. But, similar to SVM and SVR, SVC needs to determine kernel parameters and regularization constant optimally. In general, the parameters have been determined by the arts of researchers and grid search which is demanded computing time heavily. In this paper, we propose a differential evolution based SVC(DESVC) which combines differential evolution into SVC for efficient selection of kernel parameters and regularization constant. To verify improved performance of our DESVC, we make experiments using the data sets from UCI machine learning repository and simulation.

Proposal of Weight Adjustment Methods Using Statistical Information in Fuzzy Weighted Mean Classifiers (퍼지 가중치 평균 분류기에서 통계 정보를 활용한 가중치 설정 기법의 제안)

  • Woo, Young-Woon;Heo, Gyeong-Yong;Kim, Kwang-Baek
    • Journal of the Korea Society of Computer and Information
    • /
    • v.14 no.7
    • /
    • pp.9-15
    • /
    • 2009
  • The fuzzy weighted mean classifier is one of the most common classification models and could achieve high performance by adjusting the weights. However, the weights were generally decided based on the experience of experts, which made the resulting classifiers to suffer the lack of consistency and objectivity. To resolve this problem, in this paper, a weight deciding method based on the statistics of the data is introduced, which ensures the learned classifiers to be consistent and objective. To investigate the effectiveness of the proposed methods, Iris data set available from UCI machine learning repository is used and promising results are obtained.

Analysis of the applicability of parameter estimation methods for a transient storage model (저장대모형의 매개변수 산정을 위한 최적화 기법의 적합성 분석)

  • Noh, Hyoseob;Baek, Donghae;Seo, Il Won
    • Journal of Korea Water Resources Association
    • /
    • v.52 no.10
    • /
    • pp.681-695
    • /
    • 2019
  • A Transient Storage Model (TSM) is one of the most widely used model accounting for complex solute transport in natural river to understanding natural river properties with four TSM key parameters. The TSM parameters are estimated via inverse modeling. Parameter estimation of the TSM is carried out by solving optimization problem about finding best fitted simulation curve with measured curve obtained from tracer test. Several studies have reported uncertainty in parameter estimation from non-convexity of the problem. In this study, we assessed best combination of optimization method and objective function for TSM parameter estimation using Cheong-mi Creek tracer test data. In order to find best optimization setting guaranteeing convergence and speed, Evolutionary Algorithm (EA) based global optimization methods, such as CCE of SCE-UA and MCCE of SP-UCI, and error based objective functions were compared, using Shuffled Complex-Self Adaptive Hybrid EvoLution (SC-SAHEL). Overall results showed that multi-EA SC-SAHEL with Percent Mean Squared Error (PMSE) objective function is the best optimization setting which is fastest and stable method in convergence.

Development of a Rotation Swab Pig Method for Cleaning Water Pipes (상수관의 세척을 위한 회전식 스왑피그 공법 개발)

  • Kicheol Lee;Jaeho Kim;Kisung Kim;Jeongjun Park
    • Journal of the Korean Geosynthetics Society
    • /
    • v.23 no.2
    • /
    • pp.63-75
    • /
    • 2024
  • Drinking water is an essential element to ensure the basic human right to live, and the quality of clean water must always be ensured. However, domestic water facilities, which were installed intensively in the early 2000s, are deteriorating. The accidents such as discoloration of water such as chromaticity and turbidity as well as leakage of substances frequently occur. However, since it is virtually impossible to replace all water pipes, the detailed standards for maintenance of water pipe network facilities established in 2021 require water pipe cleaning. The swab pig method, one of the water pipe cleaning methods, is a method of physically removing substances in pipes and is evaluated as having the highest cleaning efficiency. However, Swab is highly likely to be damaged or deformed during the cleaning process, and may even be lost. Therefore, in this study, the material of the pig was changed to a material with high compressibility, and it was made as close as possible to the inner wall of the water pipe. And, to maximize cleaning efficiency, a rotation swab pig with a rotation blade was developed. In addition, high-strength wire and winding equipment were additionally developed to eliminate the possibility of loss and to determine the location of the pig. The inlet and outlet are connected with wires, and after verifying the performance of each detailed technology, the technology was applied on a test bed with a 30m section. As a result of the application, the performance of the technology was verified by measuring the process time and evaluating applicability.

Discretization Method Based on Quantiles for Variable Selection Using Mutual Information

  • CHa, Woon-Ock;Huh, Moon-Yul
    • Communications for Statistical Applications and Methods
    • /
    • v.12 no.3
    • /
    • pp.659-672
    • /
    • 2005
  • This paper evaluates discretization of continuous variables to select relevant variables for supervised learning using mutual information. Three discretization methods, MDL, Histogram and 4-Intervals are considered. The process of discretization and variable subset selection is evaluated according to the classification accuracies with the 6 real data sets of UCI databases. Results show that 4-Interval discretization method based on quantiles, is robust and efficient for variable selection process. We also visually evaluate the appropriateness of the selected subset of variables.