• Title/Summary/Keyword: data field selection

Search Result 406, Processing Time 0.036 seconds

Bayesian Typhoon Track Prediction Using Wind Vector Data

  • Han, Minkyu;Lee, Jaeyong
    • Communications for Statistical Applications and Methods
    • /
    • v.22 no.3
    • /
    • pp.241-253
    • /
    • 2015
  • In this paper we predict the track of typhoons using a Bayesian principal component regression model based on wind field data. Data is obtained at each time point and we applied the Bayesian principal component regression model to conduct the track prediction based on the time point. Based on regression model, we applied to variable selection prior and two kinds of prior distribution; normal and Laplace distribution. We show prediction results based on Bayesian Model Averaging (BMA) estimator and Median Probability Model (MPM) estimator. We analysis 8 typhoons in 2006 using data obtained from previous 6 years (2000-2005). We compare our prediction results with a moving-nest typhoon model (MTM) proposed by the Korea Meteorological Administration. We posit that is possible to predict the track of a typhoon accurately using only a statistical model and without a dynamical model.

Improvement of Classification Accuracy on Success and Failure Factors in Software Reuse using Feature Selection (특징 선택을 이용한 소프트웨어 재사용의 성공 및 실패 요인 분류 정확도 향상)

  • Kim, Young-Ok;Kwon, Ki-Tae
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.2 no.4
    • /
    • pp.219-226
    • /
    • 2013
  • Feature selection is the one of important issues in the field of machine learning and pattern recognition. It is the technique to find a subset from the source data and can give the best classification performance. Ie, it is the technique to extract the subset closely related to the purpose of the classification. In this paper, we experimented to select the best feature subset for improving classification accuracy when classify success and failure factors in software reuse. And we compared with existing studies. As a result, we found that a feature subset was selected in this study showed the better classification accuracy.

Copula entropy and information diffusion theory-based new prediction method for high dam monitoring

  • Zheng, Dongjian;Li, Xiaoqi;Yang, Meng;Su, Huaizhi;Gu, Chongshi
    • Earthquakes and Structures
    • /
    • v.14 no.2
    • /
    • pp.143-153
    • /
    • 2018
  • Correlation among different factors must be considered for selection of influencing factors in safety monitoring of high dam including positive correlation of variables. Therefore, a new factor selection method was constructed based on Copula entropy and mutual information theory, which was deduced and optimized. Considering the small sample size in high dam monitoring and distribution of daily monitoring samples, a computing method that avoids causality of structure as much as possible is needed. The two-dimensional normal information diffusion and fuzzy reasoning of pattern recognition field are based on the weight theory, which avoids complicated causes of the studying structure. Hence, it is used to dam safety monitoring field and simplified, which increases sample information appropriately. Next, a complete system integrating high dam monitoring and uncertainty prediction method was established by combining Copula entropy theory and information diffusion theory. Finally, the proposed method was applied in seepage monitoring of Nuozhadu clay core-wall rockfill dam. Its selection of influencing factors and processing of sample data were compared with different models. Results demonstrated that the proposed method increases the prediction accuracy to some extent.

The Analysis on the Upsteam band Signal in the HFC Access Network (HFC 가입자망 상향대역 신호분석에 관한 연구)

  • 장문종;김선익;이진기
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2004.10c
    • /
    • pp.142-144
    • /
    • 2004
  • To provide more qualified data service on the HFC(Hybrid-Fiber Coaxial) access network, the channel characteristics of upstream transmission band should be carefully investigated and analysed. It will be easier to do network management if the monitoring system for noise measurement in the network is available, In this paper, noise analysis method and the frequency selection method in the upstream band for duplex transmission are suggested. And, Data aquisition device for the signal measurement Is implemented. With this network monitoring system, field test and the result from the collected data are described.

  • PDF

The selection of RCM analysis system for efficient PM Tasks (효과적인 PM 업무를 위한 RCM분석대상 시스템의 선정)

  • Kim, Min-Ho;Song, Kee-Tae;Baek, Young-Gu;Lee, Key-Seo;Yoon, Hwa-Hyun
    • Proceedings of the KSR Conference
    • /
    • 2007.11a
    • /
    • pp.784-791
    • /
    • 2007
  • Most operational organization and railway authority which conduct scheduled maintenance(SM) have carried out the preventive maintenance(PM) based on the information provided from supplier and manufacturer of railway system. However these activities are far away from reality and low the efficiency, it is because an appropriate methods for system selection didn't take into account for improving maintenance efficiency. Therefore, the current SM tasks and maintenance activities lead to lots of spend on the cost and time. To solve the above problem, this thesis presents new approach methodology. This proposes the criteria for reliability centered maintenance(RCM) system selection through level of quantification of each parameter, i.e, frequency, severity and maintenance cost, etc. To do this, the field operation data and information of maintenance cost are essential. As applying this methodology, we can look forward to improving efficiency of PM/SM, and reducing cost.

  • PDF

Development of an Expert System for Rapid Prototyping Machine Selection (쾌속조형장비 선정을 위한 전문가시스템 개발)

  • 정일용;이일랑;최병욱
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 2002.10a
    • /
    • pp.632-635
    • /
    • 2002
  • There are more than five dozen different RP(rapid prototyping) systems in the world and they are fairly expensive. All those systems have different capabilities and requirements in that each of them gives different tolerance, application field and part strength, etc. This situation may cause a problem of selecting an appropriate RP system. This paper presents an expert system, utilizing an algorithm that is composed up of rules to derive recommendations and answers to queries of the RP users. The expert system incorporates RP machines commercially available and adopts multi-selection criteria, namely, machine price, accuracy, build size, adopted process, etc. In the expert system, forward reasoning method is adopted and external spreadsheet for sub-data of the RP systems is used. The rules and knowledge are obtained from interviews and discussions with RP vendors and users, appropriate research publications and other reference materials.

  • PDF

Performance Improvement of Feature Selection Methods based on Bio-Inspired Algorithms (생태계 모방 알고리즘 기반 특징 선택 방법의 성능 개선 방안)

  • Yun, Chul-Min;Yang, Ji-Hoon
    • The KIPS Transactions:PartB
    • /
    • v.15B no.4
    • /
    • pp.331-340
    • /
    • 2008
  • Feature Selection is one of methods to improve the classification accuracy of data in the field of machine learning. Many feature selection algorithms have been proposed and discussed for years. However, the problem of finding the optimal feature subset from full data still remains to be a difficult problem. Bio-inspired algorithms are well-known evolutionary algorithms based on the principles of behavior of organisms, and very useful methods to find the optimal solution in optimization problems. Bio-inspired algorithms are also used in the field of feature selection problems. So in this paper we proposed new improved bio-inspired algorithms for feature selection. We used well-known bio-inspired algorithms, Genetic Algorithm (GA) and Particle Swarm Optimization (PSO), to find the optimal subset of features that shows the best performance in classification accuracy. In addition, we modified the bio-inspired algorithms considering the prior importance (prior relevance) of each feature. We chose the mRMR method, which can measure the goodness of single feature, to set the prior importance of each feature. We modified the evolution operators of GA and PSO by using the prior importance of each feature. We verified the performance of the proposed methods by experiment with datasets. Feature selection methods using GA and PSO produced better performances in terms of the classification accuracy. The modified method with the prior importance demonstrated improved performances in terms of the evolution speed and the classification accuracy.

A Study of Authorized Stockage List Selection using Market Basket Analysis (장바구니 분석을 활용한 ASL 선정 연구)

  • Choi, Myoung-Jin
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.35 no.2
    • /
    • pp.163-172
    • /
    • 2012
  • In this study, It is assumed that customers are both usage unit of spare parts and stores of displaying and selling the goods that are installation unit of having the spare parts. The demand pattern through the effective order of spare parts and issue list in installation unit is investigated based on the assumption. Current ASL (Authorized Stockage List) selection of the army has been conducted in the way of using the analysis result of real usage experiences on spare parts used during the Korea War. For this study, ASL selection criteria and procedures based on army regulations and field manuals are specified. Since the traditional method does not presents the association analysis on spare parts used for the current equipment operating and does not have the clear criterion and analysis system about the ASL selection, in order to solve these problems, it was carried out that the association rule is employed for analyzing relationship between the effective order and issue list of the spare parts in point of the spare parts between usage unit and occurring month about purchase spare parts based on the star-schema table. Finally the new ASL selection way using the analysis result is proposed.

On the Negative Estimates of Direct and Maternal Genetic Correlation - A Review

  • Lee, C.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.15 no.8
    • /
    • pp.1222-1226
    • /
    • 2002
  • Estimates of genetic correlation between direct and maternal effects for weaning weight of beef cattle are often negative in field data. The biological existence of this genetic antagonism has been the point at issue. Some researchers perceived such negative estimate to be an artifact from poor modeling. Recent studies on sources affecting the genetic correlation estimates are reviewed in this article. They focus on heterogeneity of the correlation by sex, selection bias caused from selective reporting, selection bias caused from splitting data by sex, sire by year interaction variance, and sire misidentification and inbreeding depression as factors contributing sire by year interaction variance. A biological justification of the genetic antagonism is also discussed. It is proposed to include the direct-maternal genetic covariance in the analytical models.

Significant Gene Selection Using Integrated Microarray Data Set with Batch Effect

  • Kim Ki-Yeol;Chung Hyun-Cheol;Jeung Hei-Cheul;Shin Ji-Hye;Kim Tae-Soo;Rha Sun-Young
    • Genomics & Informatics
    • /
    • v.4 no.3
    • /
    • pp.110-117
    • /
    • 2006
  • In microarray technology, many diverse experimental features can cause biases including RNA sources, microarray production or different platforms, diverse sample processing and various experiment protocols. These systematic effects cause a substantial obstacle in the analysis of microarray data. When such data sets derived from different experimental processes were used, the analysis result was almost inconsistent and it is not reliable. Therefore, one of the most pressing challenges in the microarray field is how to combine data that comes from two different groups. As the novel trial to integrate two data sets with batch effect, we simply applied standardization to microarray data before the significant gene selection. In the gene selection step, we used new defined measure that considers the distance between a gene and an ideal gene as well as the between-slide and within-slide variations. Also we discussed the association of biological functions and different expression patterns in selected discriminative gene set. As a result, we could confirm that batch effect was minimized by standardization and the selected genes from the standardized data included various expression pattems and the significant biological functions.