• Title/Summary/Keyword: Statistical Learning Model

Search Result 518, Processing Time 0.031 seconds

A Co-Evolutionary Computing for Statistical Learning Theory

  • Jun Sung-Hae
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.5 no.4
    • /
    • pp.281-285
    • /
    • 2005
  • Learning and evolving are two basics for data mining. As compared with classical learning theory based on objective function with minimizing training errors, the recently evolutionary computing has had an efficient approach for constructing optimal model without the minimizing training errors. The global search of evolutionary computing in solution space can settle the local optima problems of learning models. In this research, combining co-evolving algorithm into statistical learning theory, we propose an co-evolutionary computing for statistical learning theory for overcoming local optima problems of statistical learning theory. We apply proposed model to classification and prediction problems of the learning. In the experimental results, we verify the improved performance of our model using the data sets from UCI machine learning repository and KDD Cup 2000.

Machine Learning vs. Statistical Model for Prediction Modelling: Application in Medical Imaging Research (예측모형의 머신러닝 방법론과 통계학적 방법론의 비교: 영상의학 연구에서의 적용)

  • Leeha Ryu;Kyunghwa Han
    • Journal of the Korean Society of Radiology
    • /
    • v.83 no.6
    • /
    • pp.1219-1228
    • /
    • 2022
  • Clinical prediction models has been increasingly published in radiology research. In particular, as a radiomics research is being actively conducted, the prediction model is developed based on the traditional statistical model, as well as machine learning, to account for the high-dimensional data. In this review, we investigated the statistical and machine learning methods used in clinical prediction model research, and briefly summarized each analytical method for statistical model, machine learning, and statistical learning. Finally, we discussed several considerations for choosing the prediction modeling method.

Fostering Students' Statistical Thinking through Data Modelling

  • Ken W. Li
    • Research in Mathematical Education
    • /
    • v.26 no.3
    • /
    • pp.127-146
    • /
    • 2023
  • Statistical thinking has a broad definition but focuses on the context of regression modelling in the present study. To foster students' statistical thinking within the context, teaching should no longer be seen as transfer of knowledge from teacher to students but as a process of engaging with learning activities in which they develop ownership of knowledge. This study aims at collaborative learning contexts; students were divided into small groups in order to increase opportunities for peer collaboration. Each group of students was asked to do a regression project after class. Through doing the project, they learnt to organize and connect previously accrued piecemeal statistical knowledge in an integrated manner. They could also clarify misunderstandings and solve problems through verbal exchanges among themselves. They gave a clear and lucid account of the model they had built and showed collaborative interactions when presenting their projects in front of class. A survey was conducted to solicit their feedback on how peer collaboration would facilitate learning of statistics. Almost all students found their interaction with their peers productive; they focused on the development of statistical thinking with concerted effort.

A RESEARCH ANALYSIS ON EFFECTIVE LEARNING IN INTERNATIONAL CONSTRUCTION JOINT VENTURES

  • L.T. Zhang;W.F. Wong;Charles Y.J. Cheah
    • International conference on construction engineering and project management
    • /
    • 2007.03a
    • /
    • pp.450-458
    • /
    • 2007
  • This paper presents the results of a statistical analysis and its research findings focusing on the learning aspect in the process of international joint ventures (IJVs). The contents of this paper is derived from a sample of 96 field cases based on a proposed conceptual model of effective learning for international construction joint ventures (ICJVs). The paper presents a brief review on the conceptual model with hypotheses and summarized the key results of statistical analysis including factor and multiple regression analysis for the testing of the validity of the proposed conceptual model and its associated research hypotheses. Among other research findings, the research confirms that ICJVs provides an excellent platform of in-action learning for construction organization and suggests that good outcomes in learning could be reaped by a company who has a clear learning intent from the beginning and subsequently take corresponding learning actions during the full process of the joint venture.

  • PDF

Fault Prediction Using Statistical and Machine Learning Methods for Improving Software Quality

  • Malhotra, Ruchika;Jain, Ankita
    • Journal of Information Processing Systems
    • /
    • v.8 no.2
    • /
    • pp.241-262
    • /
    • 2012
  • An understanding of quality attributes is relevant for the software organization to deliver high software reliability. An empirical assessment of metrics to predict the quality attributes is essential in order to gain insight about the quality of software in the early phases of software development and to ensure corrective actions. In this paper, we predict a model to estimate fault proneness using Object Oriented CK metrics and QMOOD metrics. We apply one statistical method and six machine learning methods to predict the models. The proposed models are validated using dataset collected from Open Source software. The results are analyzed using Area Under the Curve (AUC) obtained from Receiver Operating Characteristics (ROC) analysis. The results show that the model predicted using the random forest and bagging methods outperformed all the other models. Hence, based on these results it is reasonable to claim that quality models have a significant relevance with Object Oriented metrics and that machine learning methods have a comparable performance with statistical methods.

Statistical Inference in Non-Identifiable and Singular Statistical Models

  • Amari, Shun-ichi;Amari, Shun-ichi;Tomoko Ozeki
    • Journal of the Korean Statistical Society
    • /
    • v.30 no.2
    • /
    • pp.179-192
    • /
    • 2001
  • When a statistical model has a hierarchical structure such as multilayer perceptrons in neural networks or Gaussian mixture density representation, the model includes distribution with unidentifiable parameters when the structure becomes redundant. Since the exact structure is unknown, we need to carry out statistical estimation or learning of parameters in such a model. From the geometrical point of view, distributions specified by unidentifiable parameters become a singular point in the parameter space. The problem has been remarked in many statistical models, and strange behaviors of the likelihood ratio statistics, when the null hypothesis is at a singular point, have been analyzed so far. The present paper studies asymptotic behaviors of the maximum likelihood estimator and the Bayesian predictive estimator, by using a simple cone model, and show that they are completely different from regular statistical models where the Cramer-Rao paradigm holds. At singularities, the Fisher information metric degenerates, implying that the cramer-Rao paradigm does no more hold, and that he classical model selection theory such as AIC and MDL cannot be applied. This paper is a first step to establish a new theory for analyzing the accuracy of estimation or learning at around singularities.

  • PDF

Prediction of English Premier League Game Using an Ensemble Technique (앙상블 기법을 통한 잉글리시 프리미어리그 경기결과 예측)

  • Yi, Jae Hyun;Lee, Soo Won
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.9 no.5
    • /
    • pp.161-168
    • /
    • 2020
  • Predicting outcome of the sports enables teams to establish their strategy by analyzing variables that affect overall game flow and wins and losses. Many studies have been conducted on the prediction of the outcome of sports events through statistical techniques and machine learning techniques. Predictive performance is the most important in a game prediction model. However, statistical and machine learning models show different optimal performance depending on the characteristics of the data used for learning. In this paper, we propose a new ensemble model to predict English Premier League soccer games using statistical models and the machine learning models which showed good performance in predicting the results of the soccer games and this model is possible to select a model that performs best when predicting the data even if the data are different. The proposed ensemble model predicts game results by learning the final prediction model with the game prediction results of each single model and the actual game results. Experimental results for the proposed model show higher performance than the single models.

Hybrid Statistical Learning Model for Intrusion Detection of Networks (네트워크 침입 탐지를 위한 변형된 통계적 학습 모형)

  • Jun, Sung-Hae
    • The KIPS Transactions:PartC
    • /
    • v.10C no.6
    • /
    • pp.705-710
    • /
    • 2003
  • Recently, most interchanges of information have been performed in the internet environments. So, the technuque, which is used as intrusion deleting tool for system protecting against attack, is very important. But, the skills of intrusion detection are newer and more delicate, we need preparations for defending from these attacks. Currently, lots of intrusion detection systemsmake the midel of intrusion detection rule using experienced data, based on this model they have the strategy of defence against attacks. This is not efficient for defense from new attack. In this paper, a new model of intrusion detection is proposed. This is hybrid statistical learning model using likelihood ratio test and statistical learning theory, then this model can detect a new attack as well as experienced attacks. This strategy performs intrusion detection according to make a model by finding abnomal attacks. Using KDD Cup-99 task data, we can know that the proposed model has a good result of intrusion detection.

One-dimensional CNN Model of Network Traffic Classification based on Transfer Learning

  • Lingyun Yang;Yuning Dong;Zaijian Wang;Feifei Gao
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.18 no.2
    • /
    • pp.420-437
    • /
    • 2024
  • There are some problems in network traffic classification (NTC), such as complicated statistical features and insufficient training samples, which may cause poor classification effect. A NTC architecture based on one-dimensional Convolutional Neural Network (CNN) and transfer learning is proposed to tackle these problems and improve the fine-grained classification performance. The key points of the proposed architecture include: (1) Model classification--by extracting normalized rate feature set from original data, plus existing statistical features to optimize the CNN NTC model. (2) To apply transfer learning in the classification to improve NTC performance. We collect two typical network flows data from Youku and YouTube, and verify the proposed method through extensive experiments. The results show that compared with existing methods, our method could improve the classification accuracy by around 3-5%for Youku, and by about 7 to 27% for YouTube.

An Empirical Study on Improving the Accuracy of Demand Forecasting Based on Multi-Machine Learning (다중 머신러닝 기법을 활용한 무기체계 수리부속 수요예측 정확도 개선에 관한 실증연구)

  • Myunghwa Kim;Yeonjun Lee;Sangwoo Park;Kunwoo Kim;Taehee Kim
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.27 no.3
    • /
    • pp.406-415
    • /
    • 2024
  • As the equipment of the military has become more advanced and expensive, the cost of securing spare parts is also constantly increasing along with the increase in equipment assets. In particular, forecasting demand for spare parts one of the important management tasks in the military, and the accuracy of these predictions is directly related to military operations and cost management. However, because the demand for spare parts is intermittent and irregular, it is often difficult to make accurate predictions using traditional statistical methods or a single statistical or machine learning model. In this paper, we propose a model that can increase the accuracy of demand forecasting for irregular patterns of spare parts demanding by using a combination of statistical and machine learning algorithm, and through experiments on Cheonma spare parts demanding data.