• Title/Summary/Keyword: Statistical Learning

Search Result 1,289, Processing Time 0.025 seconds

Implementation of Efficient Weather Forecasting Model Using the Selecting Concentration Learning of Neural Network (신경망의 선별학습 집중화를 이용한 효율적 온도변화예측모델 구현)

  • 이기준;강경아;정채영
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.25 no.6B
    • /
    • pp.1120-1126
    • /
    • 2000
  • Recently, in order to analyze the time series problems that occur in the nature word, and analyzing method using a neural electric network is being studied more than a typical statistical analysis method. A neural electric network has a generalization performance that is possible to estimate and analyze about non-learning data through the learning of a population. In this paper, after collecting weather datum that was collected from 1987 to 1996 and learning a population established, it suggests the weather forecasting system for an estimation and analysis the future weather. The suggested weather forecasting system uses 28*30*1 neural network structure, raises the total learning numbers and accuracy letting the selecting concentration learning about the pattern, that is not collected, using the descending epsilon learning method. Also, the weather forecasting system, that is suggested through a comparative experiment of the typical time series analysis method shows more superior than the existing statistical analysis method in the part of future estimation capacity.

  • PDF

E-Learning Implementation of Statistics and Lab Course with ASP and Web Folder (ASP와 웹-폴더를 활용한 통계학 및 실습 교과목의 E-LEARNING 구현)

  • 이기원;이윤환
    • The Korean Journal of Applied Statistics
    • /
    • v.16 no.2
    • /
    • pp.441-454
    • /
    • 2003
  • We have shown how e-Learning can be implemented for statistics and lab course with minimum cost of operation using APM. We have illustrated how traditional class educational component can be implemented under e-Learning environment, especially statistical package labs with ASP and web-folder.

An Empirical Study on Improving the Accuracy of Demand Forecasting Based on Multi-Machine Learning (다중 머신러닝 기법을 활용한 무기체계 수리부속 수요예측 정확도 개선에 관한 실증연구)

  • Myunghwa Kim;Yeonjun Lee;Sangwoo Park;Kunwoo Kim;Taehee Kim
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.27 no.3
    • /
    • pp.406-415
    • /
    • 2024
  • As the equipment of the military has become more advanced and expensive, the cost of securing spare parts is also constantly increasing along with the increase in equipment assets. In particular, forecasting demand for spare parts one of the important management tasks in the military, and the accuracy of these predictions is directly related to military operations and cost management. However, because the demand for spare parts is intermittent and irregular, it is often difficult to make accurate predictions using traditional statistical methods or a single statistical or machine learning model. In this paper, we propose a model that can increase the accuracy of demand forecasting for irregular patterns of spare parts demanding by using a combination of statistical and machine learning algorithm, and through experiments on Cheonma spare parts demanding data.

Design and Implementation of Teaching Simple Random Sampling by Using Constructivism and Information Technology

  • Han Beom Soo;Han Kyung Soo
    • Communications for Statistical Applications and Methods
    • /
    • v.12 no.1
    • /
    • pp.229-240
    • /
    • 2005
  • This study described the application of constructivism and information technology for teaching simple random sampling. We considered more student's participation, more interaction, and more flow in their introductory statistics class. In addition, we presented a web-based teaching and learning system for simple random sampling to demonstrate.

WHEN CAN SUPPORT VECTOR MACHINE ACHIEVE FAST RATES OF CONVERGENCE?

  • Park, Chang-Yi
    • Journal of the Korean Statistical Society
    • /
    • v.36 no.3
    • /
    • pp.367-372
    • /
    • 2007
  • Classification as a tool to extract information from data plays an important role in science and engineering. Among various classification methodologies, support vector machine has recently seen significant developments. The central problem this paper addresses is the accuracy of support vector machine. In particular, we are interested in the situations where fast rates of convergence to the Bayes risk can be achieved by support vector machine. Through learning examples, we illustrate that support vector machine may yield fast rates if the space spanned by an adopted kernel is sufficiently large.

A Spatial Analysis of Seismic Vulnerability of Buildings Using Statistical and Machine Learning Techniques Comparative Analysis (통계분석 기법과 머신러닝 기법의 비교분석을 통한 건물의 지진취약도 공간분석)

  • Seong H. Kim;Sang-Bin Kim;Dae-Hyeon Kim
    • Journal of Industrial Convergence
    • /
    • v.21 no.1
    • /
    • pp.159-165
    • /
    • 2023
  • While the frequency of seismic occurrence has been increasing recently, the domestic seismic response system is weak, the objective of this research is to compare and analyze the seismic vulnerability of buildings using statistical analysis and machine learning techniques. As the result of using statistical technique, the prediction accuracy of the developed model through the optimal scaling method showed about 87%. As the result of using machine learning technique, because the accuracy of Random Forest method is 94% in case of Train Set, 76.7% in case of Test Set, which is the highest accuracy among the 4 analyzed methods, Random Forest method was finally chosen. Therefore, Random Forest method was derived as the final machine learning technique. Accordingly, the statistical analysis technique showed higher accuracy of about 87%, whereas the machine learning technique showed the accuracy of about 76.7%. As the final result, among the 22,296 analyzed building data, the seismic vulnerabilities of 1,627(0.1%) buildings are expected as more dangerous when the statistical analysis technique is used, 10,146(49%) buildings showed the same rate, and the remaining 10,523(50%) buildings are expected as more dangerous when the machine learning technique is used. As the comparison of the results of using advanced machine learning techniques in addition to the existing statistical analysis techniques, in spatial analysis decisions, it is hoped that this research results help to prepare more reliable seismic countermeasures.

Introduction to convolutional neural network using Keras; an understanding from a statistician

  • Lee, Hagyeong;Song, Jongwoo
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.6
    • /
    • pp.591-610
    • /
    • 2019
  • Deep Learning is one of the machine learning methods to find features from a huge data using non-linear transformation. It is now commonly used for supervised learning in many fields. In particular, Convolutional Neural Network (CNN) is the best technique for the image classification since 2012. For users who consider deep learning models for real-world applications, Keras is a popular API for neural networks written in Python and also can be used in R. We try examine the parameter estimation procedures of Deep Neural Network and structures of CNN models from basics to advanced techniques. We also try to figure out some crucial steps in CNN that can improve image classification performance in the CIFAR10 dataset using Keras. We found that several stacks of convolutional layers and batch normalization could improve prediction performance. We also compared image classification performances with other machine learning methods, including K-Nearest Neighbors (K-NN), Random Forest, and XGBoost, in both MNIST and CIFAR10 dataset.

Review of statistical methods for survival analysis using genomic data

  • Lee, Seungyeoun;Lim, Heeju
    • Genomics & Informatics
    • /
    • v.17 no.4
    • /
    • pp.41.1-41.12
    • /
    • 2019
  • Survival analysis mainly deals with the time to event, including death, onset of disease, and bankruptcy. The common characteristic of survival analysis is that it contains "censored" data, in which the time to event cannot be completely observed, but instead represents the lower bound of the time to event. Only the occurrence of either time to event or censoring time is observed. Many traditional statistical methods have been effectively used for analyzing survival data with censored observations. However, with the development of high-throughput technologies for producing "omics" data, more advanced statistical methods, such as regularization, should be required to construct the predictive survival model with high-dimensional genomic data. Furthermore, machine learning approaches have been adapted for survival analysis, to fit nonlinear and complex interaction effects between predictors, and achieve more accurate prediction of individual survival probability. Presently, since most clinicians and medical researchers can easily assess statistical programs for analyzing survival data, a review article is helpful for understanding statistical methods used in survival analysis. We review traditional survival methods and regularization methods, with various penalty functions, for the analysis of high-dimensional genomics, and describe machine learning techniques that have been adapted to survival analysis.

The effect of Woohwangchungsimwon on the learning and memory in NOS inhibitor treated rats in Morris water maze. (우황청심원(牛黃淸心元)이 NOS inhibitor에 의한 흰쥐의 학습(學習) 및 기억장애(記憶障碍)에 미치는 영향(影響))

  • Baek Ji-Seong;Kim Jong-Woo
    • Journal of Oriental Neuropsychiatry
    • /
    • v.10 no.2
    • /
    • pp.115-126
    • /
    • 1999
  • This study was conducted to find out the effects of Woohwangchungsimwon on learning and memory in the NOS inhibitor treated rats. The Morris water maze was used in evaluating them. The result of the study was summarized as follows. 1. In the learning test, three groups have showed a gradual improvement of learning ability by repeating the trials in Morris water maze. WHCS group have showed statistical improvement than control group at 4,5,6 trial(p<0.05, p<0.01, p<0.01). 2. In the memory test, the first latency of WHCH group was statistically shortened than that of control group(p<0.05). 3. In the memory test, there was no statistical difference in the entry number between WHCH group and control. 4. In the memory test, there was no statistical difference in the memory score between WHCH group and control. The result of this experimental study presents that Woohwangchungsimwon has the improving effect on impaired learning and memory in NOS inhibitor treated rats, and implies that Woohwangchungsimwon may be one of the useful prescription for the treatment of vascular dementia after cerebral ischemia.

  • PDF

Exploring modern machine learning methods to improve causal-effect estimation

  • Kim, Yeji;Choi, Taehwa;Choi, Sangbum
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.2
    • /
    • pp.177-191
    • /
    • 2022
  • This paper addresses the use of machine learning methods for causal estimation of treatment effects from observational data. Even though conducting randomized experimental trials is a gold standard to reveal potential causal relationships, observational study is another rich source for investigation of exposure effects, for example, in the research of comparative effectiveness and safety of treatments, where the causal effect can be identified if covariates contain all confounding variables. In this context, statistical regression models for the expected outcome and the probability of treatment are often imposed, which can be combined in a clever way to yield more efficient and robust causal estimators. Recently, targeted maximum likelihood estimation and causal random forest is proposed and extensively studied for the use of data-adaptive regression in estimation of causal inference parameters. Machine learning methods are a natural choice in these settings to improve the quality of the final estimate of the treatment effect. We explore how we can adapt the design and training of several machine learning algorithms for causal inference and study their finite-sample performance through simulation experiments under various scenarios. Application to the percutaneous coronary intervention (PCI) data shows that these adaptations can improve simple linear regression-based methods.