• Title/Abstract/Keywords: Statistical Learning Model

Search results: 536 items; processing time 0.027 s

Estimation of software project effort with genetic algorithm and support vector regression

  • 권기태;박수권
    • 정보처리학회논문지D
    • /
    • Vol. 16D, No. 5
    • /
    • pp.729-736
    • /
    • 2009
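  • In software engineering, accurate prediction of development cost is essential to a successful development project, and many software cost estimation models have been developed to date. They range from traditional statistical techniques to algorithms that apply machine learning, drawing on ideas from a variety of fields. This paper proposes GA-SVR, a model for predicting software development cost that combines a genetic algorithm with support vector regression, the regression form of the support vector machine. The proposed model shows improved results compared with previous studies.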

Severity Prediction of Sleep Respiratory Disease Based on Statistical Analysis Using Machine Learning

  • 김준수;최병재
    • 대한임베디드공학회논문지
    • /
    • Vol. 18, No. 2
    • /
    • pp.59-65
    • /
    • 2023
  • Currently, polysomnography is essential for diagnosing sleep-related breathing disorders. However, polysomnography has several disadvantages, such as the requirement for multiple sensors and a long reading time. In this paper, we propose a system for predicting the severity of sleep-related breathing disorders at home using elements that can be measured by a wearable device. To predict severity, the variables were refined through a three-step variable selection process, and the refined variables were used as inputs to three machine-learning models. Random forest models showed excellent prediction performance throughout. The best F1 scores of the model for the three AHI threshold criteria of 5, 15, and 30 were about 87.3%, 90.7%, and 90.8%, respectively, and the best F1 scores for the three RDI threshold criteria were approximately 79.8%, 90.2%, and 90.1%, respectively.
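As a rough illustration of how an F1 score per severity threshold can be computed, the sketch below binarizes index values at the thresholds 5, 15, and 30 named in the abstract. The function name `f1_at_threshold` and the sample values are hypothetical, and the sketch assumes a model that outputs a numeric index estimate; the paper's actual models and data are not reproduced here.

```python
def f1_at_threshold(true_values, predicted_values, threshold):
    """F1 score after labeling each value as positive when it is >= threshold."""
    y_true = [v >= threshold for v in true_values]
    y_pred = [v >= threshold for v in predicted_values]
    tp = sum(t and p for t, p in zip(y_true, y_pred))
    fp = sum((not t) and p for t, p in zip(y_true, y_pred))
    fn = sum(t and (not p) for t, p in zip(y_true, y_pred))
    denominator = 2 * tp + fp + fn
    # With no positives in either list, F1 is undefined; return 0.0 by convention.
    return 2 * tp / denominator if denominator else 0.0


# Hypothetical index values, for illustration only.
true_ahi = [2, 8, 20, 35, 40, 12]
predicted_ahi = [4, 6, 14, 31, 25, 16]

for threshold in (5, 15, 30):
    score = f1_at_threshold(true_ahi, predicted_ahi, threshold)
    print(threshold, round(score, 3))
```

Reporting one score per threshold, as the abstract does, treats each severity cut-off as its own binary classification task.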

Adaptive Anomaly Movement Detection Approach Based On Access Log Analysis

  • 김남의;신동천
    • 융합보안논문지
    • /
    • Vol. 18, No. 5_1
    • /
    • pp.45-51
    • /
    • 2018
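  • As the use and importance of data continue to grow, data-related incidents and damage are increasing, and incidents caused by insiders are especially dangerous. Because such insider attacks are difficult to defend against with traditional security systems, rule-based anomalous behavior detection methods have been widely used. However, these methods do not adapt flexibly to change, such as new attack methods and new environments. As a solution, this paper proposes an adaptive anomalous movement detection framework based on a statistical Markov model. The framework focuses on people's movement to detect insider risk in advance. It is designed to minimize false positives and false negatives by adapting to a changing environment through environmental factors that directly affect movement and through continuous statistical learning. In experiments with the framework, a high F2-score of 0.92 was obtained, and the framework also discovered movements that appear normal but warrant suspicion. If the behavior-related data and modeling algorithms are diversified on the basis of statistical learning and environmental factors, the framework offers the extensibility to detect a broader range of abnormal behavior.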
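The general idea of a Markov model that keeps learning from access logs can be sketched as follows. This is a minimal illustration, not the paper's framework: the class `MarkovMovementModel`, the smoothing constant `alpha`, the `threshold` value, and the location names are all assumptions, and the environmental factors the abstract mentions are not modeled.

```python
from collections import defaultdict


class MarkovMovementModel:
    """First-order Markov model over location-to-location moves (illustrative)."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # additive smoothing constant (assumed value)
        self.counts = defaultdict(lambda: defaultdict(int))
        self.locations = set()

    def update(self, path):
        """Add one observed path (a list of location names) to the counts."""
        self.locations.update(path)
        for src, dst in zip(path, path[1:]):
            self.counts[src][dst] += 1

    def prob(self, src, dst):
        """Smoothed estimate of P(dst | src) from the counts seen so far."""
        total = sum(self.counts[src].values())
        n = len(self.locations | {src, dst})
        return (self.counts[src][dst] + self.alpha) / (total + self.alpha * n)

    def is_anomalous(self, src, dst, threshold=0.1):
        """Flag a move whose estimated probability falls below the threshold."""
        return self.prob(src, dst) < threshold


model = MarkovMovementModel()
for _ in range(20):
    model.update(["lobby", "office", "lab"])
model.update(["lobby", "server_room"])
print(model.is_anomalous("lobby", "server_room"))  # rare move so far

# Continued learning: once the move becomes common, it stops being flagged.
for _ in range(5):
    model.update(["lobby", "server_room"])
print(model.is_anomalous("lobby", "server_room"))
```

Because the counts are updated with every new path, the flagged set shifts as behavior changes, which is the adaptive property that a fixed rule set lacks.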


Vehicle Classification by Road Lane Detection and Model Fitting Using a Surveillance Camera

  • Shin, Wook-Sun;Song, Doo-Heon;Lee, Chang-Hun
    • Journal of Information Processing Systems
    • /
    • Vol. 2, No. 1
    • /
    • pp.52-57
    • /
    • 2006
  • One of the important functions of an Intelligent Transportation System (ITS) is to classify vehicle types using a vision system. We propose a method using machine-learning algorithms for this classification problem with 3-D object model fitting. It is also necessary to detect road lanes from a fixed traffic surveillance camera in preparation for model fitting. We apply a background mask and line analysis algorithm based on statistical measures to Hough Transform (HT) in order to remove noise and false positive road lanes. The results show that this method is quite efficient in terms of quality.

Analysis Model Evaluation based on IoT Data and Machine Learning Algorithm for Prediction of Acer Mono Sap Liquid Water

  • Lee, Han Sung;Jung, Se Hoon
    • 한국멀티미디어학회논문지
    • /
    • Vol. 23, No. 10
    • /
    • pp.1286-1295
    • /
    • 2020
  • Droughts and cold waves caused by recent climate change have made it increasingly difficult to predict how much Acer mono sap can be collected, and few studies have addressed the prediction of its collection volume. This study therefore proposes a Big Data prediction system for Acer mono sap collection based on meteorological information. The proposed system analyzes collected data and provides managers with a statistical chart of predicted values for the climate factors that affect the amount of Acer mono sap collected, thus enabling efficient work. It was designed on Hadoop for data collection, processing, and analysis. The study also analyzed and proposed an optimal prediction model for the climate conditions that influence the volume of Acer mono sap collected, applying a multiple regression analysis model based on Hadoop and Mahout.

Neural Network and Its Application to Rainfall-Runoff Forecasting

  • Kang, Kwan-Won;Park, Chan-Young;Kim, Ju-Hwan
    • Korean Journal of Hydrosciences
    • /
    • Vol. 4
    • /
    • pp.1-9
    • /
    • 1993
  • Forecasting streamflows is a major objective in the management and operation of water resources systems. The applicability of an artificial neural network model to a hydrologic system is analyzed, and its performance is compared with observed data by statistical methods. A multi-layered perceptron was used to model the rainfall-runoff process at Pyung Chang River Basin in Korea. The neural network model learns the process and can be trained with the error backpropagation (EBP) algorithm in two phases: (1) the learning phase finds the best parameters (weight matrix) between input and output; (2) the adaptive phase uses the EBP algorithm to learn from the provided data. Generalization results were obtained by forecasting daily and hourly streamflows under the assumption that they have the structure of an ARMA model. The results show the validity of applying the model to hydrologic forecasting systems.


Data-Driven Batch Processing for Parameter Calibration of a Sensor System

  • 이규만
    • 센서학회지
    • /
    • Vol. 32, No. 6
    • /
    • pp.475-480
    • /
    • 2023
  • When modeling a sensor system mathematically, we assume that the sensor noise is Gaussian and white to simplify the model. If this assumption fails, the performance of the sensor model-based controller or estimator degrades due to incorrect modeling. In practice, non-Gaussian or non-white noise sources often arise in many digital sensor systems. Additionally, the noise parameters of the sensor model are not known in advance without additional noise statistical information. Moreover, disturbances or high nonlinearities often cause unknown sensor modeling errors. To estimate the uncertain noise and model parameters of a sensor system, this paper proposes an iterative batch calibration method using data-driven machine learning. Our simulation results validate the calibration performance of the proposed approach.

The Effects of ARCS Model Based Instruction and Cognitive Style on Learning Motivation and Learning Achievement in Computer Education

  • 김성완;윤종성
    • 컴퓨터교육학회논문지
    • /
    • Vol. 9, No. 5
    • /
    • pp.1-11
    • /
    • 2006
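  • This study examines how cognitive style affects learning motivation and academic achievement in computer classes that apply the ARCS model. To this end, 70 students at A High School in Suwon, Gyeonggi-do, were divided into four groups (ARCS model instruction, field-independent; ARCS model instruction, field-dependent; traditional instruction, field-independent; traditional instruction, field-dependent) and an experiment was conducted. The results are as follows. First, the mean post-test learning motivation differed among the groups with statistical significance at the 5% level. In particular, a post hoc test showed a significant difference between the 'ARCS model instruction, field-independent' group (M=3.36) and the 'traditional instruction, field-independent' group (M=3.18). In addition, an analysis of pre-test to post-test gain scores in learning motivation showed statistically significant differences in every group except the 'traditional instruction, field-dependent' group. Second, the mean post-test academic achievement did not differ among the groups with statistical significance at the 5% level. In addition, an analysis of gain scores between the pre-test and post-test of academic achievement showed a significant difference in mean academic achievement for the four groups, regardless of whether the ARCS model was applied and regardless of cognitive style. These results should offer important implications for realizing individualized teaching and learning through customized instructional design.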


Multi-Sensor Signal based Situation Recognition with Bayesian Networks

  • Kim, Jin-Pyung;Jang, Gyu-Jin;Jung, Jae-Young;Kim, Moon-Hyun
    • Journal of Electrical Engineering and Technology
    • /
    • Vol. 9, No. 3
    • /
    • pp.1051-1059
    • /
    • 2014
  • In this paper, we propose an intelligent situation recognition model that collects and analyzes multiple sensor signals. The sensor signals are collected over a fixed time window. A training set of collected sensor data for each situation is provided to the K2 learning algorithm to generate Bayesian networks representing the causal relationships between sensors for that situation. Statistical characteristics of the sensor values and topological characteristics of the generated graphs are learned for each situation. A neural network is designed to classify the current situation based on the features extracted from the collected sensor values. The proposed method is implemented and tested with UCI machine learning repository data.

Area-wise relational knowledge distillation

  • Sungchul Cho;Sangje Park;Changwon Lim
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 30, No. 5
    • /
    • pp.501-516
    • /
    • 2023
  • Knowledge distillation (KD) refers to extracting knowledge from a large and complex model (teacher) and transferring it to a relatively small model (student). This can be done by training the teacher model to obtain the activation function values of the hidden or the output layers and then retraining the student model using the same training data with the obtained values. Recently, relational KD (RKD) has been proposed to extract knowledge about relative differences in training data. This method improved the performance of the student model compared to conventional KDs. In this paper, we propose a new method for RKD by introducing a new loss function for RKD. The proposed loss function is defined using the area difference between the teacher model and the student model in a specific hidden layer, and it is shown that the model can be successfully compressed, and the generalization performance of the model can be improved. We demonstrate that the accuracy of the model applying the method proposed in the study of model compression of audio data is up to 1.8% higher than that of the existing method. For the study of model generalization, we demonstrate that the model has up to 0.5% better performance in accuracy when introducing the RKD method to self-KD using image data.
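For orientation, the conventional output-layer form of KD that the abstract contrasts with RKD can be sketched as a soft-target loss. This is a minimal sketch of the widely used temperature-scaled formulation, not the area-wise loss proposed in the paper, whose definition the abstract does not give; the function names and the `temperature` default are assumptions.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-softened softmax, shifted by the max for numerical stability."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kd_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on softened outputs, scaled by temperature squared."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl


# Hypothetical logits for a three-class problem.
teacher = [3.0, 1.0, 0.0]
print(kd_loss([0.0, 0.0, 0.0], teacher))  # untrained student: positive loss
print(kd_loss(teacher, teacher))          # matching outputs: zero loss
```

In practice this term is usually combined with the ordinary supervised loss on the training labels. Relational variants replace or supplement it with terms that compare structure across samples instead of matching each output individually.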