• Title/Abstract/Keywords: Ensemble models

352 search results, processing time 0.028 s

Quantitative analysis by derivative spectrophotometry (II) Derivative spectrophotometry and methods for the reduction of high frequency noises

  • Park, Man-Ki;Cho, Jung-Hwan
    • Archives of Pharmacal Research / Vol. 10, No. 1 / pp.1-8 / 1987
  • One of the problems of derivative spectrophotometry, the decrease in signal-to-noise ratio caused by derivative operations, was addressed with three digital filtering concepts: ensemble averaging, least squares polynomial smoothing, and Fourier smoothing. The authors wrote several computer programs in Applesoft BASIC to apply these digital filters on a UV spectrophotometer system. Ensemble averaging could not be used as a routine operation with the spectrophotometer used. The maximum S/N ratio enhancement factors achieved by least squares polynomial smoothing were 6.17 and 7.47 for spectra of the Gaussian and Lorentzian distribution models, respectively, and those achieved by Fourier smoothing were 16.42 and 11.78 for the same two models.
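
Ensemble averaging, the first of the three filters named in this abstract, takes the point-by-point mean of repeated scans; for independent zero-mean noise the S/N ratio is expected to improve by roughly the square root of the number of scans. The following is a minimal Python sketch of that idea, not the authors' Applesoft BASIC programs. The synthetic Gaussian band, the noise level, and the scan count of 64 are all assumptions chosen for illustration.

```python
import math
import random
import statistics

def gaussian_band(x, center=50.0, width=8.0):
    """Noise-free Gaussian band (a synthetic stand-in for a real spectrum)."""
    return math.exp(-((x - center) ** 2) / (2.0 * width ** 2))

def noisy_scan(rng, n_points=101, noise_sd=0.05):
    """One simulated scan: the clean band plus independent Gaussian noise."""
    return [gaussian_band(x) + rng.gauss(0.0, noise_sd) for x in range(n_points)]

def ensemble_average(scans):
    """Point-by-point mean over repeated scans of the same sample."""
    n = len(scans)
    return [sum(values) / n for values in zip(*scans)]

def residual_noise_sd(spectrum):
    """Standard deviation of (measured - clean) over all points."""
    return statistics.pstdev([y - gaussian_band(x) for x, y in enumerate(spectrum)])

rng = random.Random(0)  # fixed seed so the run is repeatable
single = noisy_scan(rng)
averaged = ensemble_average([noisy_scan(rng) for _ in range(64)])

# The signal is unchanged by averaging, so the noise ratio equals the S/N gain.
enhancement = residual_noise_sd(single) / residual_noise_sd(averaged)
print(f"S/N enhancement with 64 scans: {enhancement:.1f} "
      f"(square-root rule suggests about {math.sqrt(64):.0f})")
```

The cost of this filter is acquisition time: every factor of two in S/N needs four times as many scans, which is one plausible reason it may be impractical as a routine operation on a slow instrument.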


A Prediction of Northeast Asian Summer Precipitation Using Teleconnection

  • 이강진;권민호
    • 대기 / Vol. 25, No. 1 / pp.179-183 / 2015
  • Even though state-of-the-art general circulation models have improved step by step, the seasonal predictability of the East Asian summer monsoon remains poor. In contrast, the seasonal predictability of the western North Pacific and Indian monsoon regions using dynamic models is relatively high. This study builds a canonical correlation analysis model for seasonal prediction using wind fields over the western North Pacific and Indian Ocean from the Global Seasonal Forecasting System version 5 (GloSea5), and then assesses the predictability of this so-called hybrid model. In addition, we suggest a method for improving forecast skill by introducing the lagged ensemble technique.

Proper Noun Embedding Model for the Korean Dependency Parsing

  • Nam, Gyu-Hyeon;Lee, Hyun-Young;Kang, Seung-Shik
    • Journal of Multimedia Information System / Vol. 9, No. 2 / pp.93-102 / 2022
  • Dependency parsing is the problem of deciding the syntactic relations between words in a sentence. Recently, deep learning models have been used for dependency parsing based on word representations in a continuous vector space. However, this approach mislabels proper nouns that rarely appear in the training corpus, because out-of-vocabulary (OOV) words are difficult to express in a continuous vector space. To solve the OOV problem in dependency parsing, we explored proper noun embedding methods that differ in the embedding unit. Before representing words in a continuous vector space, we replace the proper nouns with a special token and train them on contextual features using a multi-layer bidirectional LSTM. We propose two models for proper noun embedding, one syllable-based and one morpheme-based, and the ensemble model improves dependency parsing performance more than either the syllable or the morpheme embedding model alone. The experimental results showed that our ensemble model improved UAS by 1.69%p and LAS by 2.17%p over the Malt parser, which uses the same arc-eager approach.

Tomato Crop Disease Classification Using an Ensemble Approach Based on a Deep Neural Network

  • 김민기
    • 한국멀티미디어학회논문지 / Vol. 23, No. 10 / pp.1250-1257 / 2020
  • Early detection of diseases is important in agriculture because diseases are a major threat to crop yield. The shape and color of a plant leaf change in different ways depending on the disease, so a disease can be detected and estimated by inspecting the visual features of the leaf. This study presents a vision-based leaf classification method for detecting diseases of tomato crops. The ResNet-50 model was used to extract visual features of the leaf and classify the disease, since it showed higher accuracy than ResNet models of other depths. We propose a new ensemble approach using several DCNN classifiers that have the same structure but have been trained over different ranges of the DCNN layers. The experiments achieved an accuracy of 97.19% on the PlantVillage dataset, which validates that the proposed method effectively classifies diseases of tomato crops.
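
The abstract does not say how the outputs of the DCNN classifiers are combined. One common choice is soft voting, in which the class-probability vectors are averaged and the most probable class is selected. The sketch below illustrates that rule only; the averaging rule, the three class labels, and the probability values are assumptions, not details taken from the paper.

```python
def soft_vote(prob_lists):
    """Average class-probability vectors from several classifiers, then take the argmax."""
    n_models = len(prob_lists)
    n_classes = len(prob_lists[0])
    mean_probs = [sum(p[c] for p in prob_lists) / n_models for c in range(n_classes)]
    best = max(range(n_classes), key=lambda c: mean_probs[c])
    return best, mean_probs

# Hypothetical softmax outputs of three classifiers for one leaf image.
# Made-up classes: 0 = healthy, 1 = early blight, 2 = late blight.
outputs = [
    [0.50, 0.40, 0.10],  # classifier A alone would pick class 0
    [0.20, 0.70, 0.10],  # classifier B picks class 1
    [0.30, 0.60, 0.10],  # classifier C picks class 1
]
label, probs = soft_vote(outputs)
print("ensemble label:", label)
print("mean probabilities:", [round(p, 3) for p in probs])
```

Here the ensemble selects class 1 even though one member disagrees, which is the kind of error correction an ensemble of similarly structured classifiers is meant to provide.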

A Forecasting System for KOSPI 200 Option Trading using Artificial Neural Network Ensemble

  • 이재식;송영균;허성회
    • 한국지능정보시스템학회:학술대회논문집 / 한국지능정보시스템학회 2000년도 추계정기학술대회:지능형기술과 CRM / pp.489-497 / 2000
  • Since the IMF crisis, the money market environment has been changing rapidly. Many companies, including financial institutions, and many individual investors are therefore concerned with forecasting the money market, and they seek to secure various profit and hedging methods using derivatives such as options, futures, and swaps. In this research, we developed a prototype forecasting system for trading KOSPI 200 options, specifically call options, using artificial neural networks (ANN). To avoid the overfitting problem and the problems involved in choosing the ANN structure and parameters, we employed the ANN ensemble approach. We conducted two types of simulation: one with hold signals taken into account and one without. Even though our models show low accuracy for the sample set extracted from data collected in the early stage of the IMF crisis, they perform better in terms of profit and stability than the model that uses only the theoretical price.


Predicting movie audience with stacked generalization by combining machine learning algorithms

  • Park, Junghoon;Lim, Changwon
    • Communications for Statistical Applications and Methods / Vol. 28, No. 3 / pp.217-232 / 2021
  • The Korean film industry has matured, and the number of movies watched per capita has reached the highest level in the world. Since then, the growth rate of the movie industry has been declining, and total annual movie sales even decreased slightly in 2018. The number of moviegoers is the primary driver of sales in the movie industry and also an important factor influencing additional sales, so it is important to predict movie audience numbers. In this study, we predict the cumulative audience of films using stacking, an ensemble method that combines all the algorithms used in the prediction. We use box office data from the Korea Film Council and web comment data from Daum Movie (www.movie.daum.net). This paper describes the collection and preprocessing of the explanatory variables and explains the regression models used in stacking. The final stacking model shows superior test-set prediction performance in terms of RMSE.
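
Stacked generalization, as described in this abstract, trains several base regressors and then fits a meta-learner on their cross-validated predictions. The sketch below shows the general pattern with scikit-learn's `StackingRegressor`. It is an illustration under assumptions: the data are synthetic rather than the Korea Film Council or Daum Movie data, and the choice of base learners and meta-learner is hypothetical, since the abstract does not list the models used.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import StackingRegressor
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsRegressor
from sklearn.tree import DecisionTreeRegressor

# Synthetic regression data standing in for the box-office features.
X, y = make_regression(n_samples=300, n_features=8, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Level-0 base learners (hypothetical choices for illustration).
base_learners = [
    ("ridge", Ridge(alpha=1.0)),
    ("tree", DecisionTreeRegressor(max_depth=5, random_state=0)),
    ("knn", KNeighborsRegressor(n_neighbors=5)),
]

# The meta-learner is fitted on 5-fold out-of-fold predictions of the base learners.
stack = StackingRegressor(
    estimators=base_learners, final_estimator=LinearRegression(), cv=5
)
stack.fit(X_train, y_train)

def rmse(y_true, y_pred):
    """Root-mean-square error."""
    return float(np.sqrt(mean_squared_error(y_true, y_pred)))

predictions = stack.predict(X_test)
stack_rmse = rmse(y_test, predictions)
print(f"stacked RMSE on the test set: {stack_rmse:.2f}")
```

Fitting the meta-learner on out-of-fold predictions, rather than on predictions for data the base learners have already seen, is what keeps the second level from simply rewarding the most overfitted base model.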

One Step Measurements of Hippocampal Pure Volumes from MRI Data Using an Ensemble Model of 3-D Convolutional Neural Network

  • Basher, Abol;Ahmed, Samsuddin;Jung, Ho Yub
    • 스마트미디어저널 / Vol. 9, No. 2 / pp.22-32 / 2020
  • Hippocampal volume atrophy is known to be linked with neurodegenerative disorders and is one of the most important early biomarkers for Alzheimer's disease detection. Measuring hippocampal pure volumes from Magnetic Resonance Imaging (MRI) is a crucial task, and state-of-the-art methods require a large amount of time. In addition, structural brain development is investigated using MRI data, where the study of brain morphometry (e.g., cortical thickness, volume, surface area) is a significant part of the analysis. In this study, we propose a patch-based ensemble model of 3-D convolutional neural networks (CNN) to measure the hippocampal pure volume from MRI data. 3-D patches were extracted from the volumetric MRI scans to train the proposed 3-D CNN models. The trained models are used to construct the ensemble 3-D CNN model, and the aggregated model predicts the pure volume in one step in the test phase. Our approach takes only 5 seconds to estimate the volumes from an MRI scan. The average errors of the proposed ensemble 3-D CNN model are 11.7±8.8 (error%±STD) and 12.5±12.8 (error%±STD) for the left and right hippocampi of 65 test MRI scans, respectively. The quantitative comparison of the predicted volumes with the ground truth volumes shows that the proposed approach can be used as a proxy.

Comparative characteristic of ensemble machine learning and deep learning models for turbidity prediction in a river

  • 박정수
    • 상하수도학회지 / Vol. 35, No. 1 / pp.83-91 / 2021
  • Increased turbidity in rivers during flood events has various effects on water environmental management, including drinking water supply systems. Prediction of turbid water is therefore essential for water environmental management. Recently, advanced machine learning algorithms have been increasingly used in this field. Ensemble machine learning algorithms such as random forest (RF) and gradient boosting decision tree (GBDT) are among the most popular, along with deep learning algorithms such as recurrent neural networks. In this study, GBDT, an ensemble machine learning algorithm, and gated recurrent unit (GRU), a recurrent neural network algorithm, are used to develop models that predict turbidity in a river. The observation frequencies of the input data used for the models were 2, 4, 8, 24, 48, 120 and 168 h. The root-mean-square error-observations standard deviation ratio (RSR) of GRU and GBDT ranges over 0.182~0.766 and 0.400~0.683, respectively. Both models show similar prediction accuracy, with an RSR of 0.682 for GRU and 0.683 for GBDT. The GRU shows better prediction accuracy when the observation frequency is relatively short (i.e., 2, 4, and 8 h), whereas GBDT shows better prediction accuracy when the observation frequency is relatively long (i.e., 48, 120, 160 h). The results suggest that the characteristics of the input data should be considered when developing an appropriate model to predict turbidity.
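
The RSR metric reported in this abstract is the RMSE divided by the standard deviation of the observations. A value of 0 means a perfect fit, and a value of 1 means the model does no better than always predicting the observed mean. The short Python sketch below computes it from the usual definition; the turbidity values are made up for illustration and are not from the paper.

```python
import math

def rsr(observed, predicted):
    """RMSE-observations standard deviation ratio.

    RSR = sqrt(sum((obs - pred)^2)) / sqrt(sum((obs - mean(obs))^2))
    The 1/n factors of the RMSE and the standard deviation cancel.
    """
    mean_obs = sum(observed) / len(observed)
    sq_err = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    sq_dev = sum((o - mean_obs) ** 2 for o in observed)
    return math.sqrt(sq_err) / math.sqrt(sq_dev)

obs = [2.0, 4.0, 6.0, 8.0]   # made-up turbidity observations
pred = [3.0, 4.0, 5.0, 8.0]  # made-up model output
print(f"RSR = {rsr(obs, pred):.3f}")
```

Because RSR is normalized by the spread of the observations, it allows results at different observation frequencies to be compared on one scale, which is how the abstract uses it.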

Development of Deep Learning Based Ensemble Land Cover Segmentation Algorithm Using Drone Aerial Images

  • 박해광;백승기;정승현
    • 대한원격탐사학회지 / Vol. 40, No. 1 / pp.71-80 / 2024
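  • This study proposes an ensemble learning technique to improve semantic land cover segmentation of images captured by unmanned aerial vehicles (UAVs). As UAV use grows in fields such as urban planning, techniques that apply deep learning segmentation methods to land cover segmentation are being actively developed. This study proposes a method for improving segmentation prediction performance using the representative segmentation models U-Net, DeepLabV3, and Fully Convolutional Network (FCN). The proposed approach builds an ensemble model by integrating the training loss, validation accuracy, and class-wise scores of the three segmentation models, improving overall prediction performance. The method was evaluated on a land cover segmentation problem with seven classes, including buildings, roads, parking lots, rice paddies, fields, trees, vacant land, and unclassified areas. Ensemble performance was evaluated by mean Intersection over Union (mIoU), and a comparison of the proposed ensemble model with the three existing segmentation methods showed improved mIoU. The study thus confirms that the proposed technique can improve the performance of semantic segmentation models.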
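
The mIoU score used for evaluation here averages, over classes, the ratio of correctly labeled pixels to all pixels assigned to that class in either the ground truth or the prediction. The sketch below computes it on flattened label maps. The tiny label arrays are invented for illustration, and skipping classes that appear in neither map is one common convention that the abstract does not confirm.

```python
def mean_iou(y_true, y_pred, n_classes):
    """Mean of per-class IoU = TP / (TP + FP + FN) over flattened label maps.

    Classes absent from both the ground truth and the prediction are skipped.
    """
    ious = []
    for c in range(n_classes):
        inter = sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
        union = sum(1 for t, p in zip(y_true, y_pred) if t == c or p == c)
        if union > 0:
            ious.append(inter / union)
    return sum(ious) / len(ious)

# Made-up labels for six pixels and three classes.
truth = [0, 0, 1, 1, 2, 2]
pred = [0, 1, 1, 1, 2, 0]
score = mean_iou(truth, pred, n_classes=3)
print(f"mIoU = {score:.3f}")
```

In this toy case the per-class IoU values are 1/3, 2/3, and 1/2, so small classes that are segmented badly pull the mean down as much as large ones do.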

Epileptic Seizure Detection Using CNN Ensemble Models Based on Overlapping Segments of EEG Signals

  • 김민기
    • 정보처리학회논문지:소프트웨어 및 데이터공학 / Vol. 10, No. 12 / pp.587-594 / 2021
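  • As diagnosis based on the electroencephalogram (EEG) expands, various studies on automatic classification of EEG signals are actively under way. This paper proposes a CNN model that can effectively distinguish EEG signals recorded from healthy subjects and from patients with epilepsy. To augment the data needed for CNN training, the EEG signal is divided into lower-dimensional signals, which are in turn split into multiple overlapping segments and used for CNN training. In addition, a CNN ensemble strategy is proposed to improve CNN performance. In experiments on the public Bonn dataset, epileptic seizures were detected with an accuracy of 99.0% or higher, and the ensemble method improved accuracy in 3-class and 5-class EEG classification.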
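
The overlapping segmentation described in this abstract can be pictured as a sliding window whose step is smaller than its length, so that neighboring segments share samples and one recording yields many training examples. The sketch below shows only that windowing step. The window length and step are hypothetical, since the abstract does not state them, and a short integer list stands in for an EEG trace.

```python
def overlapping_segments(signal, length, step):
    """Split a 1-D signal into fixed-length windows starting every `step` samples.

    With step < length, consecutive windows overlap by (length - step) samples.
    Trailing samples that do not fill a whole window are dropped.
    """
    if length <= 0 or step <= 0:
        raise ValueError("length and step must be positive")
    last_start = len(signal) - length
    return [signal[i:i + length] for i in range(0, last_start + 1, step)]

signal = list(range(10))  # stand-in for a short EEG trace
segments = overlapping_segments(signal, length=4, step=2)
for seg in segments:
    print(seg)
```

With a window of 4 and a step of 2, the ten samples yield four segments instead of the two that non-overlapping windows would give, which is the data-expansion effect the abstract relies on.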