• Title/Abstract/Keywords: Kernel regression

Predicting Daily Nutrient Water Consumption by Strawberry Plants in a Greenhouse Environment

  • Sathishkumar, VE;Lee, Myeong-Bae;Lim, Jong-Hyun;Shin, Chang-Sun;Park, Chang-Woo;Cho, Yong Yun
    • 한국정보처리학회:학술대회논문집 (Korea Information Processing Society: Conference Proceedings)
    • /
    • 한국정보처리학회 2019년도 추계학술발표대회 (Korea Information Processing Society 2019 Fall Conference)
    • /
    • pp.581-584
    • /
    • 2019
  • Food consumption is growing worldwide every year owing to a growing population, so sufficient, good-quality food must be produced. Strawberry is one of the world's most famous fruits. To obtain the highest strawberry output, we grew three strawberry varieties supplied with three kinds of nutrient water in a greenhouse and, from the resulting production, identified the highest-yielding variety. This study uses the nutrient water consumed each day by that variety. Because the water consumption of any plant depends primarily on weather conditions, the atmospheric temperature, humidity, and CO2 levels within the greenhouse are measured and used for the prediction. Machine learning techniques have shown successful outcomes on a multitude of problems, including time series and regression problems. This study proposes predicting the daily nutrient water consumption of strawberry plants using machine learning algorithms. Four machine learning algorithms are used: Linear Regression (LR), K-Nearest Neighbour (KNN), Support Vector Machine with Radial Kernel (SVM), and Gradient Boosting Machine (GBM). The Gradient Boosting Machine produces the best results.
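
As a rough illustration of one of the four model types named in the abstract, the sketch below implements k-nearest-neighbour regression on greenhouse-style features. The function name `knn_predict`, the toy readings, and `k=2` are all assumptions made for illustration; the abstract gives no data, hyperparameters, or implementation details.

```python
import math

def knn_predict(train_x, train_y, query, k=2):
    """Predict by averaging the targets of the k nearest training points."""
    # Euclidean distance from the query to every training row.
    dists = [(math.dist(x, query), y) for x, y in zip(train_x, train_y)]
    dists.sort(key=lambda pair: pair[0])
    nearest = dists[:k]
    return sum(y for _, y in nearest) / len(nearest)

# Hypothetical rows: (temperature in deg C, relative humidity in %, CO2 in ppm / 100).
# All values are invented for illustration only.
train_x = [(20.0, 70.0, 4.0), (22.0, 65.0, 4.2), (28.0, 50.0, 4.5), (30.0, 45.0, 4.8)]
train_y = [1.0, 1.2, 2.0, 2.4]  # daily nutrient water consumption (arbitrary units)

prediction = knn_predict(train_x, train_y, query=(29.0, 48.0, 4.6), k=2)
print(round(prediction, 2))
```

With real measurements the features would normally be rescaled first, since an unscaled Euclidean distance lets the variable with the largest numeric range dominate.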

BCI에서 기계 학습을 위한 간질 뇌파 특징 선택을 통한 차원 감소 방법 분석 (Analysis of Dimensionality Reduction Methods Through Epileptic EEG Feature Selection for Machine Learning in BCI)

  • 양통;임창균
    • 한국전자통신학회논문지 (The Journal of the Korea Institute of Electronic Communication Sciences)
    • /
    • Vol. 13, No. 6
    • /
    • pp.1333-1342
    • /
    • 2018
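  • Electroencephalography (EEG) has so far been the most important and convenient method for diagnosing and treating epilepsy. However, epileptic EEG signals are difficult to identify because their waveform characteristics are very weak and non-stationary, and the background noise is strong. This paper analyzes the effectiveness of classification after dimensionality reduction through feature selection on epileptic EEG. We used principal component analysis (PCA), kernel principal component analysis (KPCA), and linear discriminant analysis (LDA) for dimensionality reduction. To analyze the performance of the dimensionality reduction methods, we evaluated them with the following classifiers: Support Vector Machine (SVM), Logistic Regression (LR), K-Nearest Neighbor (K-NN), Decision Tree (DT), and Random Forest (RF). According to the experimental results, PCA achieved 75% accuracy with SVM, LR, and K-NN. KPCA achieved 85% with SVM and K-NN, and LDA achieved 100% accuracy with K-NN. Therefore, dimensionality reduction using LDA gave the best classification results for epileptic EEG signals.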

VALIDATION OF ON-LINE MONITORING TECHNIQUES TO NUCLEAR PLANT DATA

  • Garvey, Jamie;Garvey, Dustin;Seibert, Rebecca;Hines, J. Wesley
    • Nuclear Engineering and Technology
    • /
    • Vol. 39, No. 2
    • /
    • pp.133-142
    • /
    • 2007
  • The Electric Power Research Institute (EPRI) demonstrated a method for monitoring the performance of instrument channels in Topical Report (TR) 104965, 'On-Line Monitoring of Instrument Channel Performance.' This paper presents the results of several models originally developed by EPRI to monitor three nuclear plant sensor sets: Pressurizer Level, Reactor Protection System (RPS) Loop A, and Reactor Coolant System (RCS) Loop A Steam Generator (SG) Level. The sensor sets investigated include one redundant sensor model and two non-redundant sensor models. Each model employs an Auto-Associative Kernel Regression (AAKR) model architecture to predict correct sensor behavior. Performance of each of the developed models is evaluated using four metrics: accuracy, auto-sensitivity, cross-sensitivity, and newly developed Error Uncertainty Limit Monitoring (EULM) detectability. The uncertainty estimate for each model is also calculated through two methods: analytic formulas and Monte Carlo estimation. The uncertainty estimates are verified by calculating confidence interval coverages to assure that 95% of the measured data fall within the confidence intervals. The model performance evaluation identified the Pressurizer Level model as acceptable for on-line monitoring (OLM) implementation. The other two models, RPS Loop A and RCS Loop A SG Level, highlight two common problems that occur in model development and evaluation, namely faulty data and poor signal selection.
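
The general idea behind auto-associative kernel regression can be sketched as follows. A query vector of sensor readings is compared with stored fault-free exemplars, and the "corrected" reading is the kernel-weighted average of those exemplars. The Gaussian kernel, the bandwidth value, the function name `aakr_predict`, and the sensor values are assumptions for illustration; the abstract does not describe the EPRI models at this level of detail.

```python
import math

def aakr_predict(memory, query, bandwidth=1.0):
    """Return the kernel-weighted average of the memory vectors for one query."""
    weights = []
    for exemplar in memory:
        # Squared Euclidean distance between the query and this exemplar.
        d2 = sum((m - q) ** 2 for m, q in zip(exemplar, query))
        # Gaussian kernel: nearby exemplars receive weights close to 1.
        weights.append(math.exp(-d2 / (2.0 * bandwidth ** 2)))
    total = sum(weights)
    if total == 0.0:
        raise ValueError("query is too far from every memory vector for this bandwidth")
    n_signals = len(query)
    return [
        sum(w * exemplar[j] for w, exemplar in zip(weights, memory)) / total
        for j in range(n_signals)
    ]

# Hypothetical fault-free exemplars from three redundant level sensors.
memory = [
    [50.0, 50.1, 49.9],
    [55.0, 55.1, 54.9],
    [60.0, 60.1, 59.9],
]

# Query in which the third sensor has drifted upward by about 2 units.
query = [55.0, 55.1, 57.0]
estimate = aakr_predict(memory, query, bandwidth=2.0)
residuals = [q - e for q, e in zip(query, estimate)]
print([round(r, 2) for r in residuals])
```

In this toy case the residual of the drifted third sensor is large while the other two stay near zero, which is the kind of signal an on-line monitoring scheme would flag.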

개선된 데이터마이닝을 위한 혼합 학습구조의 제시 (Hybrid Learning Architectures for Advanced Data Mining: An Application to Binary Classification for Fraud Management)

  • Kim, Steven H.;Shin, Sung-Woo
    • 정보기술응용연구 (Journal of Information Technology Application)
    • /
    • Vol. 1
    • /
    • pp.173-211
    • /
    • 1999
  • The task of classification permeates all walks of life, from business and economics to science and public policy. In this context, nonlinear techniques from artificial intelligence have often proven to be more effective than the methods of classical statistics. The objective of knowledge discovery and data mining is to support decision making through the effective use of information. The automated approach to knowledge discovery is especially useful when dealing with large data sets or complex relationships. For many applications, automated software may find subtle patterns which escape the notice of manual analysis, or whose complexity exceeds the cognitive capabilities of humans. This paper explores the utility of a collaborative learning approach involving integrated models in the preprocessing and postprocessing stages. For instance, a genetic algorithm effects feature-weight optimization in a preprocessing module. Moreover, an inductive tree, artificial neural network (ANN), and k-nearest neighbor (kNN) techniques serve as postprocessing modules. More specifically, the postprocessors act as second-order classifiers which determine the best first-order classifier on a case-by-case basis. In addition to the second-order models, a voting scheme is investigated as a simple, but efficient, postprocessing model. The first-order models consist of statistical and machine learning models such as logistic regression (logit), multivariate discriminant analysis (MDA), ANN, and kNN. The genetic algorithm, inductive decision tree, and voting scheme act as kernel modules for collaborative learning. These ideas are explored against the background of a practical application relating to financial fraud management which exemplifies a binary classification problem.
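
A voting scheme over first-order classifiers can be sketched as a simple majority vote. The abstract does not specify the voting rule, so the plain majority, the tie-breaking choice, and the example votes below are assumptions for illustration.

```python
from collections import Counter

def majority_vote(predictions, tie_label=1):
    """Return the most common label; ties fall back to tie_label (an assumed rule)."""
    ranked = Counter(predictions).most_common()
    # A tie exists when the two most frequent labels have equal counts.
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return tie_label
    return ranked[0][0]

# Hypothetical outputs of the four first-order models for one case
# (1 = fraudulent, 0 = legitimate).
votes = {"logit": 1, "MDA": 0, "ANN": 1, "kNN": 1}
decision = majority_vote(list(votes.values()))
print(decision)
```

With four voters a 2-2 split is possible, so the sketch resolves ties toward label 1. That choice reflects a preference for flagging a doubtful case for review and is not taken from the paper.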

수입 박류사료내 에너지 및 영양소 함량의 변이 (Variation in Energy and Nutrient Composition of Oilseed Meals from Different Countries)

  • 손아름
    • 한국가금학회지 (Korean Journal of Poultry Science)
    • /
    • Vol. 47, No. 2
    • /
    • pp.107-114
    • /
    • 2020
  • This study was conducted to investigate the variation in nutrient composition of oilseed meals and to develop prediction equations for amino acid concentrations. Energy and nutrient contents were determined in a total of 1,380 feed ingredient samples including copra byproducts, corn distillers dried grains with solubles, palm kernel byproducts, and soybean meal. The ingredient samples were imported to the Republic of Korea between 2006 and 2015. Data were analyzed using the MIXED procedure of SAS. The regression procedure of SAS was used to generate the prediction equation for the lysine concentration using the crude protein (CP) concentration as an independent variable. The concentrations of moisture, gross energy, CP, ether extract, crude fiber, ash, calcium, phosphorus, lysine, methionine, cysteine, and threonine in the tested oilseed meals differed (P<0.05) depending on the producing country. The prediction equations for amino acid concentrations (% as-is basis) in the oilseed meals are: lysine = -1.08 + 0.080 × CP (root mean square error = 0.244, R² = 0.924, and P<0.001); threonine = -0.297 + 0.044 × CP (root mean square error = 0.099, R² = 0.958, and P<0.001). In conclusion, energy and nutrient compositions of the oilseed meals vary depending on the producing country. Moreover, the crude protein concentration can be used as a suitable independent variable for estimating lysine and threonine concentrations in the oilseed meals.
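
The two prediction equations reported in the abstract can be applied directly. The sketch below evaluates them for a crude protein value of 45% (as-is basis). That input is a hypothetical value chosen for illustration, and the function names are not from the paper.

```python
def predict_lysine(cp):
    """Lysine (% as-is) from crude protein (% as-is): -1.08 + 0.080 * CP."""
    return -1.08 + 0.080 * cp

def predict_threonine(cp):
    """Threonine (% as-is) from crude protein (% as-is): -0.297 + 0.044 * CP."""
    return -0.297 + 0.044 * cp

cp = 45.0  # hypothetical crude protein concentration, % as-is basis
lysine = predict_lysine(cp)
threonine = predict_threonine(cp)
print(f"lysine={lysine:.2f}%  threonine={threonine:.2f}%")
```

For CP = 45 the equations give lysine = -1.08 + 3.60 = 2.52% and threonine = -0.297 + 1.98 = 1.683%. The reported root mean square errors (0.244 and 0.099) indicate the typical size of the prediction error.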

조건부 분위수의 중도절단을 고려한 비모수적 추정 (Nonparametric estimation of conditional quantile with censored data)

  • 김은영;최혜미
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 24, No. 2
    • /
    • pp.211-222
    • /
    • 2013
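  • This paper addresses nonparametric estimation of the conditional quantile function when the data are censored. We propose new estimators by modifying, for censored data, the inverse-function-based double-kernel estimator proposed by Yu and Jones (1998) and the local logistic estimator of Lee et al. (2006), and compare them through simulation with existing kernel-smoothed estimators based on the check function of Koenker and Bassett (1978). The simulations show that the inverse-function-based estimators generally estimate the conditional quantile better when the conditional distribution is symmetric, whereas the check-function-based estimators do better when the distribution is skewed.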
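
For orientation, the check function of Koenker and Bassett (1978) is ρ_τ(u) = u(τ − 1[u < 0]), and minimising its sum over the residuals yields a τ-th quantile. The sketch below shows only that unconditional case, with no censoring and no kernel weights. It does not implement the estimators proposed in the paper, and the sample values are invented.

```python
def check_loss(u, tau):
    """Koenker-Bassett check function: u * (tau - 1[u < 0])."""
    return u * (tau - (1.0 if u < 0 else 0.0))

def quantile_by_check_loss(ys, tau):
    """Return the data point that minimises the total check loss over ys."""
    return min(ys, key=lambda q: sum(check_loss(y - q, tau) for y in ys))

# Hypothetical right-skewed sample with one large value.
ys = [1.0, 2.0, 3.0, 4.0, 100.0]
median = quantile_by_check_loss(ys, tau=0.5)
upper = quantile_by_check_loss(ys, tau=0.9)
print(median, upper)
```

A kernel-smoothed conditional version would weight each term of the sum by a kernel in the covariate. Handling censoring requires further adjustment, which this sketch leaves out.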

가능도함수를 이용한 로그분산함수의 불연속점 검정 (Testing of a discontinuity point in the log-variance function based on likelihood)

  • 허집
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 20, No. 1
    • /
    • pp.1-9
    • /
    • 2009
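  • Suppose that the variance function of a regression model is discontinuous at an unknown point. Yu and Jones (2004) log-transformed the nonnegative variance function so that it takes real values, and estimated the transformed log-variance function by local polynomial fitting. Using local polynomial fitting of the log-variance function, Huh (2008) estimated the discontinuity point of the log-variance function instead of that of the variance function. This study proposes a hypothesis test for the existence of a discontinuity point in the log-variance function, based on the asymptotic distribution of Huh's jump-size estimator, and presents simulation results for the proposed method.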

비선형 평균 일반화 이분산 자기회귀모형의 추정 (Estimation of nonlinear GARCH-M model)

  • 심주용;이장택
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 21, No. 5
    • /
    • pp.831-839
    • /
    • 2010
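  • The least squares support vector machine (LS-SVM) is a kernel method widely used for nonlinear regression and classification. To estimate the mean and volatility of financial time series, this paper proposes a nonlinear GARCH-M model that uses the weighted LS-SVM to estimate the mean and the LS-SVM to estimate the volatility. Estimation on real data shows that the proposed model has better estimation performance than the linear GARCH model and the linear GARCH-M model.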

자격증이 임금, 노동이동에 미치는 효과: 기능사 2급 자격증을 중심으로 (Analysis of Certification Effects on Wage and Labor Mobility: Evidence from Craft II Class Certification)

  • 이상준
    • 노동경제논집 (Korean Journal of Labor Economics)
    • /
    • Vol. 29, No. 2
    • /
    • pp.145-169
    • /
    • 2006
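  • This study empirically analyzes the effects of certification on wages and labor mobility, using the Craft II Class grade among national technical qualifications. Both parametric and nonparametric methods are used. In the parametric approach, the share of certificate holders by occupation and establishment size is used as an instrumental variable (IV) to address selection into certification; the nonparametric approach uses pair matching. In brief, the wage effect of certification ranges from about 5.5% to about 9.9%. Regarding the relationship between certification and labor mobility, the wage effect gained by staying with one employer is larger than the wage effect of certification realized through actual job changes. In addition, workers without certification find it relatively harder to settle at the same establishment.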

SVM을 이용한 지구에 영향을 미치는 Halo CME 예보 (Forecast of Geo-effective Halo CMEs Using SVM)

  • 최성환;문용재;박영득
    • 천문학회보 (The Bulletin of The Korean Astronomical Society)
    • /
    • Vol. 38, No. 1
    • /
    • pp.61.1-61.1
    • /
    • 2013
  • In this study we apply the Support Vector Machine (SVM) to the prediction of geo-effective halo coronal mass ejections (CMEs). The SVM, a machine learning algorithm, is used for classification and regression analysis. We use halo and partial halo CMEs from January 1996 to April 2010 in the SOHO/LASCO CME Catalog for training and prediction. We also use their associated X-ray flare classes to identify front-side halo CMEs (stronger than B1 class), and the Dst index to determine geo-effective halo CMEs (stronger than -50 nT). Combinations of the speed and angular width of the CMEs and their associated X-ray classes are used as input features of the SVM. We search for the best model by cross-validation, varying the kernel functions of the SVM and their parameters. As a result, we obtain the following statistical parameters for the best model, which uses the CME speed and its associated X-ray flare class as input features: Accuracy=0.66, PODy=0.76, PODn=0.49, FAR=0.72, Bias=1.06, CSI=0.59, TSS=0.25. The performance of the SVM, as measured by these statistical parameters, is much better than that of simple classifications based on constant classifiers.
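
The statistical parameters listed in the abstract are typically computed from a 2×2 contingency table of forecasts against observations. The sketch below uses the common forecast-verification definitions, which are assumed here because the abstract does not state its formulas, and the paper may define some scores (for example FAR) differently. The counts are hypothetical and do not reproduce the reported values.

```python
def verification_scores(hits, false_alarms, misses, correct_negatives):
    """Common forecast-verification scores from a 2x2 contingency table."""
    a, b, c, d = hits, false_alarms, misses, correct_negatives
    pod_yes = a / (a + c)  # fraction of observed events that were forecast
    pod_no = d / (b + d)   # fraction of observed non-events that were forecast
    return {
        "accuracy": (a + d) / (a + b + c + d),
        "PODy": pod_yes,
        "PODn": pod_no,
        "FAR": b / (a + b),         # false alarm ratio
        "bias": (a + b) / (a + c),  # forecast events / observed events
        "CSI": a / (a + b + c),     # critical success index
        "TSS": pod_yes + pod_no - 1.0,  # true skill statistic
    }

# Hypothetical counts, invented for illustration only.
scores = verification_scores(hits=30, false_alarms=10, misses=10, correct_negatives=50)
for name, value in scores.items():
    print(f"{name}: {value:.2f}")
```

Under these definitions TSS equals PODy + PODn − 1, which agrees with the reported values (0.76 + 0.49 − 1 = 0.25).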
