• Title/Summary/Keyword: M5P Tree-based model

Search Result 9

Prediction of short-term algal bloom using the M5P model-tree and extreme learning machine

  • Yi, Hye-Suk;Lee, Bomi;Park, Sangyoung;Kwak, Keun-Chang;An, Kwang-Guk
    • Environmental Engineering Research / v.24 no.3 / pp.404-411 / 2019
  • In this study, we designed data-driven models to predict chlorophyll-a using the M5P model tree and the extreme learning machine (ELM). The Juksan weir on the Youngsan River has high chlorophyll-a concentrations every year; chlorophyll-a is the primary indicator of algal bloom. Short-term algal bloom prediction is important for environmental management and ecological assessment. Two models were developed and evaluated for short-term algal bloom prediction. M5P is a classification and regression-analysis-based method, and ELM is a feed-forward neural network with fast learning that uses the least-squares estimate for regression. The dataset used in this study includes water temperature, rainfall, solar radiation, total nitrogen, total phosphorus, N/P ratio, and chlorophyll-a, collected daily from January 2013 to December 2016. The M5P model performed best for one-day-ahead prediction, and its performance dropped off rapidly for predictions three or more days ahead. Compared with the M5P model, the ELM model performed better for 1-7 d chlorophyll-a prediction. Moreover, in periods of rapidly increasing algal blooms, the ELM model showed higher accuracy than the M5P model.
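
The abstract describes ELM as a feed-forward network whose regression weights come from a least-squares estimate. A minimal NumPy sketch of that idea follows, assuming a single tanh hidden layer with random, untrained input weights. The function names, the hidden-layer size, and the synthetic data are illustrative assumptions, not the authors' configuration or measurements.

```python
import numpy as np

def fit_elm(X, y, n_hidden=50, seed=0):
    """Fit a single-hidden-layer ELM: random hidden weights, least-squares output."""
    rng = np.random.default_rng(seed)
    W = rng.normal(size=(X.shape[1], n_hidden))   # random input weights, never trained
    b = rng.normal(size=n_hidden)                 # random hidden biases
    H = np.tanh(X @ W + b)                        # hidden-layer activations
    beta, *_ = np.linalg.lstsq(H, y, rcond=None)  # output weights by least squares
    return W, b, beta

def predict_elm(model, X):
    """Apply the fixed hidden layer, then the fitted output weights."""
    W, b, beta = model
    return np.tanh(X @ W + b) @ beta

# Synthetic stand-in data (NOT the study's water-quality measurements).
rng = np.random.default_rng(1)
X = rng.uniform(-1, 1, size=(200, 3))
y = np.sin(X[:, 0]) + 0.5 * X[:, 1] ** 2

model = fit_elm(X, y)
train_rmse = float(np.sqrt(np.mean((predict_elm(model, X) - y) ** 2)))
```

Only the output weights are solved for, which is why training reduces to one linear least-squares problem and is fast compared with iterative backpropagation.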

Development of a model to analyze the relationship between smart pig-farm environmental data and daily weight increase based on decision tree (의사결정트리를 이용한 돈사 환경데이터와 일당증체 간의 연관성 분석 모델 개발)

  • Han, KangHwi;Lee, Woongsup;Sung, Kil-Young
    • Journal of the Korea Institute of Information and Communication Engineering / v.20 no.12 / pp.2348-2354 / 2016
  • Recently, IoT (Internet of Things) technology has been widely used in agriculture, enabling environmental and biometric data to be collected into databases. The availability of big data on agriculture has led to an increase in machine-learning-based analysis. Through such analysis, it is possible to forecast agricultural production and livestock diseases, which supports efficient decision making in smart-farm management. Herein, we use environmental and biometric data from a smart pig farm to derive an accurate model of the relationship between environmental information and the daily weight increase of swine, and we verify the accuracy of the derived model. To this end, we applied the M5P tree algorithm, a machine learning method, which reveals that wind speed is the major factor affecting the daily weight increase of swine.
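
M5-style model trees choose each split by maximizing the standard deviation reduction (SDR) of the target, so the variable chosen at the root is a natural candidate for the "major factor". The sketch below shows only that split criterion. A full M5P implementation also fits linear models at the leaves, prunes, and smooths. The column order (temperature, humidity, wind speed) and the synthetic data are hypothetical and are not taken from the paper.

```python
import numpy as np

def best_sdr_split(X, y):
    """Return (feature index, threshold, SDR) for the split with the largest
    standard deviation reduction of the target y."""
    best = (None, None, 0.0)
    n = len(y)
    for j in range(X.shape[1]):
        values = np.unique(X[:, j])
        # Candidate thresholds: midpoints between consecutive distinct values.
        for t in (values[:-1] + values[1:]) / 2:
            left, right = y[X[:, j] <= t], y[X[:, j] > t]
            sdr = y.std() - (len(left) / n) * left.std() - (len(right) / n) * right.std()
            if sdr > best[2]:
                best = (j, float(t), float(sdr))
    return best

# Synthetic stand-in data: columns are (temperature, humidity, wind_speed),
# all hypothetical. Daily gain steps up when wind_speed exceeds 0.5.
rng = np.random.default_rng(0)
X = rng.uniform(0, 1, size=(120, 3))
y = np.where(X[:, 2] > 0.5, 0.9, 0.6) + 0.01 * rng.normal(size=120)

feature, threshold, sdr = best_sdr_split(X, y)
```

On this synthetic data the root split falls on the third column, mirroring how a root-level split variable can be read as the dominant factor.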

Application of machine learning methods for predicting the mechanical properties of rubbercrete

  • Miladirad, Kaveh;Golafshani, Emadaldin Mohammadi;Safehian, Majid;Sarkar, Alireza
    • Advances in Concrete Construction / v.14 no.1 / pp.15-34 / 2022
  • The use of waste rubber in concrete can reduce natural aggregate consumption and improve some technical properties of concrete. Although there are several equations for estimating the mechanical properties of concrete containing waste rubber, only a limited number of machine-learning-based models have been proposed to predict the mechanical properties of rubbercrete. In this study, an extensive database of the mechanical properties of rubbercrete was gathered from a comprehensive survey of the literature. To model these properties, two machine learning techniques, the M5P tree and linear gene expression programming (LGEP), were employed to obtain reliable mathematical equations. Two procedures of input variable selection were considered. In the first procedure, the crucial component ratios of rubbercrete and the concrete age were taken as the input variables. In the second procedure, the volumes of the coarse and fine waste rubber and the compressive strength of concrete without waste rubber were taken as the input variables. The results show that the models obtained by LGEP are more accurate than those obtained by the M5P model tree and existing traditional equations. In addition, the input variables of the second procedure are better predictors of the mechanical properties of rubbercrete than those of the first procedure.

Shifts of Geographic Distribution of Pinus koraiensis Based on Climate Change Scenarios and GARP Model (GARP 모형과 기후변화 시나리오에 따른 잣나무의 지리적 분포 변화)

  • Chun, Jung Hwa;Lee, Chang Bae;Yoo, So Min
    • Korean Journal of Agricultural and Forest Meteorology / v.17 no.4 / pp.348-357 / 2015
  • The main purpose of this study is to understand the potential geographic distribution of P. koraiensis, known to be one of the major economic tree species, based on the RCP (Representative Concentration Pathway) 8.5 scenarios and the current geographic distribution from National Forest Inventory (NFI) data, using ecological niche modeling. P. koraiensis abundance data extracted from the NFI were used to estimate the current geographic distribution. The GARP (Genetic Algorithm for Rule-set Production) model, one of the ecological niche models, was applied to estimate the potential geographic distribution and to project future changes. The model was run for each of the 27 environmental explanatory variables, and those with an Area Under Curve (AUC) value greater than 0.6 were selected for the final model. Model validation, based on confusion matrix statistics, showed quite high suitability. Currently, P. koraiensis is distributed widely from 300 m to 1,200 m in altitude and from south to north as a result of the national greening project in the 1970s, although major populations are found in elevated and northern areas. The results of this study were successful in showing the current distribution of P. koraiensis and projecting its future changes. The future model for P. koraiensis suggests that large areas predicted under current climate conditions may contract by the 2090s, showing dramatic habitat loss. Considering the increasing atmospheric $CO_2$ and air temperature in Korea, P. koraiensis seems likely to experience a significant decrease in potential distribution range in the future. The final model in this study may be used to identify climate change impacts on the distribution of P. koraiensis in Korea, and a deeper understanding of this correlation may be helpful when planning afforestation strategies.

Data Modeling using Cluster Based Fuzzy Model Tree (클러스터 기반 퍼지 모델트리를 이용한 데이터 모델링)

  • Lee, Dae-Jong;Park, Jin-Il;Park, Sang-Young;Jung, Nahm-Chung;Chun, Meung-Geun
    • Journal of the Korean Institute of Intelligent Systems / v.16 no.5 / pp.608-615 / 2006
  • This paper proposes a fuzzy model tree consisting of local linear models built with fuzzy clustering for data modeling. First, cluster centers are calculated by a fuzzy clustering method using all input and output attributes. Then, linear models are constructed at internal nodes using the fuzzy membership values between the centers and the input attributes. Whether an internal node is expanded is determined by comparing the error calculated in the parent node with the errors in its child nodes. As a final step, prediction is performed with the linear model that has the highest fuzzy membership value between the input attributes and the cluster centers in the leaf nodes. To show the effectiveness of the proposed method, we applied it to various datasets. Across these experiments, the proposed method shows better performance than a conventional model tree and artificial neural networks.

A study of glass and carbon fibers in FRAC utilizing machine learning approach

  • Ankita Upadhya;M. S. Thakur;Nitisha Sharma;Fadi H. Almohammed;Parveen Sihag
    • Advances in Materials Research / v.13 no.1 / pp.63-86 / 2024
  • Asphalt concrete (AC) is a mixture of bitumen and aggregates that is very sensitive in the design of flexible pavement. In this study, the Marshall stability of glass and carbon fiber bituminous concrete was predicted using Artificial Neural Network (ANN), Support Vector Machine (SVM), Random Forest (RF), and M5P Tree machine learning algorithms. To predict the Marshall stability, nine input parameters, i.e., bitumen, glass and carbon fibers mixed in 100:0, 75:25, 50:50, 25:75, and 0:100 percentages (designated as 100GF:0CF, 75GF:25CF, 50GF:50CF, 25GF:75CF, 0GF:100CF), bitumen grade (VG), fiber length (FL), and fiber diameter (FD), were taken from experimental and literature data. Seven statistical indices, i.e., coefficient of correlation (CC), mean absolute error (MAE), root mean squared error (RMSE), relative absolute error (RAE), root relative squared error (RRSE), scattering index (SI), and BIAS, were applied to assess the effectiveness of the developed models. According to the performance evaluation results, the ANN outperformed the other models, with CC values of 0.9147 and 0.8648, MAE values of 1.3757 and 1.978, RMSE values of 1.843 and 2.6951, RAE values of 39.88 and 49.31, RRSE values of 40.62 and 50.50, SI values of 0.1379 and 0.2027, and BIAS values of -0.1290 and -0.2357 in the training and testing stages, respectively. The Taylor diagram (testing stage) also confirmed that the ANN-based model outperforms the other models. The sensitivity analysis showed that fiber length is the most influential of the nine input parameters, whereas the 25GF:75CF fiber combination was the most effective of all the fiber mixes for Marshall stability.
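
The seven indices named above can be computed as in the following sketch. CC, MAE, RMSE, RAE, and RRSE follow their usual definitions, with RAE and RRSE expressed as percentages relative to a mean-only predictor. The abstract does not define SI or BIAS, so the forms used here (RMSE divided by the observed mean, and mean of predicted minus observed) are assumptions, as are the function name and the toy numbers.

```python
import numpy as np

def regression_indices(observed, predicted):
    """Compute seven regression indices for paired observed/predicted values."""
    o = np.asarray(observed, dtype=float)
    p = np.asarray(predicted, dtype=float)
    err = p - o           # prediction errors
    dev = o - o.mean()    # deviations of observations from their mean
    rmse = np.sqrt(np.mean(err ** 2))
    return {
        "CC": np.corrcoef(o, p)[0, 1],
        "MAE": np.mean(np.abs(err)),
        "RMSE": rmse,
        "RAE_%": 100 * np.sum(np.abs(err)) / np.sum(np.abs(dev)),
        "RRSE_%": 100 * np.sqrt(np.sum(err ** 2) / np.sum(dev ** 2)),
        "SI": rmse / o.mean(),   # assumed definition of the scattering index
        "BIAS": np.mean(err),    # assumed sign convention (predicted - observed)
    }

# Toy values for illustration only, not data from the study.
scores = regression_indices([10.0, 12.0, 14.0, 16.0], [11.0, 11.0, 15.0, 15.0])
```

RAE and RRSE below 100% indicate that the model beats a predictor that always returns the observed mean.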

SELDI-TOF MS Combined with Magnetic Beads for Detecting Serum Protein Biomarkers and Establishment of a Boosting Decision Tree Model for Diagnosis of Pancreatic Cancer

  • Qian, Jing-Yi;Mou, Si-Hua;Liu, Chi-Bo
    • Asian Pacific Journal of Cancer Prevention / v.13 no.5 / pp.1911-1915 / 2012
  • Aim: New technologies for the early detection of pancreatic cancer (PC) are urgently needed. The aim of the present study was to screen for potential protein biomarkers in serum using proteomic fingerprint technology. Methods: Magnetic beads combined with surface-enhanced laser desorption/ionization (SELDI) TOF MS were used to profile and compare the protein spectra of serum samples from 85 patients with pancreatic cancer, 50 patients with acute-on-chronic pancreatitis, and 98 healthy blood donors. Proteomic patterns associated with pancreatic cancer were identified with Biomarker Patterns Software. Results: A total of 37 differential m/z peaks related to PC were identified (P < 0.01). A tree model was constructed with the software based on three biomarkers (7762 Da, 8560 Da, 11654 Da); it showed excellent separation between pancreatic cancer and non-cancer, with a sensitivity of 93.3% and a specificity of 95.6%. Blind test data showed a sensitivity of 88% and a specificity of 91.4%. Conclusions: The results suggest that serum biomarkers for pancreatic cancer can be detected using SELDI-TOF-MS combined with magnetic beads. Application of combined biomarkers may provide a powerful and reliable diagnostic method for pancreatic cancer with high sensitivity and specificity.
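
Sensitivity and specificity, as reported above, are ratios taken from a confusion matrix. The short sketch below shows the arithmetic. The counts are hypothetical and are not reconstructed from the study.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical counts for illustration only:
# 50 cancer cases (45 detected), 100 non-cancer cases (90 correctly cleared).
sens, spec = sensitivity_specificity(tp=45, fn=5, tn=90, fp=10)
```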

Comparison of machine learning algorithms to evaluate strength of concrete with marble powder

  • Sharma, Nitisha;Upadhya, Ankita;Thakur, Mohindra S.;Sihag, Parveen
    • Advances in Materials Research / v.11 no.1 / pp.75-90 / 2022
  • In this paper, the functionality of soft computing algorithms such as the group method of data handling (GMDH), random forest (RF), random tree (RT), linear regression (LR), M5P, and artificial neural network (ANN) was examined for predicting the compressive strength of concrete mixed with marble powder. The assessment of the results suggests that the ANN-based model performs better overall than the other applied algorithms in estimating the compressive strength of concrete. The coefficient of correlation (CC) was highest for the ANN model (0.9139), followed by RT (0.8241), and the root mean square error (RMSE) was lowest for ANN (4.5611), followed by RT (5.4246). Similarly, for the other evaluation parameters, the Willmott's index and Nash-Sutcliffe coefficient values of ANN were 0.9458 and 0.7502, followed by those of the RT model (0.8763 and 0.6628). The results showed that, for both the training and testing subsets, ANN has the potential to estimate the compressive strength of concrete. Also, the sensitivity results suggest that, with the ANN-based model, the water-cement ratio has a major impact on estimating the compressive strength of concrete with marble powder compared with the other parameters for this data set.
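
For reference, the two agreement measures mentioned above can be computed as follows. The abstract does not say which variant of Willmott's index was used, so this sketch assumes the original index of agreement. The function names and toy values are illustrative only.

```python
import numpy as np

def nash_sutcliffe(observed, predicted):
    """NSE = 1 - sum((o - p)^2) / sum((o - mean(o))^2)."""
    o = np.asarray(observed, dtype=float)
    p = np.asarray(predicted, dtype=float)
    return 1 - np.sum((o - p) ** 2) / np.sum((o - o.mean()) ** 2)

def willmott_index(observed, predicted):
    """Original Willmott index of agreement (assumed variant):
    d = 1 - sum((o - p)^2) / sum((|p - mean(o)| + |o - mean(o)|)^2)."""
    o = np.asarray(observed, dtype=float)
    p = np.asarray(predicted, dtype=float)
    denom = np.sum((np.abs(p - o.mean()) + np.abs(o - o.mean())) ** 2)
    return 1 - np.sum((o - p) ** 2) / denom

# Toy strengths for illustration only, not data from the study.
obs = [20.0, 30.0, 40.0, 50.0]
pred = [22.0, 28.0, 41.0, 49.0]
nse = nash_sutcliffe(obs, pred)
d = willmott_index(obs, pred)
```

Both measures equal 1 for a perfect fit. NSE drops to 0 when the model is no better than predicting the observed mean.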

Modelling Analysis of Climate and Soil Depth Effects on Pine Tree Dieback in Korea Using BIOME-BGC (BIOME-BGC 모형을 이용한 국내 소나무 고사의 기후 및 토심 영향 분석)

  • Kang, Sinkyu;Lim, Jong-Hwan;Kim, Eun-Sook;Cho, Nanghyun
    • Korean Journal of Agricultural and Forest Meteorology / v.18 no.4 / pp.242-252 / 2016
  • A process-based ecosystem model, BIOME-BGC, was applied to simulate seasonal and inter-annual dynamics of carbon and water processes for the potential evergreen needleleaf forest (ENF) biome in Korea. Two simulation sites, Milyang and Uljin, were selected to reflect warm-and-dry and cool-and-wet climate regimes, respectively; massive diebacks of pines, including Pinus densiflora, P. koraiensis, and P. thunbergii, were observed there in 2009 and 2014, respectively. The Standard Precipitation Index (SPI) showed periodic drought occurrence every 5 years or so at both sites. Since the mid-2000s, droughts have occurred under hotter climate conditions. Among many model variables, Cpool (i.e., a temporary carbon pool reserving photosynthetic compounds before allocation to new tissue production) was identified as a useful proxy variable for tree carbon starvation caused by a reduction in gross primary production (GPP) and/or an increase in maintenance respiration (Rm). Temporal Cpool variation agreed well with the timing of pine tree diebacks at both sites. Though water stress was important, warmer winter and spring temperatures also played critical roles in the reduction of Cpool, especially for the cool-and-wet Uljin. Shallow soil depth intensified the drought effect, which was, however, marginal for soil depths shallower than 0.5 m. Our modeling analysis implies that seasonal drought and a warmer climate can intensify the vulnerability of ENF to dieback in Korea, especially on shallower soils, where multi-year continued stress is of more concern than short-term episodic stress.