• Title/Summary/Keyword: tree-based models

Search Result 437, Processing Time 0.025 seconds

Computational analysis of pollutant dispersion in urban street canyons with tree planting influenced by building roof shapes

  • Bouarbi, Lakhdar;Abed, Bouabdellah;Bouzit, Mohamed
    • Wind and Structures
    • /
    • v.23 no.6
    • /
    • pp.505-521
    • /
    • 2016
  • The objective of this study is to investigate numerically the effect of building roof shaps on wind flow and pollutant dispersion in a street canyon with one row of trees of pore volume, $P_{vol}=96%$. A three-dimensional computational fluid dynamics (CFD) model is used to evaluate air flow and pollutant dispersion within an urban street canyon using Reynolds-averaged Navier-Stokes (RANS) equations and the Explicit Algebraic Reynolds Stress Models (EARSM) based on k-${\varepsilon}$ turbulence model to close the equation system. The numerical model is performed with ANSYS-CFX code. Vehicle emissions were simulated as double line sources along the street. The numerical model was validated by the wind tunnel experiment results. Having established this, the wind flow and pollutant dispersion in urban street canyons (with six roof shapes buildings) are simulated. The numerical simulation results agree reasonably with the wind tunnel data. The results obtained in this work, indicate that the flow in 3D domain is more complicated; this complexity is increased with the presence of trees and variability of the roof shapes. The results also indicated that the largest pollutant concentration level for two walls (leeward and windward wall) is observed with the upwind wedge-shaped roof. But the smallest pollutant concentration level is observed with the dome roof-shaped.

A Numerical Study of Flame Spread of A Surface Forest Fire (지표화 산불의 화염전파 수치해석)

  • Kim, Dong-Hyun;Lee, Myung-Bo;Kim, Kwang-Il
    • 한국전산유체공학회:학술대회논문집
    • /
    • 2008.03b
    • /
    • pp.80-83
    • /
    • 2008
  • The characteristics of the spread of a forest fire are generally related to the attributes of combustibles, geographical features, and meteorological conditions, such as wind conditions. The most common methodology used to create a prediction model for the spread of forest fires, based on the numerical analysis of the development stages of a forest fire, is an analysis of heat energy transmission by the stage of heat transmission. When a forest fire breaks out, the analysis of the transmission velocity of heat energy is quantifiable by the spread velocity of flame movement through a physical and chemical analysis at every stage of the fire development from flame production and heat transmission to its termination. In this study, the formula used for the 1-dimensional surface forest fire behavior prediction model, derived from a numerical analysis of the surface flame spread rate of solid combustibles, is introduced. The formula for the 1-dimensional surface forest fire behavior prediction model is the estimated equation of the flame spread velocity, depending on the condition of wind velocity on the ground. Experimental and theoretical equations on flame duration, flame height, flame temperature, ignition temperature of surface fuels, etc., has been applied to the device of this formula. As a result of a comparison between the ROS(rate of spread) from this formula and ROSs from various equations of other models or experimental values, a trend suggesting an increasing curved line of the exponent function under 3m/s or less wind velocity condition was identified. As a result of a comparison between experimental values and numerically analyzed values for fallen pine tree leaves, the flame spread velocity reveals has a error of less than 20%.

  • PDF

Study of child abuse families using logistic regression models (로지스틱회귀모형을 활용한 아동학대 가족의 연구)

  • Min, Dae Kee;Choi, Mi Kyung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.5
    • /
    • pp.1327-1336
    • /
    • 2016
  • Most cases of child abuse in South Korea are caused by parents in the family home. Currently, these types of incidents are growing. Child abuse creates irreparable damage to a child's development and its effects are prolonged. This damage can create a maladjusted adolescent and adult criminal acts. Because of this damage and the long lasting effects on a person and society as a whole, special attention needs to be paid to this pressing issue. South Korea's rapidly changing social environment has created a variety of new family forms including dual-income families and single-parent families. With the current economic downturn and accompanying employment instability, many families exist in uneasy financial and emotional states. The children in these stressful family environments are the most vulnerable and live in risk of experiencing physical or psychological abuse from their parents. In the context of significant and often difficult social changes, this study identifies the characteristics of child abuse based on family status and parental mental health.

Development of Electronic Management System for improving the utilization of Engineering Model in Domestic Nuclear Power Plant (국내 원전 엔지니어링운영모델 활용성 향상을 위한 시스템 개발)

  • Lee, Sang-Dae;Kim, Jung-Wun;Kim, Mun-Soo
    • Journal of the Korean Society of Safety
    • /
    • v.36 no.5
    • /
    • pp.79-85
    • /
    • 2021
  • A standard engineering model that reflects the current organization system and engineering operation process of domestic nuclear power plants was developed based on the Standard Nuclear Performance Model developed by the American Nuclear Energy Association. The level 0 screen, which is the main screen of the engineering model computer system, consisted of an object tree structure, which provided information that is phased down from a higher structure level to a lower structure level (i.e., level 3). The level 1 screen provided information related to the sub-process of the engineering operation, whereas the Level 2 screen provided information related to each engineering operation activity. In addition, the Level 2 screen provided additional functions, such as linking electronic procedures/guidelines, providing electronic performance forms, and connecting legacy computer systems (such as total equipment reliability monitoring system, configuration management systems, technical information systems, risk monitoring systems, regulatory information, and electronic drawing system). This screen level increased the convenience of user's engineering tasks by implementing them. The computerization of an engineering model that connects the entire engineering tasks of an establishment enables the easy understanding of information related to the engineering process before and after the operation, and builds a foundation for the enhancement of the work efficiency and employee capacity. In addition, KHNP developed an online training module, which operates as an e-learning process, on the overview and utilization of a standard engineering model to expand the understanding of standard engineering models by plant employees and to secure competitiveness.

Case Study: Groundwater Recharge Hydrograph in Pyeongchang River (평창강 지하수 함양곡선 연구)

  • Kwak, Jaewon
    • Journal of Wetlands Research
    • /
    • v.23 no.2
    • /
    • pp.173-182
    • /
    • 2021
  • It is important to extract and assess low-flow recession characteristics for water resources management in the upper reaches of a stream. It is difficult to express the groundwater flow recession characteristics for streamflow synthetically. The linear recession model has been widely used by baseflow recession analysis for reason of simplicity and convenience, but recent studies show that nonlinear recession models fit well, and the relationship between the reservoir storage of shallow unconfined aquifers and the groundwater discharge was to be identified as nonlinear in the literature based on the analysis of numerous streamflow recession curves. The objective of the study is to decode these nonlinear characteristics, including evaporation loss, storage, and recharge of groundwater using streamflow. By analyzing the observed time series of streamflow from the study area, which is the Pyeongchang River basin in Korea, the main components of the underlying groundwater balance, namely, discharge, evaporation loss, storage, and recharge, can be identified and quantified. As a result of the study, depletion of groundwater by evapotranspiration losses through the water uptake of tree roots was found to bias the recession curves and the estimated reservoir parameters. The seasonality of both rainfall and potential evaporation, analysis of the recession curves, stratified according to time of the year, allowed the quantification of evapotranspiration loss as a function of a calendar month and stored groundwater storage.

Feature Extraction and Evaluation for Classification Models of Injurious Falls Based on Surface Electromyography

  • Lim, Kitaek;Choi, Woochol Joseph
    • Physical Therapy Korea
    • /
    • v.28 no.2
    • /
    • pp.123-131
    • /
    • 2021
  • Background: Only 2% of falls in older adults result in serious injuries (i.e., hip fracture). Therefore, it is important to differentiate injurious versus non-injurious falls, which is critical to develop effective interventions for injury prevention. Objects: The purpose of this study was to a. extract the best features of surface electromyography (sEMG) for classification of injurious falls, and b. find a best model provided by data mining techniques using the extracted features. Methods: Twenty young adults self-initiated falls and landed sideways. Falling trials were consisted of three initial fall directions (forward, sideways, or backward) and three knee positions at the time of hip impact (the impacting-side knee contacted the other knee ("knee together") or the mat ("knee on mat"), or neither the other knee nor the mat was contacted by the impacting-side knee ("free knee"). Falls involved "backward initial fall direction" or "free knee" were defined as "injurious falls" as suggested from previous studies. Nine features were extracted from sEMG signals of four hip muscles during a fall, including integral of absolute value (IAV), Wilson amplitude (WAMP), zero crossing (ZC), number of turns (NT), mean of amplitude (MA), root mean square (RMS), average amplitude change (AAC), difference absolute standard deviation value (DASDV). The decision tree and support vector machine (SVM) were used to classify the injurious falls. Results: For the initial fall direction, accuracy of the best model (SVM with a DASDV) was 48%. For the knee position, accuracy of the best model (SVM with an AAC) was 49%. Furthermore, there was no model that has sensitivity and specificity of 80% or greater. Conclusion: Our results suggest that the classification model built upon the sEMG features of the four hip muscles are not effective to classify injurious falls. Future studies should consider other data mining techniques with different muscles.

Multi-dimensional Analysis and Prediction Model for Tourist Satisfaction

  • Shrestha, Deepanjal;Wenan, Tan;Gaudel, Bijay;Rajkarnikar, Neesha;Jeong, Seung Ryul
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.2
    • /
    • pp.480-502
    • /
    • 2022
  • This work assesses the degree of satisfaction tourists receive as final recipients in a tourism destination based on the fact that satisfied tourists can make a significant contribution to the growth and continuous improvement of a tourism business. The work considers Pokhara, the tourism capital of Nepal as a prefecture of study. A stratified sampling methodology with open-ended survey questions is used as a primary source of data for a sample size of 1019 for both international and domestic tourists. The data collected through a survey is processed using a data mining tool to perform multi-dimensional analysis to discover information patterns and visualize clusters. Further, supervised machine learning algorithms, kNN, Decision tree, Support vector machine, Random forest, Neural network, Naive Bayes, and Gradient boost are used to develop models for training and prediction purposes for the survey data. To find the best model for prediction purposes, different performance matrices are used to evaluate a model for performance, accuracy, and robustness. The best model is used in constructing a learning-enabled model for predicting tourists as satisfied, neutral, and unsatisfied visitors. This work is very important for tourism business personnel, government agencies, and tourism stakeholders to find information on tourist satisfaction and factors that influence it. Though this work was carried out for Pokhara city of Nepal, the study is equally relevant to any other tourism destination of similar nature.

Predicting rock brittleness indices from simple laboratory test results using some machine learning methods

  • Davood Fereidooni;Zohre Karimi
    • Geomechanics and Engineering
    • /
    • v.34 no.6
    • /
    • pp.697-726
    • /
    • 2023
  • Brittleness as an important property of rock plays a crucial role both in the failure process of intact rock and rock mass response to excavation in engineering geological and geotechnical projects. Generally, rock brittleness indices are calculated from the mechanical properties of rocks such as uniaxial compressive strength, tensile strength and modulus of elasticity. These properties are generally determined from complicated, expensive and time-consuming tests in laboratory. For this reason, in the present research, an attempt has been made to predict the rock brittleness indices from simple, inexpensive, and quick laboratory test results namely dry unit weight, porosity, slake-durability index, P-wave velocity, Schmidt rebound hardness, and point load strength index using multiple linear regression, exponential regression, support vector machine (SVM) with various kernels, generating fuzzy inference system, and regression tree ensemble (RTE) with boosting framework. So, this could be considered as an innovation for the present research. For this purpose, the number of 39 rock samples including five igneous, twenty-six sedimentary, and eight metamorphic were collected from different regions of Iran. Mineralogical, physical and mechanical properties as well as five well known rock brittleness indices (i.e., B1, B2, B3, B4, and B5) were measured for the selected rock samples before application of the above-mentioned machine learning techniques. The performance of the developed models was evaluated based on several statistical metrics such as mean square error, relative absolute error, root relative absolute error, determination coefficients, variance account for, mean absolute percentage error and standard deviation of the error. The comparison of the obtained results revealed that among the studied methods, SVM is the most suitable one for predicting B1, B2 and B5, while RTE predicts B3 and B4 better than other methods.

Model Development for Specific Degradation Using Data Mining and Geospatial Analysis of Erosion and Sedimentation Features

  • Kang, Woochul;Kang, Joongu;Jang, Eunkyung;Julien, Piere Y.
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2020.06a
    • /
    • pp.85-85
    • /
    • 2020
  • South Korea experiences few large scale erosion and sedimentation problems, however, there are numerous local sedimentation problems. A reliable and consistent approach to modelling and management for sediment processes are desirable in the country. In this study, field measurements of sediment concentration from 34 alluvial river basins in South Korea were used with the Modified Einstein Procedure (MEP) to determine the total sediment load at the sampling locations. And then the Flow Duration-Sediment Rating Curve (FD-SRC) method was used to estimate the specific degradation for all gauging stations. The specific degradation of most rivers were found to be typically 50-300 tons/㎢·yr. A model tree data mining technique was applied to develop a model for the specific degradation based on various watershed characteristics of each watershed from GIS analysis. The meaningful parameters are: 1) elevation at the middle relative area of the hypsometric curve [m], 2) percentage of wetland and water [%], 3) percentage of urbanized area [%], and 4) Main stream length [km]. The Root Mean Square Error (RMSE) of existing models is in excess of 1,250 tons/㎢·yr and the RMSE of the proposed model with 6 additional validations decreased to 65 tons/㎢·yr. Erosion loss maps from the Revised Universal Soil Loss Equation (RUSLE), satellite images, and aerial photographs were used to delineate the geospatial features affecting erosion and sedimentation. The results of the geospatial analysis clearly shows that the high risk erosion area (hill slopes and construction sites at urbanized area) and sedimentation features (wetlands and agricultural reservoirs). The result of physiographical analysis also indicates that the watershed morphometric characteristic well explain the sediment transport. Sustainable management with the data mining methodologies and geospatial analysis could be helpful to solve various erosion and sedimentation problems under different conditions.

  • PDF

Development of Sentiment Analysis Model for the hot topic detection of online stock forums (온라인 주식 포럼의 핫토픽 탐지를 위한 감성분석 모형의 개발)

  • Hong, Taeho;Lee, Taewon;Li, Jingjing
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.1
    • /
    • pp.187-204
    • /
    • 2016
  • Document classification based on emotional polarity has become a welcomed emerging task owing to the great explosion of data on the Web. In the big data age, there are too many information sources to refer to when making decisions. For example, when considering travel to a city, a person may search reviews from a search engine such as Google or social networking services (SNSs) such as blogs, Twitter, and Facebook. The emotional polarity of positive and negative reviews helps a user decide on whether or not to make a trip. Sentiment analysis of customer reviews has become an important research topic as datamining technology is widely accepted for text mining of the Web. Sentiment analysis has been used to classify documents through machine learning techniques, such as the decision tree, neural networks, and support vector machines (SVMs). is used to determine the attitude, position, and sensibility of people who write articles about various topics that are published on the Web. Regardless of the polarity of customer reviews, emotional reviews are very helpful materials for analyzing the opinions of customers through their reviews. Sentiment analysis helps with understanding what customers really want instantly through the help of automated text mining techniques. Sensitivity analysis utilizes text mining techniques on text on the Web to extract subjective information in the text for text analysis. Sensitivity analysis is utilized to determine the attitudes or positions of the person who wrote the article and presented their opinion about a particular topic. In this study, we developed a model that selects a hot topic from user posts at China's online stock forum by using the k-means algorithm and self-organizing map (SOM). In addition, we developed a detecting model to predict a hot topic by using machine learning techniques such as logit, the decision tree, and SVM. We employed sensitivity analysis to develop our model for the selection and detection of hot topics from China's online stock forum. The sensitivity analysis calculates a sentimental value from a document based on contrast and classification according to the polarity sentimental dictionary (positive or negative). The online stock forum was an attractive site because of its information about stock investment. Users post numerous texts about stock movement by analyzing the market according to government policy announcements, market reports, reports from research institutes on the economy, and even rumors. We divided the online forum's topics into 21 categories to utilize sentiment analysis. One hundred forty-four topics were selected among 21 categories at online forums about stock. The posts were crawled to build a positive and negative text database. We ultimately obtained 21,141 posts on 88 topics by preprocessing the text from March 2013 to February 2015. The interest index was defined to select the hot topics, and the k-means algorithm and SOM presented equivalent results with this data. We developed a decision tree model to detect hot topics with three algorithms: CHAID, CART, and C4.5. The results of CHAID were subpar compared to the others. We also employed SVM to detect the hot topics from negative data. The SVM models were trained with the radial basis function (RBF) kernel function by a grid search to detect the hot topics. The detection of hot topics by using sentiment analysis provides the latest trends and hot topics in the stock forum for investors so that they no longer need to search the vast amounts of information on the Web. Our proposed model is also helpful to rapidly determine customers' signals or attitudes towards government policy and firms' products and services.