• Title/Summary/Keyword: tree-based models

Search Result 437, Processing Time 0.026 seconds

Movie Popularity Classification Based on Support Vector Machine Combined with Social Network Analysis

  • Dorjmaa, Tserendulam;Shin, Taeksoo
    • Journal of Information Technology Services
    • /
    • v.16 no.3
    • /
    • pp.167-183
    • /
    • 2017
  • The rapid growth of information technology and mobile service platforms, i.e., internet, google, and facebook, etc. has led the abundance of data. Due to this environment, the world is now facing a revolution in the process that data is searched, collected, stored, and shared. Abundance of data gives us several opportunities to knowledge discovery and data mining techniques. In recent years, data mining methods as a solution to discovery and extraction of available knowledge in database has been more popular in e-commerce service fields such as, in particular, movie recommendation. However, most of the classification approaches for predicting the movie popularity have used only several types of information of the movie such as actor, director, rating score, language and countries etc. In this study, we propose a classification-based support vector machine (SVM) model for predicting the movie popularity based on movie's genre data and social network data. Social network analysis (SNA) is used for improving the classification accuracy. This study builds the movies' network (one mode network) based on initial data which is a two mode network as user-to-movie network. For the proposed method we computed degree centrality, betweenness centrality, closeness centrality, and eigenvector centrality as centrality measures in movie's network. Those four centrality values and movies' genre data were used to classify the movie popularity in this study. The logistic regression, neural network, $na{\ddot{i}}ve$ Bayes classifier, and decision tree as benchmarking models for movie popularity classification were also used for comparison with the performance of our proposed model. To assess the classifier's performance accuracy this study used MovieLens data as an open database. Our empirical results indicate that our proposed model with movie's genre and centrality data has by approximately 0% higher accuracy than other classification models with only movie's genre data. The implications of our results show that our proposed model can be used for improving movie popularity classification accuracy.

Applying Nonlinear Mixed-effects Models to Taper Equations: A Case Study of Pinus densiflora in Gangwon Province, Republic of Korea (비선형 혼합효과 모형의 수간곡선 적용: 강원지방 소나무를 대상으로)

  • Shin, Joong-Hoon;Han, Hee;Ko, Chi-Ung;Kang, Jin-Taek;Kim, Young-Hwan
    • Journal of Korean Society of Forest Science
    • /
    • v.111 no.1
    • /
    • pp.136-149
    • /
    • 2022
  • In this study, the performance of a nonlinear mixed-effects (NLME) model used to estimate the stem taper of Pinus densiflora in Gangwon Province was compared with that of a nonlinear fixed-effects (NLFE) model using several performance measures. For the diameters of whole tree stems, the NLME model improved on the performance of the NLFE model by 26.4%, 42.9%, 43.1%, and 0.9% in terms of BIAS, MAB, RMSE, and FI, respectively. For the cross-section areas of whole tree stems, the NLME model improved on the performance of the NLFE model by 67.7%, 44.7%, 45.8%, and 1.0% in terms of BIAS, MAB, RMSE, and FI, respectively. Based on the analysis of 12 relative height classes of tree stems, stem taper estimation performance was also reasonably improved by the NLME model, which showed better MAB, RMSE, and FI at every relative height class compared with those of the NLFE model. In some classes, the NLFE model had better BIAS than the NLME model (stem diameter: 0.05, 0.2, 0.3, and 0.8; stem cross-section area: 0.05, 0.3, 0.5, 0.6, and 1.0). However, the NLME model enhanced the performance of stem diameter and cross-section area estimations at the lowest stem part (0.2 m from the ground). Improvements for stem diameter in terms of BIAS, MAB, RMSE, and FI were 84.2%, 69.8%, 68.7%, and 3.1%, respectively. For stem cross-section areas, the improvements in BIAS, MAB, RMSE, and FI were 98.5%, 70.1%, 68.7%, and 3.1%, respectively. The cross-section area at 0.2 m from the ground occupied 22.7% of total cross-section area. Improvements in estimation of cross-section area at the lowest stem part indicate that stem volume estimation performance could also be enhanced. Although NLME models are more difficult to fit than NLFE models, the use of NLME models as a standard method for the estimating the parameters of stem taper equations should be considered.

ForTIA : A Tool Supporting Formal Method based on LOTOS (ForTIA: LOTOS 기반의 정형기법 지원도구)

  • Cho, Soo-Sun;Cheon, Yoon-Sik;Oh, Young-Bae;Chung, Yun-Dae
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.6 no.2
    • /
    • pp.161-172
    • /
    • 2000
  • In this paper, we introduce the development of a LOTOS-based tool, supporting formal methods, called ForTIA (A Formalism for Telecommunication and Information Systems). By using LOTOS, an ISO standard formal specification language, the user requirements and system models can be abstracted and represented formally. Therefore, the system can be validated and verified on the specifications, before implementations. ForTIA supports light-weight formal methods based on validation to be used in real industry. Key functions of ForTIA are simulation and C++ code generation. In simulation, tree based visual validation mechanism is provided and in code generation, the full C++ source code is generated to be used for system implementations.

  • PDF

Rainfall Interception by and Quantitative Models for Urban Landscape Trees - For Seven Native Species - (도시조경수의 우수차집 효과와 계량모델 - 7개 향토수종을 대상으로 -)

  • Park, Hye-Mi;Jo, Hyun-Kil;Kim, Jin-Young
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.49 no.4
    • /
    • pp.30-40
    • /
    • 2021
  • This study developed quantitative models to estimate the rainfall interception by seven native landscape tree species based on throughfall measurements. The tree species considered in this study were Abies holophylla, Acer palmatum, Ginkgo biloba, Pinus densiflora, Pinus koraiensis, Prunus yedoensis, and Zelkova serrata, which are frequently planted in the Korea. Among these species, 35.8% of the annual precipitation was intercepted by P. koraiensis, 34.1% by A. holophylla, 31.0% by Z. serrata, 27.6% by P. densiflora, 26.9% by G. biloba, 18.6% by A. palmatum, and 18.4% by P. yedoensis. All the quantitative models showed high fitness with r2 values of 0.90-0.99. The annual rainfall interception from a tree with DBH of 20 cm were greatest with Z. serrata (5.1 m3/yr), followed by P. koraiensis (4.1 m3/yr), A. holophylla (3.1 m3/yr), G. biloba (2.8 m3/yr), P. densiflora (2.1 m3/yr), P. yedoensis (1.9 m3/yr), and A. palmatum (1.8 m3/yr) in order. Thus, evergreen tree species or those with a relatively high crown density were more effective in intercepting rainfall. In particular, the annual rainfall interception by Z. serrata was the greatest because its crown area, volume, and density were higher than those of the other species. This study pioneers in quantifying annual rainfall interception for landscape tree species in Korea. The study results can be useful for evaluating rainfall interception by landscape trees in urban greenspace design for governments and corporations.

Detection of Urban Trees Using YOLOv5 from Aerial Images (항공영상으로부터 YOLOv5를 이용한 도심수목 탐지)

  • Park, Che-Won;Jung, Hyung-Sup
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.6_2
    • /
    • pp.1633-1641
    • /
    • 2022
  • Urban population concentration and indiscriminate development are causing various environmental problems such as air pollution and heat island phenomena, and causing human resources to deteriorate the damage caused by natural disasters. Urban trees have been proposed as a solution to these urban problems, and actually play an important role, such as providing environmental improvement functions. Accordingly, quantitative measurement and analysis of individual trees in urban trees are required to understand the effect of trees on the urban environment. However, the complexity and diversity of urban trees have a problem of lowering the accuracy of single tree detection. Therefore, we conducted a study to effectively detect trees in Dongjak-gu using high-resolution aerial images that enable effective detection of tree objects and You Only Look Once Version 5 (YOLOv5), which showed excellent performance in object detection. Labeling guidelines for the construction of tree AI learning datasets were generated, and box annotation was performed on Dongjak-gu trees based on this. We tested various scale YOLOv5 models from the constructed dataset and adopted the optimal model to perform more efficient urban tree detection, resulting in significant results of mean Average Precision (mAP) 0.663.

Statistical Location Estimation in Container-Grown Seedlings Based on Wireless Sensor Networks

  • Lee, Sang-Hyun;Moon, Kyung-Il
    • International Journal of Advanced Culture Technology
    • /
    • v.2 no.2
    • /
    • pp.15-18
    • /
    • 2014
  • This paper presents a sensor location decision making method respect to Container-Grown Seedlings in view of precision agriculture (PA) when sensors involved in tree container measure received signal strength (RSS) or time-of-arrival (TOA) between themselves and neighboring sensors. A small fraction of sensors in the container-grown seedlings system have a known location, whereas the remaining locations must be estimated. We derive Rao-Cramer bounds and maximum-likelihood estimators under Gaussian and log-normal models for the TOA and RSS measurements, respectively.

Audio Coder Using an Adaptive Wavelet packet Decomposition and Psychoacoustic (적응 웨이블릿 패킷을 이용한 오디오 부호화기와 심리음향 모델링)

  • 김준성
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1998.06c
    • /
    • pp.245-248
    • /
    • 1998
  • In this paper, a new variable wavelet packet decomposition audio coder, based on the time varying characteristic of the audio signals, is proposed and presents a technique to incorporate psychoacoustic models into an adaptive wave let packet scheme. The proposed filterbank improves the defect of the polyphase filterbank that could not properly represent the critical band and the defect of QMF-tree filter that need high complexity to implement. The filterbank consists of varying number of subband from 4 to 26 bands and use Daubechies 6-order wave let. The codec yields excellent quality at total bit rates of about 128kbps for monophonic CD-quality signals with an sampling frequency of 44.1kHz and reduces complexity by 19% for various bit-rates and sources with encoding and decoding process.

  • PDF

A Hybrid Data Mining Technique Using Error Pattern Modeling (오차 패턴 모델링을 이용한 Hybrid 데이터 마이닝 기법)

  • Hur, Joon;Kim, Jong-Woo
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.30 no.4
    • /
    • pp.27-43
    • /
    • 2005
  • This paper presents a new hybrid data mining technique using error pattern modeling to improve classification accuracy when the data type of a target variable is binary. The proposed method increases prediction accuracy by combining two different supervised learning methods. That is, the algorithm extracts a subset of training cases that are predicted inconsistently by both methods, and models error patterns from the cases. Based on the error pattern model, the Predictions of two different methods are merged to generate final prediction. The proposed method has been tested using practical 10 data sets. The analysis results show that the performance of proposed method is superior to the existing methods such as artificial neural networks and decision tree induction.

Recommendation of Optimal Treatment Method for Heart Disease using EM Clustering Technique

  • Jung, Yong Gyu;Kim, Hee Wan
    • International Journal of Advanced Culture Technology
    • /
    • v.5 no.3
    • /
    • pp.40-45
    • /
    • 2017
  • This data mining technique was used to extract useful information from percutaneous coronary intervention data obtained from the US public data homepage. The experiment was performed by extracting data on the area, frequency of operation, and the number of deaths. It led us to finding of meaningful correlations, patterns, and trends using various algorithms, pattern techniques, and statistical techniques. In this paper, information is obtained through efficient decision tree and cluster analysis in predicting the incidence of percutaneous coronary intervention and mortality. In the cluster analysis, EM algorithm was used to evaluate the suitability of the algorithm for each situation based on performance tests and verification of results. In the cluster analysis, the experimental data were classified using the EM algorithm, and we evaluated which models are more effective in comparing functions. Using data mining technique, it was identified which areas had effective treatment techniques and which areas were vulnerable, and we can predict the frequency and mortality of percutaneous coronary intervention for heart disease.

Hybrid Model-Based Motion Recognition for Smartphone Users

  • Shin, Beomju;Kim, Chulki;Kim, Jae Hun;Lee, Seok;Kee, Changdon;Lee, Taikjin
    • ETRI Journal
    • /
    • v.36 no.6
    • /
    • pp.1016-1022
    • /
    • 2014
  • This paper presents a hybrid model solution for user motion recognition. The use of a single classifier in motion recognition models does not guarantee a high recognition rate. To enhance the motion recognition rate, a hybrid model consisting of decision trees and artificial neural networks is proposed. We define six user motions commonly performed in an indoor environment. To demonstrate the performance of the proposed model, we conduct a real field test with ten subjects (five males and five females). Experimental results show that the proposed model provides a more accurate recognition rate compared to that of other single classifiers.