• Title/Summary/Keyword: Decision Tree analysis


Early diagnosis of jaw osteomyelitis by easy digitalized panoramic analysis

  • Park, Moo Soung;Eo, Mi Young;Myoung, Hoon;Kim, Soung Min;Lee, Jong Ho
    • Maxillofacial Plastic and Reconstructive Surgery / v.41 / pp.6.1-6.10 / 2019
  • Background: Osteomyelitis is an intraosseous inflammatory disease characterized by progressive inflammatory osteoclasia and ossification. Quantitative analysis is increasingly being considered as an aid to interpreting osteomyelitis. The objective of this study was to diagnose osteomyelitis early on digital panoramic radiographs using basic functions of the picture archiving and communication system (PACS), a program used to display radiographic images. Methods: This study included 95 patients whose osteomyelitis was confirmed by clinical, radiologic, and pathological diagnosis over 11 years from 2008 to 2017. The patients fell into five categories: osteoradionecrosis, bisphosphonate-related osteonecrosis of the jaw (BRONJ; suppurative and sclerosing types), and bacterial osteomyelitis (suppurative and sclerosing types). The control group comprised 117 randomly sampled subjects. The photographic density in a defined area of the digital panoramic radiograph was measured and compared using "measure area rectangle," one of the basic PACS functions in INFINITT PACS® (INFINITT Healthcare, Seoul, South Korea). A conditional inference tree, one type of decision tree, was generated with the program R, and statistical analysis was performed with SPSS®. Results: In the conditional inference tree generated from the obtained data, cases where the difference in average value exceeded 54.49 and the difference in minimum value was less than 54.49 and greater than 12.81 and the difference in minimum value exceeded 39 were considered suspicious for osteomyelitis. From these results, the disease could be correctly classified with a probability of 88.1%. There was no difference in photographic density between BRONJ and bacterial osteomyelitis; therefore, the two could not be distinguished by quantitative analysis of panoramic radiographs based on existing research. Conclusions: This study demonstrates that it is feasible to measure photographic density using a basic function in PACS and to apply the data to assist in the diagnosis of osteomyelitis.

Development of a Detection Model for the Companies Designated as Administrative Issue in KOSDAQ Market (KOSDAQ 시장의 관리종목 지정 탐지 모형 개발)

  • Shin, Dong-In;Kwahk, Kee-Young
    • Journal of Intelligence and Information Systems / v.24 no.3 / pp.157-176 / 2018
  • The purpose of this research is to develop a detection model for companies designated as administrative issues in the KOSDAQ market using financial data. Administrative issue designation flags companies with a high potential for delisting and gives them time, under certain restrictions of the Korean stock market, to resolve the grounds for delisting. It acts as an alarm that tells investors and market participants which companies are likely to be delisted and warns them to invest safely. Despite this importance, there are relatively few studies on administrative issue prediction models compared with the many studies on bankruptcy prediction models. Therefore, this study develops and verifies a detection model for companies designated as administrative issues using financial data of KOSDAQ companies. Logistic regression and decision tree are proposed as the data mining models for detecting administrative issues. According to the analysis, the logistic regression model predicted designation using three variables: ROE (Earnings before tax), Cash flows/Shareholder's equity, and Asset turnover ratio. Its overall accuracy was 86% on the validation dataset. The decision tree (Classification and Regression Trees, CART) model applied classification rules using Cash flows/Total assets and ROA (Net income), and its overall accuracy reached 87%. The implications of the financial indicators selected in the logistic regression and decision tree models are as follows. First, ROE (Earnings before tax) in the logistic detection model reflects the profit and loss of the continuing business segments, excluding the revenue and expenses of discontinued businesses. A weakening of this variable therefore means that the competitiveness of the core business is weakening. If a large part of profits comes from one-off gains, the deterioration of business management is very likely to intensify further. As the ROE of a KOSDAQ company decreases significantly, the company becomes highly likely to be delisted. Second, cash flows to shareholder's equity represents the firm's ability to generate cash flow when the financial condition of subsidiaries is excluded. In other words, a weakening of the parent company's management capacity, excluding the subsidiaries' competence, can be a main reason for an increased possibility of administrative issue designation. Third, a low asset turnover ratio means that current and non-current assets are used ineffectively by the corporation, or that its asset investment is excessive. If the asset turnover ratio of a KOSDAQ-listed company decreases, its corporate activities need to be examined in detail from various perspectives, such as weakening sales or changes in inventories. Cash flows/Total assets, a variable selected by the decision tree detection model, is a key indicator of the company's cash condition and its ability to generate cash from operating activities. Cash flow indicates whether a firm can perform its main activities (maintaining its operating ability, repaying debts, paying dividends, and making new investments) without relying on external financial resources. Therefore, if the value of the variable is negative (-), the company may have serious problems in its business activities. If a company's cash flow from operating activities is smaller than its net profit, the net profit has not been converted to cash, indicating a serious problem in managing the company's trade receivables and inventory assets. It follows that as cash flows/total assets decreases, the probability of administrative issue designation and the probability of delisting increase. In summary, the logistic regression-based detection model in this study was found to be affected by the company's financial activities, including ROE (Earnings before tax), whereas the decision tree-based detection model predicts designation based on the company's cash flows.

Penalized quantile regression tree (벌점화 분위수 회귀나무모형에 대한 연구)

  • Kim, Jaeoh;Cho, HyungJun;Bang, Sungwan
    • The Korean Journal of Applied Statistics / v.29 no.7 / pp.1361-1371 / 2016
  • Quantile regression provides a variety of useful statistical information for examining how covariates influence the conditional quantile functions of a response variable. However, traditional quantile regression (which assumes a linear model) is not appropriate when the relationship between the response and the covariates is nonlinear. Variable selection is also necessary for high-dimensional data or strongly correlated covariates. In this paper, we propose a penalized quantile regression tree model. The split rule of the proposed method is based on residual analysis, which has negligible bias in selecting a split variable and a reasonable computational cost. A simulation study and real data analysis are presented to demonstrate the satisfactory performance and usefulness of the proposed method.
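
As background for the abstract above, the following is a minimal sketch of the standard check (pinball) loss that quantile regression minimizes. It does not reproduce the paper's penalized tree or its residual-based split rule; the function names `pinball_loss` and `best_constant_quantile` are hypothetical and chosen only for illustration.

```python
def pinball_loss(y_true, y_pred, tau):
    """Average check (pinball) loss for a quantile level tau in (0, 1)."""
    if not 0.0 < tau < 1.0:
        raise ValueError("tau must lie strictly between 0 and 1")
    total = 0.0
    for y, q in zip(y_true, y_pred):
        residual = y - q
        # Under-prediction (residual >= 0) costs tau per unit;
        # over-prediction costs (1 - tau) per unit.
        total += residual * tau if residual >= 0 else residual * (tau - 1.0)
    return total / len(y_true)


def best_constant_quantile(y, tau):
    """Return the observed value that minimizes the pinball loss as a constant fit."""
    candidates = sorted(set(y))
    return min(candidates, key=lambda c: pinball_loss(y, [c] * len(y), tau))


# Illustrative data (made up): with tau = 0.5 the minimizer is the median,
# so the outlier 100 does not pull the fitted value upward.
sample = [1, 2, 3, 4, 100]
median_fit = best_constant_quantile(sample, 0.5)
```

Because the loss weights positive and negative residuals asymmetrically, changing `tau` moves the fitted value toward a different quantile of the response rather than toward its mean.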

Finding a plan to improve recognition rate using classification analysis

  • Kim, SeungJae;Kim, SungHwan
    • International journal of advanced smart convergence / v.9 no.4 / pp.184-191 / 2020
  • With the emergence of the 4th Industrial Revolution, the core technologies that will lead it, such as AI (artificial intelligence), big data, and the Internet of Things (IoT), have become a focus of public attention. In particular, there are growing attempts to present future visions by applying these technologies to big data analysis of data collected in a specific field, discovering new models, and using those models to infer and predict new values. To obtain reliable and refined statistics from big data analysis, it is necessary to analyze the meaning of each variable, the correlation between variables, and multicollinearity. If the data are classified from the outset in a way that does not match the hypothesis being tested, the results will be unreliable even if the analysis itself is performed well. In other words, before big data analysis, it is necessary to ensure that the data are well classified according to the purpose of the analysis. Therefore, in this study, data are classified using a decision tree technique and a random forest technique, two classification analysis methods from machine learning, which implements AI technology. By evaluating how well the data are classified, we seek a way to improve the classification and analysis rate of the data.
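
The abstract above does not say which split criterion its decision tree or random forest uses. As a hedged illustration of how such trees commonly judge how well data are classified, the sketch below computes Gini impurity and searches one numeric feature for the threshold that lowers it most. The function names and the toy data are assumptions, not taken from the paper.

```python
def gini(labels):
    """Gini impurity of a list of class labels (0.0 means a pure node)."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = {}
    for label in labels:
        counts[label] = counts.get(label, 0) + 1
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def best_threshold(values, labels):
    """Return (threshold, weighted_gini) for the best split `value <= threshold`.

    The threshold is None when no split improves on the unsplit node.
    """
    pairs = sorted(zip(values, labels))
    n = len(pairs)
    best = (None, gini(labels))
    for i in range(1, n):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # equal values cannot be separated by a threshold
        threshold = (pairs[i - 1][0] + pairs[i][0]) / 2.0
        left = [lab for _, lab in pairs[:i]]
        right = [lab for _, lab in pairs[i:]]
        score = (len(left) * gini(left) + len(right) * gini(right)) / n
        if score < best[1]:
            best = (threshold, score)
    return best


# Toy data (made up): the two classes separate cleanly between 3 and 10.
threshold, impurity = best_threshold([1, 2, 3, 10, 11, 12], ["a", "a", "a", "b", "b", "b"])
```

A random forest repeats this kind of split search over many trees, each grown on a resampled dataset with a random subset of features, and combines their votes.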

A Study on Forecasting Risk of Gas Accident using Weather Data (기상 데이터를 활용한 가스사고위험 예보에 관한 연구)

  • Oh, Jeong Seok
    • Journal of the Korean Institute of Gas / v.22 no.5 / pp.107-113 / 2018
  • While accident data are used to raise awareness of accidents or to review similar cases, analysis of the nature of accident data and its association with the surrounding environment is very insufficient. It is therefore necessary to demonstrate the possibility of an accident in a particular region by developing analysis techniques for the related accident data. The purpose of this study is to develop an analysis model and implement a system that produces regional accident probabilities based on historical weather data and accident and reporting data. Specifically, the system is designed and developed to build models with the k-NN and decision tree algorithms, with optional user-environment variables, based on the probability relationship between weather and accidents for many regions of Korea. In the future, the models developed in this study are intended to be used to analyze and calculate the risk for narrower areas.
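
One way to read the k-NN part of the abstract above is as a neighbour-vote probability estimate. The sketch below is a minimal illustration under that assumption. The feature layout (temperature, humidity), the function name `knn_accident_probability`, and the sample records are all hypothetical and are not taken from the paper.

```python
import math


def knn_accident_probability(history, query, k=3):
    """Share of accident days among the k past days closest to `query`.

    `history` is a list of (features, label) pairs, where `features` is a
    tuple of numbers and `label` is 1 for an accident day and 0 otherwise.
    """
    if k <= 0 or k > len(history):
        raise ValueError("k must be between 1 and len(history)")
    # Rank past days by Euclidean distance to the queried weather conditions.
    by_distance = sorted(history, key=lambda row: math.dist(row[0], query))
    nearest = by_distance[:k]
    return sum(label for _, label in nearest) / k


# Hypothetical records: (temperature in deg C, relative humidity in %), accident flag.
history = [
    ((30.0, 80.0), 1),
    ((31.0, 82.0), 1),
    ((29.0, 78.0), 0),
    ((5.0, 30.0), 0),
    ((6.0, 35.0), 0),
]
hot_humid_risk = knn_accident_probability(history, (30.0, 80.0), k=3)
```

In practice the features would usually be rescaled first, since an unscaled Euclidean distance lets the variable with the largest numeric range dominate the neighbour search.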

Study on the effectiveness of english-medium class (영어강의의 효과성에 대한 연구)

  • Cho, Jang Sik
    • Journal of the Korean Data and Information Science Society / v.23 no.6 / pp.1137-1144 / 2012
  • Many universities are increasingly stressing the importance of English-medium classes in order to improve their international competitiveness and internationalization. In this paper, we compare English-medium classes with Korean classes using course evaluation scores. We also analyze the factors that affect the effectiveness of English-medium classes as measured by the course evaluation score. First, logistic regression analysis is used to examine the main effects of subject and individual characteristics. Decision tree analysis is then used to examine the interaction effects of subject and individual characteristics. The results are as follows. Grade, department category, class size, GPA, and screening method affect the effectiveness of English-medium classes. The group with the highest effectiveness consists of freshmen in humanities departments. The group with the second highest effectiveness consists of freshmen in natural science and art departments with high GPAs.

A study on integrating and discovery of semantic based knowledge model (의미 기반의 지식모델 통합과 탐색에 관한 연구)

  • Chun, Seung-Su
    • Journal of Internet Computing and Services / v.15 no.6 / pp.99-106 / 2014
  • In recent years, methods for generating and analyzing semantically meaningful knowledge models have been proposed, using natural language and formal language processing and artificial intelligence algorithms. Such semantic-based knowledge models have been used for effective decision trees and for problem solving in specific contexts. They have been based on static generation and regression analysis, trend analysis with behavioral models, and simulation support for macroeconomic forecasting, particularly in a variety of complex systems and in social network analysis. In this context, to integrate knowledge-based models, this paper proposes formal methods and algorithms for integrating topic models derived from text mining. First, a method is presented for automatically converting a keyword map derived from text mining into a knowledge map and integrating it into the semantic knowledge model. The paper then proposes an algorithm for projecting a significant topic map from the keyword map and the semantically equivalent model. The result is an integrated semantic-based knowledge model.

Analysis of Dimensionality Reduction Methods Through Epileptic EEG Feature Selection for Machine Learning in BCI (BCI에서 기계 학습을 위한 간질 뇌파 특징 선택을 통한 차원 감소 방법 분석)

  • Tong, Yang;Aliyu, Ibrahim;Lim, Chang-Gyoon
    • The Journal of the Korea institute of electronic communication sciences / v.13 no.6 / pp.1333-1342 / 2018
  • Electroencephalography (EEG) has so far been the most important and convenient method for the diagnosis and treatment of epilepsy. However, it is difficult to identify the wave characteristics of an epileptic EEG signal because it is very weak, non-stationary, and has strong background noise. In this paper, we analyse the effect of dimensionality reduction methods on epileptic EEG feature selection and classification. Three dimensionality reduction methods were investigated: Principal Component Analysis (PCA), Kernel Principal Component Analysis (KPCA), and Linear Discriminant Analysis (LDA). The performance of each method was evaluated using Support Vector Machine (SVM), Logistic Regression (LR), K-Nearest Neighbor (K-NN), Decision Tree (DT), and Random Forest (RF) classifiers. In the experiments, PCA reached its highest accuracy of 75% with SVM, LR, and K-NN. KPCA reached its best performance of 85% with SVM and K-NN, while LDA achieved 100% accuracy with K-NN. Thus, LDA dimensionality reduction is found to provide the best classification result for epileptic EEG signals.

Exploring predictors of subsequent childbirth plan for non-employed and employed mothers : The application of decision tree analysis (의사결정나무분석을 적용한 비취업모와 취업모의 후속출산계획 예측요인 탐색)

  • Lim, Yang-Mi
    • Journal of Korean Home Economics Education Association / v.27 no.4 / pp.155-172 / 2015
  • This study aimed to identify the effects of mothers' variables and present children's variables on subsequent childbirth plans and to explore predictors of subsequent childbirth plans for non-employed and employed mothers. The subjects were 1,635 mothers who participated in the Panel Study on Korean Children from 2008 to 2010 and who, after giving birth in 2008, had no subsequent children until 2010. The data were analyzed with descriptive statistics, the t test, the $\chi^2$ test, and decision tree analysis. The main results were as follows. First, mothers' child-rearing stress, child value, marital satisfaction, social support, and present children's birth order and sex influenced mothers' subsequent childbirth plans, whereas average monthly family income did not. Second, for non-employed mothers, the present child's birth order and sex and the mother's child value predicted the subsequent childbirth plan. Specifically, mothers whose present child was a first-born girl were the most likely to plan a subsequent childbirth, followed by mothers whose present child was a first-born boy and whose child value was higher. Third, for employed mothers, the present child's birth order and the mother's marital satisfaction predicted the subsequent childbirth plan. Specifically, mothers whose present child was first-born and whose marital satisfaction was higher were the most likely to plan a subsequent childbirth. Finally, the study suggested a role for Home Economics Education in raising the rate of subsequent childbirth.

Analysis of Factors for Seasonal Meat Color Characteristics in Hanwoo(Korean Cattle) Beef using Decision Tree Method (의사결정나무분석기법을 이용한 계절별 한우육의 육색 특성에 미치는 요인분석)

  • Kim, Seok-Jung;Kim, Yong-Sun;Song, Young-Han;Lee, Sung-Ki
    • Journal of Animal Science and Technology / v.44 no.5 / pp.607-616 / 2002
  • This study analyzed, by season, the effects of pH, sex, backfat thickness, ribeye area, cold carcass weight, shipping month, muscle internal temperature, average daily temperature, and average relative humidity on the meat color of slaughtered Hanwoo. The analyses focused on the individual and interaction effects of these factors on meat color. In the multiple linear regression analysis, all meat color values decreased as pH increased, and meat color values increased as backfat thickness increased. In the decision tree analysis by factor, lightness (L*) was highest for cows and steers slaughtered in spring and autumn. Redness (a*) was highest for Hanwoo slaughtered in autumn with a pH below 5.63 and an average relative humidity above 71.5%. Chroma (C*) was highest for Hanwoo slaughtered in summer and autumn with a pH below 5.60 and a backfat thickness above 8 mm. The hue angle ($h^{\circ}$) result was associated with a muscle internal temperature below 4.7 °C among Hanwoo slaughtered in spring, summer, and autumn with a pH below 5.66 and a backfat thickness above 8 mm.
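
For readers unfamiliar with the color terms in the abstract above, chroma and hue angle are conventionally derived from the CIELAB a* and b* coordinates. The sketch below shows those standard definitions. The abstract does not mention b* or say how its values were computed, so the functions and the sample numbers are illustrative assumptions only.

```python
import math


def chroma(a_star, b_star):
    """CIELAB chroma: C* = sqrt(a*^2 + b*^2)."""
    return math.hypot(a_star, b_star)


def hue_angle_degrees(a_star, b_star):
    """CIELAB hue angle in degrees, measured from the +a* axis, in [0, 360)."""
    return math.degrees(math.atan2(b_star, a_star)) % 360.0


# Illustrative values (made up, not from the paper).
example_chroma = chroma(3.0, 4.0)           # 5.0
example_hue = hue_angle_degrees(1.0, 1.0)   # approximately 45 degrees
```

Using `atan2` rather than a plain arctangent of b*/a* keeps the angle in the correct quadrant when a* is zero or negative.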