• 제목/요약/키워드: decision trees

검색결과 300건 처리시간 0.024초

Stream-based Biomedical Classification Algorithms for Analyzing Biosignals

  • Fong, Simon;Hang, Yang;Mohammed, Sabah;Fiaidhi, Jinan
    • Journal of Information Processing Systems
    • /
    • 제7권4호
    • /
    • pp.717-732
    • /
    • 2011
  • Classification in biomedical applications is an important task that predicts or classifies an outcome based on a given set of input variables such as diagnostic tests or the symptoms of a patient. Traditionally the classification algorithms would have to digest a stationary set of historical data in order to train up a decision-tree model and the learned model could then be used for testing new samples. However, a new breed of classification called stream-based classification can handle continuous data streams, which are ever evolving, unbound, and unstructured, for instance--biosignal live feeds. These emerging algorithms can potentially be used for real-time classification over biosignal data streams like EEG and ECG, etc. This paper presents a pioneer effort that studies the feasibility of classification algorithms for analyzing biosignals in the forms of infinite data streams. First, a performance comparison is made between traditional and stream-based classification. The results show that accuracy declines intermittently for traditional classification due to the requirement of model re-learning as new data arrives. Second, we show by a simulation that biosignal data streams can be processed with a satisfactory level of performance in terms of accuracy, memory requirement, and speed, by using a collection of stream-mining algorithms called Optimized Very Fast Decision Trees. The algorithms can effectively serve as a corner-stone technology for real-time classification in future biomedical applications.

Insights gained from applying negate-down during quantification for seismic probabilistic safety assessment

  • Kim, Ji Suk;Kim, Man Cheol
    • Nuclear Engineering and Technology
    • /
    • 제54권8호
    • /
    • pp.2933-2940
    • /
    • 2022
  • Approximations such as the delete-term approximation, rare event approximation, and minimal cutset upper bound (MCUB) need to be prudently applied for the quantification of a seismic probabilistic safety assessment (PSA) model. Important characteristics of seismic PSA models indicate that preserving the success branches in a primary seismic event tree is necessary. Based on the authors' experience in modeling and quantifying plant-level seismic PSA models, the effects of applying negate-down to the success branches in primary seismic event trees on the quantification results are summarized along with the following three insights gained: (1) there are two competing effects on the MCUB-based quantification results: one tending to increase and the other tending to decrease; (2) the binary decision diagram does not always provide exact quantification results; and (3) it is identified when the exact results will be obtained, and which combination provides more conservative results compared to the others. Complicated interactions occur in Boolean variable manipulation, approximation, and the quantification of a seismic PSA model. The insights presented herein can assist PSA analysts to better understand the important theoretical principles associated with the quantification of seismic PSA models.

The Role of Data Technologies with Machine Learning Approaches in Makkah Religious Seasons

  • Waleed Al Shehri
    • International Journal of Computer Science & Network Security
    • /
    • 제23권8호
    • /
    • pp.26-32
    • /
    • 2023
  • Hajj is a fundamental pillar of Islam that all Muslims must perform at least once in their lives. However, Umrah can be performed several times yearly, depending on people's abilities. Every year, Muslims from all over the world travel to Saudi Arabia to perform Hajj. Hajj and Umrah pilgrims face multiple issues due to the large volume of people at the same time and place during the event. Therefore, a system is needed to facilitate the people's smooth execution of Hajj and Umrah procedures. Multiple devices are already installed in Makkah, but it would be better to suggest the data architectures with the help of machine learning approaches. The proposed system analyzes the services provided to the pilgrims regarding gender, location, and foreign pilgrims. The proposed system addressed the research problem of analyzing the Hajj pilgrim dataset most effectively. In addition, Visualizations of the proposed method showed the system's performance using data architectures. Machine learning algorithms classify whether male pilgrims are more significant than female pilgrims. Several algorithms were proposed to classify the data, including logistic regression, Naive Bayes, K-nearest neighbors, decision trees, random forests, and XGBoost. The decision tree accuracy value was 62.83%, whereas K-nearest Neighbors had 62.86%; other classifiers have lower accuracy than these. The open-source dataset was analyzed using different data architectures to store the data, and then machine learning approaches were used to classify the dataset.

다중 에이전트 기반의 고대 국가 형성 시뮬레이션 (The Multi-Agent Simulation of Archaic State Formation)

  • S. Kim;A. Lazar;R.G. Reynolds
    • 한국시뮬레이션학회:학술대회논문집
    • /
    • 한국시뮬레이션학회 2003년도 춘계학술대회논문집
    • /
    • pp.91-100
    • /
    • 2003
  • In this paper we investigate the role that warfare played In the formation of the network of alliances between sites that are associated with the formation of the state in the Valley of Oaxaca, Mexico. A model of state formation proposed by Marcos and Flannery (1996) is used as the basis for an agent-based simulation model. Agents reside in sites and their actions are constrained by knowledge extracted from the Oaxaca Surface Archaeological Survey (Kowalewski 1989). The simulation is run with two different sets of constraint rules for the agents. The first set is based upon the raw data collected in the surface survey. This represents a total of 79 sites and constitutes a minimal level of warfare (raiding) in the Valley. The other site represents the generalization of these constraints to sites with similar locational characteristics. This set corresponds to 987 sites and represents a much more active role for warfare in the Valley. The rules were produced by a data mining technique, Decision Trees, guided by Genetic Algorithms. Simulations were run using the two different rule sets and compared with each other and the archaeological data for the Valley. The results strongly suggest that warfare was a necessary process in the aggregations of resources needed to support the emergence of the state in the Valley.

  • PDF

우울증 대상자의 정신 상담 경험 여부 예측 모형 (A Prediction Model for Psychiatric Counseling for Depression among Subjects with Depressive Symptoms)

  • 한명희
    • 한국보건간호학회지
    • /
    • 제37권1호
    • /
    • pp.125-135
    • /
    • 2023
  • Purpose: The number of patients suffering from depression is rapidly increasing worldwide, and by 2030, it is expected to pose a severe social and economic burden. Reports suggest that approximately 30% of subjects with symptoms of depression do not attempt treatment. Therefore, predicting the characteristics of subjects with depressive symptoms who have not even attempted counseling treatment is essential to increase the participation rate for such treatment. This study intends to predict the participation rates for psychological counseling treatment for depression among subjects with depressive symptoms. Methods: This study used data from the 2021 Korea Community Health Survey (KCHS). Data analysis was carried out using a decision tree to design a model that predicted participation in psychological counseling for depression. Results: The results showed that subjects aged 65 to 74 had difficulty understanding the explanations of medical staff even though they did not have cognitive impairment. Only 11.1% of this group received psychological counseling, which was the lowest rate among the various age groups. Among the subjects, 62.4% of those aged 19-44 or 45-64, who had suicidal thoughts and attempted suicide, received psychological counseling and this was the highest rate among the age groups surveyed. Conclusion: The identification of people showing depressive symptoms is crucial for encouraging them to undertake treatment. Also, proper depression-oriented medical services should be developed and implemented for people with depressive symptoms who exhibit a blind spot towards attempting treatment.

AI Comparative Analysis of Trade and Consumption Patterns in Korea and China

  • Chang Hwan Choi;Thi Thanh Tuyen Nguyen;PengYan Wang
    • Journal of Korea Trade
    • /
    • 제27권1호
    • /
    • pp.119-138
    • /
    • 2023
  • Purpose - This research is to empirically explore the differences in apparel consumption among male and female teenagers and college students in Korea and China. By conducting a survey to understand customers' needs and behaviors, fashion businesses will be able to improve their customer satisfaction and avoid redundancy, inventory, and the waste of resources, effort and money. Design/methodology - The research design considers the consumption patterns of male and female high school and college students in Korea and China. To analyze the data, the study employs decision trees, a type of machine learning algorithm. A decision tree model was developed to examine the relationship between the explanatory and response variables, which can be either quantitative or qualitative in nature. Findings - The main findings of this study indicate that there are differences in shopping behavior among different customer segments. The results show that men have a simpler shopping behavior compared to women. Additionally, cultural factors and the difference in fashion needs between students and non-students have a significant impact on the shopping choices of Chinese and Korean individuals. Originality/value - Existing studies often assume that the shopping behavior of high school and university students is similar and that there are no significant differences in clothing purchases between men and women across countries. The results provide valuable insights into the unique shopping behavior of different customer segments, and can inform fashion businesses in their efforts to meet the needs of their customers.

조경수목의 효율적 관리를 위한 프로그램 개발에 관한 연구 - 관리대장(Tree Inventory) 개발을 중심으로 - (Study on Developing Program for Efficient Landscape Woody Plants Management - Mainly Focused on the Development of a Tree Inventory System -)

  • 조영환;곽행구
    • 한국조경학회지
    • /
    • 제24권4호
    • /
    • pp.1-22
    • /
    • 1997
  • This paper was focused on the efficient management of landscape woody plants, and concerned itself with their important role in the urban environment. Based on the philosophy that there is nothing that can be done without an inventory, the purpose of this study was to develop an inventory system and iris proper application to a site for establishing a management plan Two different approaches were used, The first was to make a newly structured inventory system through collecting, analyzing, and evaluating various types of inventories used in Korea, the U. S. A., and Japan. The second approach was to apply a newly designed inventory system to the case study area. using GIS 'as a tool of spacial analysis and statistics for making decisions. The results could be summarized as follows; 1. In Korea, most of the Landscape Woozy Plants Inventories had datas which represented possession of trees, and only the work which they had done according to their traditional ways, There was no data related to the conditions, management needs, and site conditions of individual trees, This is essential information for organizing an inventory system . 2. There needs to be data which is balanced, containing tree characteristics and site characteristics. Through such information the management needs could be adjusted properly. The inventory list described in this paper was determined by botanical identity, placement condition, condition of tree, and types of work for maintaining as well as improving the condition of each tree One of the most important things was to determine the location data of each tree so as to compare data with other trees. The data gained from the field survey still had some problems because of lack of scientific method for supporting objective views, and because of actual situations, especially in the field of evaluating site conditions and management needs. All data should be revised to fit a computer data management system , if possible 3. The GIS(Geographic Information System) application showed good performance in handling inventory data for decision making. All the data used for the GIS application was divided into location and non-spatial data. Using the location data, it was easy to find the exact location of each tree on the monitor and on the maps generated by the computer even in the actual managed trite, along with various attribute data. Therefore it could be said that the entire management plan should start from data of individual trees with their exact locations, for making concrete management goals through actual budget planning.

  • PDF

의사결정나무 분석기법을 이용한 뇌졸중 지식 취약군 규명 (Identification of Subgroups with Lower Level of Stroke Knowledge Using Decision-tree Analysis)

  • 김현경;정석희;강현철
    • 대한간호학회지
    • /
    • 제44권1호
    • /
    • pp.97-107
    • /
    • 2014
  • Purpose: This study was performed to explore levels of stroke knowledge and identify subgroups with lower levels of stroke knowledge among adults in Korea. Methods: A cross-sectional survey was used and data were collected in 2012. A national sample of 990 Koreans aged 20 to 74 years participated in this study. Knowledge of risk factors, warning signs, and first action for stroke were surveyed using face-to-face interviews. Descriptive statistics and decision tree analysis were performed using SPSS WIN 20.0 and Answer Tree 3.1. Results: Mean score for stroke risk factor knowledge was 7.7 out of 10. The least recognized risk factor was diabetes and four subgroups with lower levels of knowledge were identified. Score for knowledge of stroke warning signs was 3.6 out of 6. The least recognized warning sign was sudden severe headache and six subgroups with lower levels of knowledge were identified. The first action for stroke was recognized by 65.7 percent of participants and four subgroups with lower levels of knowledge were identified. Conclusion: Multi-faceted education should be designed to improve stroke knowledge among Korean adults, particularly focusing on subgroups with lower levels of knowledge and less recognition of items in this study.

다변량 목표변수를 갖는 의사결정나무의 노드분리에 관한 연구 (A Study on the Node Split in Decision Tree with Multivariate Target Variables)

  • 김성준
    • 한국지능시스템학회논문지
    • /
    • 제13권4호
    • /
    • pp.386-390
    • /
    • 2003
  • 데이터마이닝은 많은 양의 데이터로부터 의사결정에 유용한 패턴을 발견하는 과정으로서 최근 경영 및 공학 분야의 폭넓은 영역에서 많은 관심을 모으고 있다. 어떤 그룹을 여러 하위그룹으로 분류해내는 일은 데이터마이닝의 주요 내용 중 하나이다. 의사결정나무로 알려진 트리기반 기법은 그러한 분류모형을 수립하는 데 효율적인 방안을 제공한다 트리학습에 있어서 우선적인 관건은 목표변수에 의해 측정되는 노드불순도를 최소화하는 것이다. 하지만 공정관측, 마케팅과학, 임상분석 등과 같은 문제에서는 여러 목표변수를 동시에 고려해야 하는 상황이 쉽게 등장하는 데, 본 논문의 목적은 이처럼 다변량 목표변수를 갖는 데이터셋에서 활용할 수 있는 노드불순도 측정방안을 제시하는 데 있다. 아울러 수치 예를 이용하여 적용결과에 대해 논의한다.

다중외적연관성규칙을 이용한 불필요한 입력변수 제거에 관한 연구 (A study on removal of unnecessary input variables using multiple external association rule)

  • 조광현;박희창
    • Journal of the Korean Data and Information Science Society
    • /
    • 제22권5호
    • /
    • pp.877-884
    • /
    • 2011
  • 의사결정나무는 데이터마이닝의 대표적인 알고리즘으로서, 의사결정 규칙을 도표화하여 관심대상이 되는 집단을 몇 개의 소집단으로 분류하거나 예측을 수행하는 방법이다. 일반적으로 의사결정나무의 모형 생성 시, 입력 변수의 수가 많을 경우 생성된 의사결정모형은 복잡한 형태가 될 수 있고, 모형 탐색 및 분석에 있어 어려움을 겪기도 한다. 이때 입력변수들 간의 내재적인 관련성은 없으나, 외적 변수에 의하여 각 변수가 우연히 어떤 다른 변수와 연결됨으로써 관련성이 있는 것으로 나타나는 것을 종종 볼 수 있다. 이에 본 논문에서는 의사결정나무 생성 시, 입력 변수에 대한 외적 관계를 파악할 수 있는 다중외적연관성규칙을 이용하여 의사결정나무 생성에 불필요한 입력변수를 제거하는 방법을 제시하고 그 효율성을 파악하기 위하여 실제 자료에 적용하고자 한다.