• Title/Summary/Keyword: BIG TREE

Search Result 222, Processing Time 0.033 seconds

A Review of the Methodology for Sophisticated Data Classification (정교한 데이터 분류를 위한 방법론의 고찰)

  • Kim, Seung Jae;Kim, Sung Hwan
    • Journal of Integrative Natural Science
    • /
    • v.14 no.1
    • /
    • pp.27-34
    • /
    • 2021
  • 전 세계적으로 인공지능(AI)을 구현하려는 움직임이 많아지고 있다. AI구현에서는 많은 양의 데이터, 목적에 맞는 데이터의 분류 등 데이터의 중요성을 뺄 수 없다. 이러한 데이터를 생성하고 가공하는 기술에는 사물인터넷(IOT)과 빅데이터(Big-data) 분석이 있으며 4차 산업을 이끌어 가는 원동력이라 할 수 있다. 또한 이러한 기술은 국가와 개인 차원에서 많이 활용되고 있으며, 특히나 특정분야에 집결되는 데이터를 기준으로 빅데이터 분석에 활용함으로써 새로운 모델을 발견하고, 그 모델로 새로운 값을 추론하고 예측함으로써 미래비전을 제시하려는 시도가 많아지고 있는 추세이다. 데이터 분석을 통한 결론은 데이터가 가지고 있는 정보의 정확성에 따라 많은 변화를 가져올 수 있으며, 그 변화에 따라 잘못된 결과를 발생시킬 수도 있다. 이렇듯 데이터의 분석은 데이터가 가지는 정보 또는 분석 목적에 맞는 데이터 분류가 매우 중요하다는 것을 알 수 있다. 또한 빅데이터 분석결과 통계량의 신뢰성과 정교함을 얻기 위해서는 각 변수의 의미와 변수들 간의 상관관계, 다중공선성 등을 고려하여 분석해야 한다. 즉, 빅데이터 분석에 앞서 분석목적에 맞도록 데이터의 분류가 잘 이루어지도록 해야 한다. 이에 본 고찰에서는 AI기술을 구현하는 머신러닝(machine learning, ML) 기법에 속하는 분류분석(classification analysis, CA) 중 의사결정트리(decision tree, DT)기법, 랜덤포레스트(random forest, RF)기법, 선형분류분석(linear discriminant analysis, LDA), 이차선형분류분석(quadratic discriminant analysis, QDA)을 이용하여 데이터를 분류한 후 데이터의 분류정도를 평가함으로써 데이터의 분류 분석률 향상을 위한 방안을 모색하려 한다.

The Role of Data Technologies with Machine Learning Approaches in Makkah Religious Seasons

  • Waleed Al Shehri
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.8
    • /
    • pp.26-32
    • /
    • 2023
  • Hajj is a fundamental pillar of Islam that all Muslims must perform at least once in their lives. However, Umrah can be performed several times yearly, depending on people's abilities. Every year, Muslims from all over the world travel to Saudi Arabia to perform Hajj. Hajj and Umrah pilgrims face multiple issues due to the large volume of people at the same time and place during the event. Therefore, a system is needed to facilitate the people's smooth execution of Hajj and Umrah procedures. Multiple devices are already installed in Makkah, but it would be better to suggest the data architectures with the help of machine learning approaches. The proposed system analyzes the services provided to the pilgrims regarding gender, location, and foreign pilgrims. The proposed system addressed the research problem of analyzing the Hajj pilgrim dataset most effectively. In addition, Visualizations of the proposed method showed the system's performance using data architectures. Machine learning algorithms classify whether male pilgrims are more significant than female pilgrims. Several algorithms were proposed to classify the data, including logistic regression, Naive Bayes, K-nearest neighbors, decision trees, random forests, and XGBoost. The decision tree accuracy value was 62.83%, whereas K-nearest Neighbors had 62.86%; other classifiers have lower accuracy than these. The open-source dataset was analyzed using different data architectures to store the data, and then machine learning approaches were used to classify the dataset.

Design of a Multi-Platform Omok Program for Artificial Intelligence Education (인공지능 교육을 위한 멀티 플랫폼 오목 프로그램 설계)

  • Cha, Joo Hyoung;Woo, Young Woon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.10a
    • /
    • pp.530-532
    • /
    • 2021
  • This paper deals with AI education service that enables developers who have completed basic programming education to program in C/C++ language in order to learn big data and artificial intelligence. In addition, a customized development environment configuration system according to the development environment and how the user implements and tests artificial intelligence are explained. And also it has a function to check the effect on artificial intelligence through manipulation of various internal parameters. It is expected that it will be possible to develop artificial intelligence education services without language restrictions through networks in the future.

  • PDF

Vegetation Distribution Status and Change for Twenty Four Years(1986~2010) of Seunghwanglim(Forest), Wonju (원주시 성황림(城隍林) 식생분포 현황 및 24년간(1986~2010년) 변화분석)

  • Han, Bong-Ho;Choi, Jin-Woo;Noh, Tai-Hwan;Kim, Ji-Suk
    • Korean Journal of Environment and Ecology
    • /
    • v.26 no.5
    • /
    • pp.741-757
    • /
    • 2012
  • This study had targeted the Seunghwanglim of Wonju in order to understand the forest vegetation's present condition. And then we compared the change in vegetation of Seunghwanglim for 24years. It was intended to provide basic data for conservation and management. Actual vegetation as a result of investigation, a total area of $56,231m^2$ Quercus serrata forest(7.02%), Acer triflorum forest(5.71%), and Deciduous Broad-Leaved Forest and Pinus densiflora forest(6.4%) were distributed variously. Present condition of the plains forest has 34 kinds of canopy species, 65 kinds of understory species, 70 species of shrubs species, 88 species of total species. And the plains forest has 500 individuals of canopy layer, 1,102 individuals of understory layer. Mean importance percentage of the major species showed Ulmus davidiana var. japonica(15.6%), Acer triflorum(15.2%), Pinus densiflora(11.1%), Quercus serrata(9.8%). Acer triflorum diameter at Ulmus davidiana var. japonica were a relatively wide range. Results of change for 24 years, vegetation of Seunghwanglim was changed from Quercus serrata-Acer triflorum to Ulmus davidiana var. japonica-Acer triflorum. Big trees over than DBH 30cm were surveyed total 18 species, 166 individuals. Increased over than the past 63 individuals. Seunghwanglim was destroyed by reckless past. Since 1990, the outer perimeter fence was installed to control human access. After that, understory layer and shrub layer were developed. And big tree was increased. Which is considered to restore damaged ecosystems. In order to conservation and protection of Seunghwanglim, people have to management and monitor about exotic species such as Robinia pseudo-acacia, Populus tomentiglandulosa, Castanea crenata, Pueraria lobata, etc.

Predicting Crime Risky Area Using Machine Learning (머신러닝기반 범죄발생 위험지역 예측)

  • HEO, Sun-Young;KIM, Ju-Young;MOON, Tae-Heon
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.21 no.4
    • /
    • pp.64-80
    • /
    • 2018
  • In Korea, citizens can only know general information about crime. Thus it is difficult to know how much they are exposed to crime. If the police can predict the crime risky area, it will be possible to cope with the crime efficiently even though insufficient police and enforcement resources. However, there is no prediction system in Korea and the related researches are very much poor. From these backgrounds, the final goal of this study is to develop an automated crime prediction system. However, for the first step, we build a big data set which consists of local real crime information and urban physical or non-physical data. Then, we developed a crime prediction model through machine learning method. Finally, we assumed several possible scenarios and calculated the probability of crime and visualized the results in a map so as to increase the people's understanding. Among the factors affecting the crime occurrence revealed in previous and case studies, data was processed in the form of a big data for machine learning: real crime information, weather information (temperature, rainfall, wind speed, humidity, sunshine, insolation, snowfall, cloud cover) and local information (average building coverage, average floor area ratio, average building height, number of buildings, average appraised land value, average area of residential building, average number of ground floor). Among the supervised machine learning algorithms, the decision tree model, the random forest model, and the SVM model, which are known to be powerful and accurate in various fields were utilized to construct crime prevention model. As a result, decision tree model with the lowest RMSE was selected as an optimal prediction model. Based on this model, several scenarios were set for theft and violence cases which are the most frequent in the case city J, and the probability of crime was estimated by $250{\times}250m$ grid. As a result, we could find that the high crime risky area is occurring in three patterns in case city J. The probability of crime was divided into three classes and visualized in map by $250{\times}250m$ grid. Finally, we could develop a crime prediction model using machine learning algorithm and visualized the crime risky areas in a map which can recalculate the model and visualize the result simultaneously as time and urban conditions change.

The big data method for flash flood warning (돌발홍수 예보를 위한 빅데이터 분석방법)

  • Park, Dain;Yoon, Sanghoo
    • Journal of Digital Convergence
    • /
    • v.15 no.11
    • /
    • pp.245-250
    • /
    • 2017
  • Flash floods is defined as the flooding of intense rainfall over a relatively small area that flows through river and valley rapidly in short time with no advance warning. So that it can cause damage property and casuality. This study is to establish the flash-flood warning system using 38 accident data, reported from the National Disaster Information Center and Land Surface Model(TOPLATS) between 2009 and 2012. Three variables were used in the Land Surface Model: precipitation, soil moisture, and surface runoff. The three variables of 6 hours preceding flash flood were reduced to 3 factors through factor analysis. Decision tree, random forest, Naive Bayes, Support Vector Machine, and logistic regression model are considered as big data methods. The prediction performance was evaluated by comparison of Accuracy, Kappa, TP Rate, FP Rate and F-Measure. The best method was suggested based on reproducibility evaluation at the each points of flash flood occurrence and predicted count versus actual count using 4 years data.

A Study on the Development of Flight Prediction Model and Rules for Military Aircraft Using Data Mining Techniques (데이터 마이닝 기법을 활용한 군용 항공기 비행 예측모형 및 비행규칙 도출 연구)

  • Yu, Kyoung Yul;Moon, Young Joo;Jeong, Dae Yul
    • The Journal of Information Systems
    • /
    • v.31 no.3
    • /
    • pp.177-195
    • /
    • 2022
  • Purpose This paper aims to prepare a full operational readiness by establishing an optimal flight plan considering the weather conditions in order to effectively perform the mission and operation of military aircraft. This paper suggests a flight prediction model and rules by analyzing the correlation between flight implementation and cancellation according to weather conditions by using big data collected from historical flight information of military aircraft supplied by Korean manufacturers and meteorological information from the Korea Meteorological Administration. In addition, by deriving flight rules according to weather information, it was possible to discover an efficient flight schedule establishment method in consideration of weather information. Design/methodology/approach This study is an analytic study using data mining techniques based on flight historical data of 44,558 flights of military aircraft accumulated by the Republic of Korea Air Force for a total of 36 months from January 2013 to December 2015 and meteorological information provided by the Korea Meteorological Administration. Four steps were taken to develop optimal flight prediction models and to derive rules for flight implementation and cancellation. First, a total of 10 independent variables and one dependent variable were used to develop the optimal model for flight implementation according to weather condition. Second, optimal flight prediction models were derived using algorithms such as logistics regression, Adaboost, KNN, Random forest and LightGBM, which are data mining techniques. Third, we collected the opinions of military aircraft pilots who have more than 25 years experience and evaluated importance level about independent variables using Python heatmap to develop flight implementation and cancellation rules according to weather conditions. Finally, the decision tree model was constructed, and the flight rules were derived to see how the weather conditions at each airport affect the implementation and cancellation of the flight. Findings Based on historical flight information of military aircraft and weather information of flight zone. We developed flight prediction model using data mining techniques. As a result of optimal flight prediction model development for each airbase, it was confirmed that the LightGBM algorithm had the best prediction rate in terms of recall rate. Each flight rules were checked according to the weather condition, and it was confirmed that precipitation, humidity, and the total cloud had a significant effect on flight cancellation. Whereas, the effect of visibility was found to be relatively insignificant. When a flight schedule was established, the rules will provide some insight to decide flight training more systematically and effectively.

Determine Optimal Timing for Out-Licensing of New Drugs in the Aspect of Biotech (신약의 기술이전 최적시기 결정 문제 - 바이오텍의 측면에서)

  • Na, Byungsoo;Kim, Jaeyoung
    • Knowledge Management Research
    • /
    • v.21 no.3
    • /
    • pp.105-121
    • /
    • 2020
  • With regard to the development of new drugs, what is most important for a Korean Biotech, where no global sales network has been established, is decision-making related to out-licensing of new drugs. The probability of success for each clinical phase is different, and the licensing amount and its royalty vary depending on which clinical phase the licensing contract is made. Due to the nature of such a licensing contract and Biotech's weak financial status, it is a very important decision-making issue for a Biotech to determine when to license out to a Big Pharma. This study defined a model called 'optimal timing for out-licensing of new drugs' and the results were derived from the decision tree analysis. As a case study, we applied to a Biotech in Korea, which is conducting FDA global clinical trials for a first-in-class new drug. Assuming that the market size and expected market penetration rate of the target disease are known, it has been shown that out-licensing after phase 1 or phase 2 of clinical trials is a best alternative that maximizes Biotech's profits. This study can provide a conceptual framework for the use of management science methodologies in pharmaceutical fields, thus laying the foundation for knowledge and research on out-licensing of new drugs.

A Study of the Advanced Strategy for ICT-based Public Compensation Business (ICT 기반 공익사업 보상업무 첨단화 방안 연구)

  • Seo, Myoung Bae
    • Smart Media Journal
    • /
    • v.9 no.1
    • /
    • pp.75-83
    • /
    • 2020
  • Compensation services that are indispensable during large-scale public utilities projects have been gradually increasing with the recent increase in construction, but there are no systematic compensation services due to the complicated procedures and manual work. For this reason, various problems such as construction period delays due to various complaints, corruption in compensation work, and impossible to trace the history of compensation data in the past are emerging. In this paper, in order to solve this problem, in-depth interviews and questionnaires were conducted to find out the problems of each compensation status. Based on this, 3 core technologies and 10 technical needs based on ICT were selected to improve the compensation work by deriving STEEP analysis and Issue Tree. The three core technologies are big data-based decision-making and prediction technology, advanced measurement technology, and open cloud-based compensation platform technology. In order to introduce the derived technologies to the institutions in charge of compensation, the possibility of technology diffusion by project operators was suggested based on the results of the current status of informatization by institution. Based on the core technology derived from this paper, it is necessary to make a prototype that can be advanced in compensation work and apply it to each institution and analyze the effect.

Security Requirements Analysis on IP Camera via Threat Modeling and Common Criteria (보안위협모델링과 국제공통평가기준을 이용한 IP Camera 보안요구사항 분석)

  • Park, Jisoo;Kim, Seungjoo
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.6 no.3
    • /
    • pp.121-134
    • /
    • 2017
  • With rapid increasing the development and use of IoT Devices, requirements for safe IoT devices and services such as reliability, security are also increasing. In Security engineering, SDLC (Secure Development Life Cycle) is applied to make the trustworthy system. Secure Development Life Cycle has 4 big steps, Security requirements, Design, Implementation and Operation and each step has own goals and activities. Deriving security requirements, the first step of SDLC, must be accurate and objective because it affect the rest of the SDLC. For accurate and objective security requirements, Threat modeling is used. And the results of the threat modeling can satisfy the completeness of scope of analysis and the traceability of threats. In many countries, academic and IT company, a lot of researches about drawing security requirements systematically are being done. But in domestic, awareness and researches about deriving security requirements systematically are lacking. So in this paper, I described about method and process to drawing security requirements systematically by using threat modeling including DFD, STRIDE, Attack Library and Attack Tree. And also security requirements are described via Common Criteria for delivering objective meaning and broad use of them.