• Title/Summary/Keyword: Multi-class Classification

Search Result 220, Processing Time 0.024 seconds

Research on ITB Contract Terms Classification Model for Risk Management in EPC Projects: Deep Learning-Based PLM Ensemble Techniques (EPC 프로젝트의 위험 관리를 위한 ITB 문서 조항 분류 모델 연구: 딥러닝 기반 PLM 앙상블 기법 활용)

  • Hyunsang Lee;Wonseok Lee;Bogeun Jo;Heejun Lee;Sangjin Oh;Sangwoo You;Maru Nam;Hyunsik Lee
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.11
    • /
    • pp.471-480
    • /
    • 2023
  • The Korean construction order volume in South Korea grew significantly from 91.3 trillion won in public orders in 2013 to a total of 212 trillion won in 2021, particularly in the private sector. As the size of the domestic and overseas markets grew, the scale and complexity of EPC (Engineering, Procurement, Construction) projects increased, and risk management of project management and ITB (Invitation to Bid) documents became a critical issue. The time granted to actual construction companies in the bidding process following the EPC project award is not only limited, but also extremely challenging to review all the risk terms in the ITB document due to manpower and cost issues. Previous research attempted to categorize the risk terms in EPC contract documents and detect them based on AI, but there were limitations to practical use due to problems related to data, such as the limit of labeled data utilization and class imbalance. Therefore, this study aims to develop an AI model that can categorize the contract terms based on the FIDIC Yellow 2017(Federation Internationale Des Ingenieurs-Conseils Contract terms) standard in detail, rather than defining and classifying risk terms like previous research. A multi-text classification function is necessary because the contract terms that need to be reviewed in detail may vary depending on the scale and type of the project. To enhance the performance of the multi-text classification model, we developed the ELECTRA PLM (Pre-trained Language Model) capable of efficiently learning the context of text data from the pre-training stage, and conducted a four-step experiment to validate the performance of the model. As a result, the ensemble version of the self-developed ITB-ELECTRA model and Legal-BERT achieved the best performance with a weighted average F1-Score of 76% in the classification of 57 contract terms.

The Basic Data Analysis of Lupus Nephritis in Children (소아 루프스 신염에 대한 기초 조사)

  • Min Jae Hong;Paek Kyung Hoon;Park Kyung Mi;Kim Jung Sue;Ha Il Soo;Cheong Hae Il;Kim Joong Gon;Choi Yong
    • Childhood Kidney Diseases
    • /
    • v.3 no.1
    • /
    • pp.80-87
    • /
    • 1999
  • Purposes : Renal involvement is a potentially serious complication of systemic lupus erythematosus (SLE). There have been only few studies of lupus nephritis in pediatric age. In this study, the clinical manifestations, pathologic findings, response to treatment, and clinical course of lupus nephritis in children were analyzed. And the results will provide basic data for future nation-wide prospective multi-center study. Methods . The medical records of 46 children clinically and pathologically diagnosed to have lupus nephritis at Seoul National University Children's Hospital during 1986 to 1997 were analyzed retrospectively. Results : 1) The median age of diagnosis of lupus nephritis was 12.8 years ($2\;years\~\;15year$ 8months), and the sex ratio was 1:2.5. 2) FANA($85.7\%$), anti-ds-DNA antibody ($78.0\%$), and malar rash ($60.8\%$) were the most common findings among the classification criteria by ARA Decreased C3 was detected in $88.9\%$ of patients. 3) Hematuria ($87.0\%$) was the most common renal symptom, and WHO class IV lupus nephritis was identified in 41 cases by renal biopsy. 4) In most of patients, the disease activity was controlled relatively well with a single or combined therapy of prednisolone, azathioprine, or cyclophosphamide. The response revealed no difference according to the mode of treatment. 5) Infection, especially of Varicella-Zoster virus and candida, was the most common complication during the disease course. Conclusion : The renal involvement was noted in $87.0\%$ of childhood SLE, and $89.1\%$ of renal lesions was WHO class IV lupus nephritis known to associated with poor long-term prognosis. So, aggressive treatment using immunosuppressants in the early disease course may be helpful to increase long-term prognosis of lupus nephritis. A prospective multi-center study is necessary to analyze the therapeutic efficacy of various treatment modalities.

  • PDF

Research on Classification of Human Emotions Using EEG Signal (뇌파신호를 이용한 감정분류 연구)

  • Zubair, Muhammad;Kim, Jinsul;Yoon, Changwoo
    • Journal of Digital Contents Society
    • /
    • v.19 no.4
    • /
    • pp.821-827
    • /
    • 2018
  • Affective computing has gained increasing interest in the recent years with the development of potential applications in Human computer interaction (HCI) and healthcare. Although momentous research has been done on human emotion recognition, however, in comparison to speech and facial expression less attention has been paid to physiological signals. In this paper, Electroencephalogram (EEG) signals from different brain regions were investigated using modified wavelet energy features. For minimization of redundancy and maximization of relevancy among features, mRMR algorithm was deployed significantly. EEG recordings of a publically available "DEAP" database have been used to classify four classes of emotions with Multi class Support Vector Machine. The proposed approach shows significant performance compared to existing algorithms.

Genetic Association Analysis of Fasting and 1- and 2-Hour Glucose Tolerance Test Data Using a Generalized Index of Dissimilarity Measure for the Korean Population

  • Yee, Jaeyong;Kim, Yongkang;Park, Taesung;Park, Mira
    • Genomics & Informatics
    • /
    • v.14 no.4
    • /
    • pp.181-186
    • /
    • 2016
  • Glucose tolerance tests have been devised to determine the speed of blood glucose clearance. Diabetes is often tested with the standard oral glucose tolerance test (OGTT), along with fasting glucose level. However, no single test may be sufficient for the diagnosis, and the World Health Organization (WHO)/International Diabetes Federation (IDF) has suggested composite criteria. Accordingly, a single multi-class trait was constructed with three of the fasting phenotypes and 1- and 2-hour OGTT phenotypes from the Korean Association Resource (KARE) project, and the genetic association was investigated. All of the 18 possible combinations made out of the 3 sets of classification for the individual phenotypes were taken into our analysis. These were possible due to a method that was recently developed by us for estimating genomic associations using a generalized index of dissimilarity. Eight single-nucleotide polymorphisms (SNPs) that were found to have the strongest main effect are reported with the corresponding genes. Four of them conform to previous reports, located in the CDKAL1 gene, while the other 4 SNPs are new findings. Two-order interacting SNP pairs of are also presented. One pair (rs2328549 and rs6486740) has a prominent association, where the two single-nucleotide polymorphism locations are CDKAL1 and GLT1D1. The latter has not been found to have a strong main effect. New findings may result from the proper construction and analysis of a composite trait.

A Deep Learning Approach with Stacking Architecture to Identify Botnet Traffic

  • Kang, Koohong
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.12
    • /
    • pp.123-132
    • /
    • 2021
  • Malicious activities of Botnets are responsible for huge financial losses to Internet Service Providers, companies, governments and even home users. In this paper, we try to confirm the possibility of detecting botnet traffic by applying the deep learning model Convolutional Neural Network (CNN) using the CTU-13 botnet traffic dataset. In particular, we classify three classes, such as the C&C traffic between bots and C&C servers to detect C&C servers, traffic generated by bots other than C&C communication to detect bots, and normal traffic. Performance metrics were presented by accuracy, precision, recall, and F1 score on classifying both known and unknown botnet traffic. Moreover, we propose a stackable botnet detection system that can load modules for each botnet type considering scalability and operability on the real field.

Studying Life Zone Determination and Classification of South Korea for Providing and Operating Living SOC Facilities in the Post-COVID-19 Era (코로나-19 이후 시대에 생활SOC 시설의 설치·운영을 위한 우리나라 생활권의 설정과 유형 구분 연구)

  • Heejae Kim;Geunyoung Kim
    • Journal of the Society of Disaster Information
    • /
    • v.20 no.2
    • /
    • pp.448-461
    • /
    • 2024
  • Purpose: The purpose of this study is to establish a life zone class suitable for Korean characteristics in the post-COVID-19 era and to classify the types for the installation and operation of living SOC facilities. Method: The concept of the life zone was established through policies and previous studies related to the life zone, and data in various fields such as population, employment, transportation, economy, and education were classified using the z-score technique. Result: Korea's life zones can be classified into metropolitan life zones, regional life zones, urban life zones, village life zones, and neighborhood life zones, and depending on their roles, they can be classified into central life zones, workplace-residential balanced life zones, residential life zones, industrial life zones, and low-density life zones. Conclusion: The results of this study show that proper life zone establishment and proper living SOC supply can prevent the decline of underdeveloped areas and contribute to balanced regional development

An Study on the Correlation between Sound Characteristics and Sasang Constitution by CSL (CSL을 통한 음향특성과 사상체질간의 상관성 연구)

  • Shin, Mi-ran;Kim, Dal-lae
    • Journal of Sasang Constitutional Medicine
    • /
    • v.11 no.1
    • /
    • pp.137-157
    • /
    • 1999
  • The purpose of this study is to help classifying Sasang Constitution through correlation with sound characteristic. This study was done it under the suppose that Sasang Constitution has correlation with sound spectrogram. The following result were obtained about correlation between sound spectrogram and Sasang Constitution by comparison and analysis 1. Soeumin answered his voice low tone, smooth and quiet in the survey. Soyangin answered his voice high, clear, fast and speaking random. Taeumin answered his voice low, thick and muddy. 2. Taeyangin was significantly slow compared with the others in the time of reading composition. Taeyangin was significantly slow compared with the others in Formant frequency 1. Taeyangin was significantly discriminated from Soeumin in Formant frequency 5. Taeyangin was significantly low compared with the others in Bandwidth 2. Soeumln was significantly low compared with Taeyangin in Pitch Maximum and Pitch Maximum-Pitch Minimum. Taeyangin was significantly high compared with the others in Energy mean. 3. In list of specification, the discrimination rate was higher than that by lists of 13 in the results of Multi-dimensional 4-class minimum-distance. The discrimination rate of three disposition except Soyangin was higher than that of four disposition in the results of One way ANOVA and Analysis of dis crimination in SPSS/PC+. In CART, the estimate rate of Sasang Constitution discrimination was higher than any other method. It is considered that there is a correlation between sound spectrogram and Sasang constitution according to the results. And method of Sasang constitution classification through sound spectrogram analysis can be one method as assistant for the objectification of Sasang constitution classification.

  • PDF

The Optimization of Hybrid BCI Systems based on Blind Source Separation in Single Channel (단일 채널에서 블라인드 음원분리를 통한 하이브리드 BCI시스템 최적화)

  • Yang, Da-Lin;Nguyen, Trung-Hau;Kim, Jong-Jin;Chung, Wan-Young
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.19 no.1
    • /
    • pp.7-13
    • /
    • 2018
  • In the current study, we proposed an optimized brain-computer interface (BCI) which employed blind source separation (BBS) approach to remove noises. Thus motor imagery (MI) signal and steady state visual evoked potential (SSVEP) signal were easily to be detected due to enhancement in signal-to-noise ratio (SNR). Moreover, a combination between MI and SSVEP which is typically can increase the number of commands being generated in the current BCI. To reduce the computational time as well as to bring the BCI closer to real-world applications, the current system utilizes a single-channel EEG signal. In addition, a convolutional neural network (CNN) was used as the multi-class classification model. We evaluated the performance in term of accuracy between a non-BBS+BCI and BBS+BCI. Results show that the accuracy of the BBS+BCI is achieved $16.15{\pm}5.12%$ higher than that in the non-BBS+BCI by using BBS than non-used on. Overall, the proposed BCI system demonstrate a feasibility to be applied for multi-dimensional control applications with a comparable accuracy.

Community Structure and Understory Vegetation Distribution Pattern of Fagus engleriana Stand in Is. Ulleung (울릉도 너도밤나무림의 군집구조와 하층식생의 분포특성)

  • Cheon, Kwang-Il;Jung, Sung-Cheol;Lee, Chang-Woo;Byeon, Jun-Gi;Joo, Sung-Hyun;You, Ju-Han;Lee, Seul-Gi;Choi, Cheol-Hyun;Park, In-Hwan
    • Journal of the Korean Society of Environmental Restoration Technology
    • /
    • v.15 no.4
    • /
    • pp.81-95
    • /
    • 2012
  • This study was intended for Fagus engleriana stand in Is. Ulleung where the disturbance of vegetation has been caused by the exploitation and the increase of tourists. For the effective conservation and management on this issue, this study was conducted provide basic data. The sixteen study sites ($20{\times}20m$) were installed in the dominant Fagus engleriana stand and the base environment and vegetation were investigated. The Fagus engleriana stand was classified into two groups, The Fagus engleriana stand was classified into two groups, community A is Fagus engleriana-Sorbus amurensis and community B is Fagus engleriana-Acer pictum subsp. Mono by cluster analysis and community A were nothing signigicant by indicator species analysis. Community B were Eight species (Tsuga sieboldii, Camellia japonica, Dystaenia takesimana ect.) significant by indicator species analysis. The diameter class of 16cm to 25cm was 53.7% in population structure of Fagus engleriana, which was the highest and showed inverse J-distribution. Species diversity index (H') of investigated woody layer group ranged from 0.99 to 2.05 and that of under layer group ranged from 1.75~2.59. According to Non-metric Multidimensional Scaling (NMS) analysis, the woody layer was divided into community A developed in the region having relatively high sand content at high altitudes and community B formed at the place having relatively high clay content at low altitudes. Then this classification was significant through Multi-Response Permutation Procedures (MRPP) analysis. The distribution of understory vegetation through Detrended Correspondence Analysis (DCA) was induced by the silt content and cover degree of vegetation layer.

An Analysis of the Locational Selection Factors of the Small- and Medium-sized Hospitals Using the AHP : Centered on the Spine and Joint Hospitals (AHP를 이용한 중·소 병원 입지선택요인 분석 : 척추·관절 병원중심으로)

  • Kim, Duck Ki;Shim, Gyo-Eon
    • The Journal of the Korea Contents Association
    • /
    • v.18 no.5
    • /
    • pp.191-214
    • /
    • 2018
  • This research empirically analyzed the selection factors and the locational selection factors of the medical service facilities according to the gradual increase of the importance of the selection factors and the locational selection factors regarding the establishments of the small- and medium-sized hospitals according to the rapid changes of the socio-economic conditions. By analyzing the priority order according to the levels of the importance of each evaluation item factor through a research related to the selection factors and the locational selection factors of the small- and medium-sized hospitals and by drawing what the important factors that have the influences on the competitiveness of the pre-existent small- and medium-sized hospitals are through the classification of the real estate locational factors and the non-locational factors, the purpose lies in utilizing them as the basic data and materials for the opening strategies of the small- and medium-sized hospitals considering the special, locational characteristics according to the important factors of the selection factors of the small- and medium-sized hospitals, regarding the medical suppliers that have been preparing, for opening the new, small- and medium-sized hospitals. Based on the results of the preceding researches and the researches on the case examples, 28 evaluation factors were arrived at in terms of the level of the medical treatment, the medical services, the accessibilities of the hospitals, the conveniences of the hospitals, and the physical environment. And, regarding the 28 detailed evaluation factors that had been collected, through the interviews with the related experts, the 5 factors of the medical level, the medical service, the expertise of the hospital, the convenience of the hospital, and the physical environment were selected as the upper class evaluation factors. And, according to each upper class, a total of 28 low-part evaluation factors were selected. Regarding the optimal evaluation factors that were selected, the optimal locational factors were selected by carrying out an AHP questionnaire survey investigation with 200 medical experts as the subjects. Regarding the AHP analysis results, similarly with the case examples of the precedent researches, the levels of the importance appeared in the order of the medical level, the medical services, the accessibility of the hospital, the physical environment, and the convenience. And the factors that were related to the facilities of a hospital appeared low. The results of this research can be applied in providing the basis for the decision-makings regarding the selections of the locations of the small- and medium-sized hospitals in the future.