• Title/Summary/Keyword: multiple classification analysis

Ensemble Model using Multiple Profiles for Analytical Classification of Threat Intelligence (보안 인텔리전트 유형 분류를 위한 다중 프로파일링 앙상블 모델)

  • Kim, Young Soo
    • The Journal of the Korea Contents Association / v.17 no.3 / pp.231-237 / 2017
  • With the advent of big data, threat intelligence collected from cyber incident sharing systems and security events collected from Security Information & Event Management systems are analyzed to cope with rapidly spreading malicious code. Analytical classification of threat intelligence in cyber incidents requires a variety of cyber observable features. It is therefore necessary to improve the accuracy of similarity-based classification by using multiple profiles, each grouping cyber observables that share the same features. We propose a multi-profile ensemble model that performs similarity analysis on the cyber incidents in threat intelligence, based on both attack types and cyber observables, to enhance classification accuracy. We see potential for improving cyber incident analysis systems through this gain in accuracy. Implementing the suggested technique in a computer network offers the ability to classify and detect similar cyber incidents that other mechanisms do not detect.
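
The abstract does not say which similarity measure or combination rule the model uses. As a rough illustration only, the sketch below assumes Jaccard similarity within each profile (attack type, IP addresses, domains) and an unweighted average across profiles. The profile names, the sample incidents, and both function names are hypothetical.

```python
def jaccard(a, b):
    """Jaccard similarity of two collections; 0.0 when both are empty."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def ensemble_similarity(x, y):
    """Average the per-profile Jaccard similarities of two incidents.

    Each incident is a dict mapping a profile name to a set of values.
    Equal weighting of profiles is an assumption, not the paper's rule.
    """
    profiles = sorted(set(x) | set(y))
    scores = {p: jaccard(x.get(p, ()), y.get(p, ())) for p in profiles}
    return sum(scores.values()) / len(scores), scores


# Hypothetical incidents (documentation-range IPs and reserved domains).
incident_a = {
    "attack_type": {"phishing"},
    "ip": {"198.51.100.7", "203.0.113.9"},
    "domain": {"example.test"},
}
incident_b = {
    "attack_type": {"phishing"},
    "ip": {"198.51.100.7"},
    "domain": {"example.org"},
}

overall, per_profile = ensemble_similarity(incident_a, incident_b)
print(overall, per_profile)
```

Scoring each profile separately shows which kind of observable drives the match, which a single pooled set of observables would hide.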

Selecting the optimal threshold based on impurity index in imbalanced classification (불균형 자료에서 불순도 지수를 활용한 분류 임계값 선택)

  • Jang, Shuin;Yeo, In-Kwon
    • The Korean Journal of Applied Statistics / v.34 no.5 / pp.711-721 / 2021
  • In this paper, we propose a method of adjusting thresholds using impurity indices in classification analysis on imbalanced data. Suppose that, for imbalanced binary data, the minority category is Positive and the majority category is Negative. When categories are assigned using the commonly used threshold of 0.5, specificity tends to be high on imbalanced data while sensitivity is relatively low. Increasing sensitivity matters when correct classification of objects in the minority category is relatively important. We explore how to increase sensitivity by adjusting the threshold. Existing studies have adjusted thresholds based on measures such as G-Mean and F1-score; in this paper, we propose a method that selects optimal thresholds using the chi-square statistic of CHAID, the Gini index of CART, and the entropy of C4.5. We also describe how to obtain a unique value when multiple optimal thresholds are found. Empirical analysis uses classification performance metrics to show the improvement over results based on the 0.5 threshold.
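
A minimal sketch of the Gini-based variant, assuming the threshold is chosen to minimise the weighted Gini impurity of the two groups formed by splitting the predicted probabilities at that threshold, as a CART split would. The abstract does not give the exact criterion or the tie-breaking rule, so the function names, the candidate grid, and the toy data are hypothetical, and ties are simply returned as a list.

```python
def gini(labels):
    """Gini impurity of a list of 0/1 labels (0.0 for an empty list)."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)


def weighted_gini(probs, labels, threshold):
    """Size-weighted Gini impurity of the groups below/above `threshold`."""
    below = [y for p, y in zip(probs, labels) if p < threshold]
    above = [y for p, y in zip(probs, labels) if p >= threshold]
    n = len(labels)
    return len(below) / n * gini(below) + len(above) / n * gini(above)


def best_thresholds(probs, labels, candidates):
    """Return every candidate threshold that attains the minimum impurity."""
    scored = [(weighted_gini(probs, labels, t), t) for t in candidates]
    best = min(score for score, _ in scored)
    return [t for score, t in scored if abs(score - best) < 1e-12]


def sensitivity(probs, labels, threshold):
    """Share of true positives predicted Positive at `threshold`."""
    hits = sum(1 for p, y in zip(probs, labels) if y == 1 and p >= threshold)
    return hits / sum(labels)


# Toy imbalanced data: 3 positives out of 10 (hypothetical scores).
probs = [0.05, 0.10, 0.15, 0.20, 0.22, 0.30, 0.35, 0.40, 0.45, 0.60]
labels = [0, 0, 0, 0, 0, 0, 1, 0, 1, 1]

chosen = best_thresholds(probs, labels, candidates=[0.33, 0.42, 0.50])
print(chosen)
print(sensitivity(probs, labels, 0.50), sensitivity(probs, labels, chosen[0]))
```

On this toy data the selected threshold falls below 0.5, so more of the minority class is labelled Positive than under the default cut-off.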

Multi-Label Classification Approach to Effective Aspect-Mining (효과적인 애스팩트 마이닝을 위한 다중 레이블 분류접근법)

  • Jong Yoon Won;Kun Chang Lee
    • Information Systems Review / v.22 no.3 / pp.81-97 / 2020
  • Recent work in sentiment analysis has focused on single-label classification approaches. However, because a review comment by one person usually covers several topics or aspects, it is better to classify the sentiment for each aspect separately. This paper has two purposes. First, because one sentence can contain several aspects, aspect mining is performed to classify the emotion associated with each aspect. Second, we apply a multi-label classification method to analyze two or more dependent variables (output values) at once. To validate the proposed approach, online review comments about musical performances were gathered from a domestic online platform, and the multi-label classification approach was applied to the dataset. The results were promising, and the potential of the proposed approach is discussed.
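
To make "two or more output values at once" concrete, the sketch below represents each review as one 0/1 label per aspect and scores predictions with two common multi-label metrics, Hamming loss and subset accuracy. The aspect names, the label encoding, and the sample predictions are assumptions for illustration; the abstract does not state which aspects or metrics the paper uses.

```python
# Hypothetical aspects of a musical-performance review.
ASPECTS = ["acting", "music", "stage"]


def hamming_loss(y_true, y_pred):
    """Fraction of individual (review, aspect) labels predicted wrongly."""
    total = sum(len(row) for row in y_true)
    wrong = sum(
        t != p
        for true_row, pred_row in zip(y_true, y_pred)
        for t, p in zip(true_row, pred_row)
    )
    return wrong / total


def subset_accuracy(y_true, y_pred):
    """Fraction of reviews whose whole label vector is predicted exactly."""
    exact = sum(t == p for t, p in zip(y_true, y_pred))
    return exact / len(y_true)


# One row per review, one column per aspect
# (assumed encoding: 1 = positive sentiment toward that aspect, 0 = otherwise).
y_true = [[1, 0, 1], [0, 1, 0], [1, 1, 0], [0, 0, 1]]
y_pred = [[1, 0, 1], [0, 1, 1], [1, 1, 0], [0, 0, 0]]

print(hamming_loss(y_true, y_pred))
print(subset_accuracy(y_true, y_pred))
```

Hamming loss gives partial credit per aspect, while subset accuracy counts a review as correct only when every aspect is right, so the two can diverge sharply.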

Comparative Analysis of Land-use thematic GIS layers and Multi-resolution Image Classification Results by using LANDSAT 7 ETM+ and KOMPSAT EOC image (Landsat 7 ETM+와 KOMPSAT EOC 영상 자료를 이용한 다중 분해능 영상 분류결과와 토지이용현황 주제도 대비 분석)

  • 이기원;유영철;송무영;사공호상
    • Spatial Information Research / v.10 no.2 / pp.331-343 / 2002
  • As applications of space-borne imagery have grown in many fields, interest in integrated analysis or fusion of multiple data sources is also increasing. In this study, to investigate the applicability of multiple imagery to regional-scale applications, DN value analysis and multi-resolution classification were performed using KOMPSAT EOC imagery and Landsat 7 ETM+ image data for the Namyangju-city area, and the classified results were then compared with land-use thematic data for the same area. The results classified from multi-resolution image data show that linear-type features can be easily extracted. Furthermore, a comparative study based on multi-buffered zone analysis, or so-called distance analysis, along the main road features in the study area showed similar patterns, so multi-resolution classified images are expected to be effective for urban environment analysis.

High-Reliable Classification of Multiple Induction Motor Faults using Robust Vibration Signatures in Noisy Environments based on a LPC Analysis and an EM Algorithm (LPC 분석 기법 및 EM 알고리즘 기반 잡음 환경에 강인한 진동 특징을 이용한 고 신뢰성 유도 전동기 다중 결함 분류)

  • Kang, Myeongsu;Jang, Won-Chul;Kim, Jong-Myon
    • Journal of the Korea Society of Computer and Information / v.19 no.2 / pp.21-30 / 2014
  • The use of induction motors has recently been increasing at a variety of industrial sites, where they play a significant role. This has motivated many researchers to develop fault detection and classification systems for induction motors in order to reduce the economic damage caused by their faults. To identify induction motor faults early, this paper estimates the spectral envelope of each induction motor fault using a linear prediction coding (LPC) analysis technique and an expectation maximization (EM) algorithm. The paper then classifies induction motor faults into their corresponding categories by calculating the Mahalanobis distance from the estimated spectral envelopes and finding the minimum distance. Experimental results show that the proposed approach yields higher classification accuracy than the state-of-the-art conventional approach in both noiseless and noisy environments.
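
The minimum-distance step can be sketched as follows, assuming each fault class is summarised by a mean vector and a covariance matrix. Two-dimensional features stand in for the estimated spectral envelopes to keep the arithmetic readable; the class names, means, and covariances are hypothetical, and the LPC and EM stages are not shown.

```python
import math


def mahalanobis_2d(x, mean, cov):
    """Mahalanobis distance of 2-D point `x` from a class (mean, 2x2 cov)."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    if det == 0:
        raise ValueError("covariance matrix is singular")
    dx, dy = x[0] - mean[0], x[1] - mean[1]
    # v^T * inverse(cov) * v, with inverse(cov) = [[d, -b], [-c, a]] / det
    squared = (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det
    return math.sqrt(squared)


def classify(x, classes):
    """Return the class name with the smallest Mahalanobis distance to `x`."""
    return min(classes, key=lambda name: mahalanobis_2d(x, *classes[name]))


# Hypothetical fault classes: name -> (mean vector, covariance matrix).
classes = {
    "fault_A": ((0.0, 0.0), ((1.0, 0.0), (0.0, 1.0))),
    "fault_B": ((4.0, 4.0), ((4.0, 0.0), (0.0, 4.0))),
}

print(classify((0.5, 0.5), classes))
print(classify((1.5, 1.5), classes))
```

The point (1.5, 1.5) is closer to the mean of `fault_A` in Euclidean terms, but the wider spread of `fault_B` makes its Mahalanobis distance smaller, which is why the covariance matters.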

Application of Multi-periodic Harmonic Model for Classification of Multi-temporal Satellite Data: MODIS and GOCI Imagery

  • Jung, Myunghee;Lee, Sang-Hoon
    • Korean Journal of Remote Sensing / v.35 no.4 / pp.573-587 / 2019
  • A multi-temporal approach using remotely sensed time series data obtained over multiple years is a very useful method for monitoring land covers and land-cover changes. While spectral-based methods applied at a single point in time have limited utility because the quality of the data obtained at that time is unstable, an approach based on the temporal profile can produce more accurate results because the data are analyzed from a long-term perspective. In this study, a multi-temporal approach applying a multi-periodic harmonic model is proposed for the classification of remotely sensed data. A harmonic model characterizes the seasonal variation of a time series by four parameters: average level, frequency, phase, and amplitude. The availability of high-quality data is very important for multi-temporal analysis. A satellite image usually contains many unobserved and bad-quality data because of the observation environment and the sensing system, which impede the analysis and may produce inaccurate results. Harmonic analysis is also very useful for real-time data reconstruction. The multi-periodic harmonic model is applied to the reconstructed data to classify land covers and monitor land-cover change by tracking the temporal profiles. The proposed method is tested with MODIS and GOCI NDVI time series over the Korean Peninsula for the 5 years from 2012 to 2016. The results show that the multi-periodic harmonic model has great potential for classifying land-cover types and monitoring land-cover changes by characterizing annual temporal dynamics.

Stability Analysis of High Speed Railway Tunnel Passing Through the Abandoned Mine Area (폐광지역을 통과하는 고속철도터널의 안정성 평가)

  • 장명환;양형식;정소걸
    • Tunnel and Underground Space / v.10 no.3 / pp.395-402 / 2000
  • The influence of mined-out caves on the stability of a high speed railway tunnel was investigated through a series of geological logging and in-situ tests, and through rock mass classification using multiple regression analysis. The rock mass in this area can be classified as 'fair', and the condition of the discontinuities plays the most important role in the classification of the rock mass. The results of the FLAC analysis showed that the western part of the tunnel, located 50 m above the mine cavities, could be affected by subsidence accompanied by considerable deformation, the magnitude of which may depend on the properties of the rock mass.


Effectiveness Analysis of Helicopter Flight Simulator and Actual Flight Training: Focused on Instrument Flight Training (헬리콥터 비행 시뮬레이터와 실 비행훈련과의 효과 분석: 계기비행 훈련을 중심으로)

  • Kim, Sang-chul;Kim, Jong-min
    • Journal of the Korean Society for Aviation and Aeronautics / v.28 no.1 / pp.75-82 / 2020
  • This study compares flight simulation and actual flight among 130 army helicopter pilot trainees. First, t-tests and multiple regression analysis were used to examine the relationship between individual characteristics and performance in flight simulator and actual instrument flight training; of the six verification factors, age (a3) and service classification (a5) were significant. These effects were significant, with no difference between the flight simulator and actual flight. Second, to study the relationship across aircraft types, multiple regression analyses were run with the flight evaluation (v1) as the dependent variable against flight simulator performance (KUH: s2, UH60: s3, AH-1S: s5, UH-1H: s6) and, conversely, with the flight simulator evaluation (s1) as the dependent variable. In conclusion, the study provides statistical evidence that flight simulator training could replace actual flight training, which is expected to contribute to the direction of pilot training, budget reduction, and aviation safety.

A quantitative analysis of greenhouse gases emissions from catching swimming crab and snow crab through cross-analysis of multiple fisheries (다수 업종의 교차분석을 통한 꽃게 및 대게 어획 시 온실가스 배출량의 정량적 분석)

  • Gunho LEE;Jihoon LEE;Sua PARK;Minseo PARK
    • Journal of the Korean Society of Fisheries and Ocean Technology / v.59 no.1 / pp.19-27 / 2023
  • Greenhouse gas (GHG) emissions from all industries are emerging as a very important issue worldwide. They affect not only global warming but also the environmental competitiveness of each industry. Interest in GHG emissions has also grown in the fisheries sector since the Paris Climate Agreement in 2015. Korean industry and government have made a number of efforts to reduce GHG emissions, but efforts in the fishery sector remain insufficient compared with other fields. In particular, GHG emissions from Korean fisheries have not been investigated extensively. Studies to date have mostly dealt with GHG emissions by fishery classification. However, future research needs to evaluate GHG emission levels by species in preparation for the adoption of environmental labels and declarations (ISO 14020). The purpose of this research is to investigate the amount of GHG emitted in producing two species (swimming crab and snow crab) from various fisheries. We calculated the GHG emissions of producing these species using the life cycle assessment (LCA) method. The system boundary and input parameters for each process level are defined for the LCA. The fuel use coefficients of the fisheries for each species are also calculated by fuel type. GHG emissions from the fisheries' activities at sea are addressed. Furthermore, GHG emissions per unit weight of each species and for annual production are calculated by fishery classification. The results will help establish the carbon footprint of seafood in Korea.

Statistical Analysis for Chemical Characterization of Fall-Out Particles (강하분진의 화학적 특성파악을 위한 통계학적 해석)

  • Kim, Hyeon-Seop;Heo, Jeong-Suk;Kim, Dong-Sul
    • Journal of Korean Society for Atmospheric Environment / v.14 no.6 / pp.631-642 / 1998
  • Fall-out particles were collected with modified British deposit gauges at 35 sampling sites in the Suwon area from January to November 1996. Twenty chemical species (Al, Ba, Cd, Cr, K, Pb, Sb, Zn, Cu, Fe, Ni, V, F-, Cl-, NO3-, SO42-, Na+, NH4+, Mg2+, and Ca2+) were analyzed by AAS and IC. The purpose of this study was to qualitatively estimate the various emission sources of the fall-out particles by applying multivariate statistical techniques such as factor analysis, multiple regression analysis, and discriminant analysis. During the study, outlier sites were identified by a z-score method. Cl-, Na+, Mg2+, and SO42- were highly correlated owing to their common marine-related source. Wind speed was the most influential factor for the deposition fluxes of both the particles themselves and all the chemical species. Factor analysis qualitatively yielded 8 source patterns: marine source, soil source, oil burning source, Cr related source, tire source, Cd related source, agriculture source, and F- related source. The multiple regression analysis suggested that some chemical compounds may exist in the fall-out particles in the form of CaSO4, NaNO3, NaCl, MgCl2, (NH4)2SO4, NaF, and CaCl2. Finally, a spatial and seasonal classification study performed by discriminant analysis showed that SO42-, Ca2+, Cl-, and Fe were dominant in the spatial pattern group, whereas SO42-, Cl-, Al, and V were dominant in the seasonal pattern group.
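
The z-score screening of sampling sites can be sketched as follows. The abstract does not state the cut-off or whether the sample or population standard deviation was used, so the threshold of 2.5, the use of the sample standard deviation, and the site names and flux values are all assumptions for illustration.

```python
import statistics


def zscore_outliers(measurements, threshold=2.5):
    """Return the keys whose value lies more than `threshold` sample
    standard deviations from the mean (the threshold is an assumed cut-off)."""
    values = list(measurements.values())
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample standard deviation (n - 1)
    return [
        site
        for site, value in measurements.items()
        if abs(value - mean) / sd > threshold
    ]


# Hypothetical deposition fluxes at ten sampling sites (arbitrary units).
flux = {
    "site_01": 10, "site_02": 12, "site_03": 11, "site_04": 13, "site_05": 12,
    "site_06": 11, "site_07": 10, "site_08": 12, "site_09": 11, "site_10": 50,
}

print(zscore_outliers(flux))
```

With few sites, a single extreme value inflates the standard deviation and caps the largest attainable z-score, so a cut-off such as 3 may flag nothing; this is why the example uses a lower threshold.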
