• Title/Summary/Keyword: Application Classification

Search Result 1,900, Processing Time 0.03 seconds

Using Genetic Rule-Based Classifier System for Data Mining (유전자 알고리즘을 이용한 데이터 마이닝의 분류 시스템에 관한 연구)

  • Han, Myung-Mook
    • Journal of Internet Computing and Services
    • /
    • v.1 no.1
    • /
    • pp.63-72
    • /
    • 2000
  • Data mining means a process of nontrivial extraction of hidden knowledge or potentially useful information from data in large databases. Data mining algorithm is a multi-disciplinary field of research; machine learning, statistics, and computer science all make a contribution. Different classification schemes can be used to categorize data mining methods based on the kinds of tasks to be implemented and the kinds of application classes to be utilized, and classification has been identified as an important task in the emerging field of data mining. Since classification is the basic element of human's way of thinking, it is a well-studied problem in a wide varietyof application. In this paper, we propose a classifier system based on genetic algorithm with robust property, and the proposed system is evaluated by applying it to nDmC problem related to classification task in data mining.

  • PDF

Performance Improvement of the Payload Signature based Traffic Classification System Using Application Traffic Locality (응용 트래픽의 지역성을 이용한 페이로드 시그니쳐 기반 트래픽 분석 시스템의 성능 향상)

  • Park, Jun-Sang;Yoon, Sung-Ho;Kim, Myung-Sup
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.38B no.7
    • /
    • pp.519-525
    • /
    • 2013
  • The traffic classification is a preliminary and essential step for stable network service provision and efficient network resource management. However, the payload signature-based method has a significant drawback in high-speed network environment that the processing speed is much slower than other method such as header-based and statistical methods. In this paper, We propose the server IP, Port cache-based traffic classification method using application traffic locality to improve the processing speed of traffic classification. The suggested method achieved about 10 folds improvement in processing speed and 10% improvement in completeness over the payload-based classification system.

Classification of Product Safety Management Target by RAP and Cluster Analysis for Consumer Safety (소비자안전을 위한 RAP 및 군집분석을 통한 제품안전 관리대상 유형분류 연구)

  • Suh, Jungdae
    • Journal of the Korean Society of Safety
    • /
    • v.33 no.6
    • /
    • pp.128-135
    • /
    • 2018
  • Currently, the government selects products that are likely to cause harm to consumers as safety management targets and classifies them into three types: safety certification, safety confirmation, and supplier conformity verification. In addition, the government conducts safety surveys on products in circulation or accident products, and recalls products that are of great concern to consumer risks. In this paper, we have developed RAP (Risk Assessment method based on Probability), which is a probability based product risk assessment method, for the classification of safety management type of product and safety investigation, and have shown an application example. In this process, information is used for the CISS (Consumer Injury Surveillance System) of the Korean Consumer Agency. In addition, we apply the cluster analysis to classify the current supervised children products into three groups. Then, we confirm the effectiveness of RAP by comparing the result of RAP application, cluster analysis result and current safety management classification type. Also, we recognize the need to review the current safety management classification criteria for classifying products into three types.

Analyzing Key Variables in Network Attack Classification on NSL-KDD Dataset using SHAP (SHAP 기반 NSL-KDD 네트워크 공격 분류의 주요 변수 분석)

  • Sang-duk Lee;Dae-gyu Kim;Chang Soo Kim
    • Journal of the Society of Disaster Information
    • /
    • v.19 no.4
    • /
    • pp.924-935
    • /
    • 2023
  • Purpose: The central aim of this study is to leverage machine learning techniques for the classification of Intrusion Detection System (IDS) data, with a specific focus on identifying the variables responsible for enhancing overall performance. Method: First, we classified 'R2L(Remote to Local)' and 'U2R (User to Root)' attacks in the NSL-KDD dataset, which are difficult to detect due to class imbalance, using seven machine learning models, including Logistic Regression (LR) and K-Nearest Neighbor (KNN). Next, we use the SHapley Additive exPlanation (SHAP) for two classification models that showed high performance, Random Forest (RF) and Light Gradient-Boosting Machine (LGBM), to check the importance of variables that affect classification for each model. Result: In the case of RF, the 'service' variable and in the case of LGBM, the 'dst_host_srv_count' variable were confirmed to be the most important variables. These pivotal variables serve as key factors capable of enhancing performance in the context of classification for each respective model. Conclusion: In conclusion, this paper successfully identifies the optimal models, RF and LGBM, for classifying 'R2L' and 'U2R' attacks, while elucidating the crucial variables associated with each selected model.

Combining Multiple Classifiers for Automatic Classification of Email Documents (전자우편 문서의 자동분류를 위한 다중 분류기 결합)

  • Lee, Jae-Haeng;Cho, Sung-Bae
    • Journal of KIISE:Software and Applications
    • /
    • v.29 no.3
    • /
    • pp.192-201
    • /
    • 2002
  • Automated text classification is considered as an important method to manage and process a huge amount of documents in digital forms that are widespread and continuously increasing. Recently, text classification has been addressed with machine learning technologies such as k-nearest neighbor, decision tree, support vector machine and neural networks. However, only few investigations in text classification are studied on real problems but on well-organized text corpus, and do not show their usefulness. This paper proposes and analyzes text classification methods for a real application, email document classification task. First, we propose a combining method of multiple neural networks that improves the performance through the combinations with maximum and neural networks. Second, we present another strategy of combining multiple machine learning classifiers. Voting, Borda count and neural networks improve the overall classification performance. Experimental results show the usefulness of the proposed methods for a real application domain, yielding more than 90% precision rates.

New Classification System for the Standardization of Power IT Terminologies (새로운 매트릭스분류체제에 의한 전력 IT용어 제정에 관한 연구)

  • Kim, Jung-Hoon;Hwang, Hu-Mor;Won, Jong-Ryul
    • Proceedings of the KIEE Conference
    • /
    • 2008.11a
    • /
    • pp.360-362
    • /
    • 2008
  • Based on classification systems of power and IT standard dictionaries, scientific and technological standard, SPARK, power IT fields of IEC and organization units of corporations, we propose a new classification system for the standardization of power of terminologies. The classification system consists of a hierarchical structure with general classification, application fields and specific technologies while keeping the conventional matrix-type classification system. Interpretation work of the power of terminologies confirms that the proposed classification system is efficient.

  • PDF

Application of Bitemporal Classification Technique for Accuracy Improvement of Remotely Sensed Data (원격탐사 데이타의 정확도 향상을 위한 Bitemporal Classification 기법의 적용)

  • 안철호;안기원;윤상호;박민호
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.5 no.2
    • /
    • pp.24-33
    • /
    • 1987
  • This study aims at obtaining more effective image processing techniques and more accurately classified image in the sphere which uses remotely sensed data. For this practice, the result of land use classification compounding summer scene with winter scene and the classified result of summer scene were compared, analyzed. From the upper analysed results, we found that Bitemporal Classification technique and $tan^{-1}$transformation were effective. Particularly, dividing crop class into two classes of farmland and field was more possible by appling Bitemporal Classification technique.

  • PDF

Image Classification for Military Application using Public Landcover Map (공개된 토지피복도를 활용한 위성영상 분류)

  • Hong, Woo-Yong;Park, Wan-Yong;Song, Hyeon-Seung;Jung, Cheol-Hoon;Eo, Yang-Dam;Kim, Seong-Joon
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.13 no.1
    • /
    • pp.147-155
    • /
    • 2010
  • Landcover information of access-denied area was extracted from low-medium and high resolution satellite image. Training for supervised classification was performed to refer visually by landcover map which is made and distributed from The Ministry of Environment. The classification result was compared by relating data of FACC land classification system. As we rasterize digital military map with same pixel size of satellite classification, the accuracy test was performed by image to image method. In vegetation case, ancillary data such as NDVI and image for seasons are going to improve accuracy. FACC code of FDB need to recognize the properties which can be automated.

Selecting Optimal Basis Function with Energy Parameter in Image Classification Based on Wavelet Coefficients

  • Yoo, Hee-Young;Lee, Ki-Won;Jin, Hong-Sung;Kwon, Byung-Doo
    • Korean Journal of Remote Sensing
    • /
    • v.24 no.5
    • /
    • pp.437-444
    • /
    • 2008
  • Land-use or land-cover classification of satellite images is one of the important tasks in remote sensing application and many researchers have tried to enhance classification accuracy. Previous studies have shown that the classification technique based on wavelet transform is more effective than traditional techniques based on original pixel values, especially in complicated imagery. Various basis functions such as Haar, daubechies, coiflets and symlets are mainly used in 20 image processing based on wavelet transform. Selecting adequate wavelet is very important because different results could be obtained according to the type of basis function in classification. However, it is not easy to choose the basis function which is effective to improve classification accuracy. In this study, we first computed the wavelet coefficients of satellite image using ten different basis functions, and then classified images. After evaluating classification results, we tried to ascertain which basis function is the most effective for image classification. We also tried to see if the optimum basis function is decided by energy parameter before classifying the image using all basis functions. The energy parameters of wavelet detail bands and overall accuracy are clearly correlated. The decision of optimum basis function using energy parameter in the wavelet based image classification is expected to be helpful for saving time and improving classification accuracy effectively.

Two-stage Deep Learning Model with LSTM-based Autoencoder and CNN for Crop Classification Using Multi-temporal Remote Sensing Images

  • Kwak, Geun-Ho;Park, No-Wook
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.4
    • /
    • pp.719-731
    • /
    • 2021
  • This study proposes a two-stage hybrid classification model for crop classification using multi-temporal remote sensing images; the model combines feature embedding by using an autoencoder (AE) with a convolutional neural network (CNN) classifier to fully utilize features including informative temporal and spatial signatures. Long short-term memory (LSTM)-based AE (LAE) is fine-tuned using class label information to extract latent features that contain less noise and useful temporal signatures. The CNN classifier is then applied to effectively account for the spatial characteristics of the extracted latent features. A crop classification experiment with multi-temporal unmanned aerial vehicle images is conducted to illustrate the potential application of the proposed hybrid model. The classification performance of the proposed model is compared with various combinations of conventional deep learning models (CNN, LSTM, and convolutional LSTM) and different inputs (original multi-temporal images and features from stacked AE). From the crop classification experiment, the best classification accuracy was achieved by the proposed model that utilized the latent features by fine-tuned LAE as input for the CNN classifier. The latent features that contain useful temporal signatures and are less noisy could increase the class separability between crops with similar spectral signatures, thereby leading to superior classification accuracy. The experimental results demonstrate the importance of effective feature extraction and the potential of the proposed classification model for crop classification using multi-temporal remote sensing images.