• Title/Summary/Keyword: Dataset Training

An Integrated Model based on Genetic Algorithms for Implementing Cost-Effective Intelligent Intrusion Detection Systems (비용효율적 지능형 침입탐지시스템 구현을 위한 유전자 알고리즘 기반 통합 모형)

  • Lee, Hyeon-Uk;Kim, Ji-Hun;Ahn, Hyun-Chul
    • Journal of Intelligence and Information Systems / v.18 no.1 / pp.125-141 / 2012
  • Malicious attacks and hacks on networked systems are increasing dramatically, and their patterns are changing rapidly. Consequently, handling these attacks appropriately has become more important, and there is considerable interest in and demand for effective network security systems such as intrusion detection systems. Intrusion detection systems are network security systems for detecting, identifying, and responding appropriately to unauthorized or abnormal activities. Conventional intrusion detection systems have generally been designed using experts' implicit knowledge of network intrusions or hackers' abnormal behaviors. However, although they perform very well under normal situations, they cannot handle new or unknown patterns of network attacks. As a result, recent studies on intrusion detection systems use artificial intelligence techniques, which can proactively respond to unknown threats. For a long time, researchers have adopted and tested various artificial intelligence techniques, such as artificial neural networks, decision trees, and support vector machines, to detect intrusions on the network. However, most of them have applied these techniques singly, even though combining the techniques may lead to better detection. For this reason, we propose a new integrated model for intrusion detection. Our model is designed to combine the prediction results of four different binary classification models, namely logistic regression (LOGIT), decision trees (DT), artificial neural networks (ANN), and support vector machines (SVM), which may be complementary to each other. Genetic algorithms (GA) are used as a tool for finding the optimal combining weights. Our proposed model is built in two steps. In the first step, the optimal integration model, whose prediction error (i.e., misclassification rate) is the lowest, is generated. In the second step, it explores the optimal classification threshold for determining intrusions, which minimizes the total misclassification cost. To calculate the total misclassification cost of an intrusion detection system, we need to understand its asymmetric error cost scheme. Generally, there are two common forms of errors in intrusion detection. The first error type is the False-Positive Error (FPE), in which normal activity is wrongly judged to be an intrusion and which may result in unnecessary corrective action. The second error type is the False-Negative Error (FNE), in which malicious activity is misjudged as normal. Compared to FPE, FNE is more fatal. Thus, the total misclassification cost is affected more by FNE than by FPE. To validate the practical applicability of our model, we applied it to a real-world dataset for network intrusion detection. The experimental dataset was collected from the IDS sensor of an official institution in Korea from January to June 2010. We collected 15,000 log records in total and selected 10,000 samples from them by random sampling. We also compared the results from our model with the results from the single techniques to confirm the superiority of the proposed model. LOGIT and DT were run using PASW Statistics v18.0, and ANN was run using Neuroshell R4.0. For SVM, LIBSVM v2.90, a free tool for training SVM classifiers, was used. Empirical results showed that our proposed GA-based model outperformed all the other comparative models in detecting network intrusions from the accuracy perspective. They also showed that the proposed model outperformed all the other comparative models from the total misclassification cost perspective. Consequently, it is expected that our study may contribute to building cost-effective intelligent intrusion detection systems.
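
The two-step idea in the abstract above (a weighted combination of classifier outputs, followed by a cost-minimizing decision threshold) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the combining weights are taken as given rather than searched by a genetic algorithm, the threshold is found by a plain grid search, and the function names, scores, and cost values are all hypothetical.

```python
from typing import Sequence


def combine(probs: Sequence[Sequence[float]], weights: Sequence[float]) -> list[float]:
    """Weighted average of per-model intrusion probabilities.

    probs holds one row per model (e.g. LOGIT, DT, ANN, SVM) and one
    column per sample. The weights are assumed to be given; the GA
    search described in the abstract is NOT implemented here.
    """
    total = sum(weights)
    n_samples = len(probs[0])
    return [
        sum(w * row[i] for w, row in zip(weights, probs)) / total
        for i in range(n_samples)
    ]


def total_cost(labels, scores, threshold, cost_fp, cost_fn):
    """Total misclassification cost at a given decision threshold."""
    cost = 0.0
    for y, s in zip(labels, scores):
        predicted = 1 if s >= threshold else 0
        if predicted == 1 and y == 0:
            cost += cost_fp  # false positive: normal traffic flagged
        elif predicted == 0 and y == 1:
            cost += cost_fn  # false negative: intrusion missed
    return cost


def best_threshold(labels, scores, cost_fp, cost_fn, steps=100):
    """Grid search over [0, 1] for the lowest-cost threshold."""
    candidates = [i / steps for i in range(steps + 1)]
    return min(
        candidates,
        key=lambda t: total_cost(labels, scores, t, cost_fp, cost_fn),
    )
```

With a false-negative cost set higher than the false-positive cost (the ratio is a hypothetical input, since the abstract gives no values), the search tends to settle on a lower threshold, accepting more false alarms in exchange for fewer missed intrusions.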

Sorghum Field Segmentation with U-Net from UAV RGB (무인기 기반 RGB 영상 활용 U-Net을 이용한 수수 재배지 분할)

  • Kisu Park;Chanseok Ryu;Yeseong Kang;Eunri Kim;Jongchan Jeong;Jinki Park
    • Korean Journal of Remote Sensing / v.39 no.5_1 / pp.521-535 / 2023
  • When rice paddies are converted into upland fields, sorghum (Sorghum bicolor (L.) Moench) has excellent moisture resistance, enabling stable production along with soybeans. Therefore, it is a crop that is expected to improve the self-sufficiency rate of domestic food crops and help solve the rice supply-demand imbalance. However, fundamental statistics, such as the cultivation area required for estimating yields, are lacking because the traditional survey method takes a long time even with a large workforce. In this study, U-Net was applied to RGB images acquired by an unmanned aerial vehicle to assess the feasibility of non-destructive segmentation of sorghum cultivation fields. RGB images were acquired on July 28, August 13, and August 25, 2022. For each image acquisition date, the data were divided into 6,000 training images and 1,000 validation images of 512 × 512 pixels. Classification models were developed based on three classes consisting of sorghum fields (sorghum), rice and soybean fields (others), and non-agricultural fields (background), and on two classes consisting of sorghum and non-sorghum (others + background). The classification accuracy of sorghum cultivation fields was higher than 0.91 in the three-class models at all acquisition dates, but learning confusion occurred in the other classes in the August dataset. In contrast, the two-class model showed an accuracy of 0.95 or better in all classes, with stable learning on the August dataset. As a result, the two-class models using August imagery should be advantageous for calculating the sorghum cultivation area.

A Study on 3D Indoor mapping for as-built BIM creation by using Graph-based SLAM (준공 BIM 구축을 위한 Graph-based SLAM 기반의 실내공간 3차원 지도화 연구)

  • Jung, Jaehoon;Yoon, Sanghyun;Cyrill, Stachniss;Heo, Joon
    • Korean Journal of Construction Engineering and Management / v.17 no.3 / pp.32-42 / 2016
  • In Korea, the absence of BIM use in existing civil structures and buildings is driving a demand for as-built BIM. As-built BIMs are often created using laser scanners that provide dense 3D point cloud data. Conventional static laser scanning approaches often suffer from limited operability because of the difficulty of moving the equipment, the need to select scanning locations, and the requirement to place targets or extract tie points for registering each scanned point cloud. This paper aims to reduce the manual effort by using a kinematic 3D laser scanning system based on graph-based simultaneous localization and mapping (SLAM) for continuous indoor mapping. The robotic platform carries three 2D laser scanners: the front scanner is mounted horizontally to compute the robot's trajectory and to build the SLAM graph; the other two scanners are mounted vertically to scan the profiles of the surrounding environment. To reduce the accumulated error in the platform's trajectory through loop closures, the graph-based SLAM system incorporates an AdaBoost loop closure approach, which is particularly suitable for the developed multi-scanner system because it provides more features for training than a single-scanner system. We implemented the proposed method and evaluated it at two indoor test sites. Our experimental results show that the false positive rate was reduced by 13.6% and 7.9% for the two datasets. Finally, the 2D and 3D mapping results of the two test sites confirmed the effectiveness of the proposed graph-based SLAM.

A Design and Analysis of Pressure Predictive Model for Oscillating Water Column Wave Energy Converters Based on Machine Learning (진동수주 파력발전장치를 위한 머신러닝 기반 압력 예측모델 설계 및 분석)

  • Seo, Dong-Woo;Huh, Taesang;Kim, Myungil;Oh, Jae-Won;Cho, Su-Gil
    • Journal of the Korea Academia-Industrial cooperation Society / v.21 no.11 / pp.672-682 / 2020
  • Nowadays, research on digital twin technology for efficient operation at various industrial/manufacturing sites is being actively conducted, and the gradual depletion of fossil fuels and environmental pollution issues require new renewable/eco-friendly power generation methods, such as wave power plants. In wave power generation, which generates electricity from the energy of waves, it is very important to understand and predict the amount of power generated and operational efficiency factors, such as breakdowns, because these are closely tied to highly variable wave energy. Therefore, it is necessary to derive a meaningful correlation between highly volatile data, such as wave height data and sensor data in an oscillating water column (OWC) chamber. In addition, a methodological study is needed on predicting the desired information by learning from the extracted data based on the derived correlation. This study designed a workflow-based training model using a machine learning framework to predict the pressure of the OWC. The validity of the pressure prediction analysis was also verified with a verification and evaluation dataset built from IoT sensor data, to enable smart operation and maintenance with the digital twin of the wave generation system.

The Working Conditions for Care Workers and Care Quality in Long-Term Care Services (노인장기요양보험제도에서 요양보호사의 근로조건이 서비스 질에 미치는 효과에 관한 연구)

  • Kwon, Hyun Jung;Hong, Kyung Zoon
    • Korean Journal of Social Welfare / v.69 no.1 / pp.33-57 / 2017
  • This study examines the effect of working conditions for care workers on care quality in long-term care facilities, particularly in light of the coexisting perspectives of publicness and marketization in long-term care services in South Korea. Prior studies have not identified a causal relationship between working conditions and care quality; they have only explained the causes of the low-wage labor market and the low productivity of social services. The theoretical relevance of working conditions to service quality in long-term care in Korea is viewed through the integrated care model of Daly and Lewis (2002). A nonproportional stratified sampling procedure was used to account for the ownership of long-term care facilities. A merged dataset combining surveys from 248 long-term care facilities with online administrative resources from the NHIC was analyzed by multiple regression. The results are as follows. Overall, organizations with better working conditions, such as higher wages, greater fringe benefits, and skills development and training, are likely to have good care quality in each area. This research shows that working conditions, rewards, and support for care workers that are part of organizational culture in the normative dimension, beyond the minimum standards set by labor market policy and the government's regulatory evaluation system, have a positive impact on long-term care quality.

Effect of Experience, Education, Record Keeping, Labor and Decision Making on Monthly Milk Yield and Revenue of Dairy Farms Supported by a Private Organization in Central Thailand

  • Yeamkong, S.;Koonawootrittriron, S.;Elzo, M.A.;Suwanasopee, T.
    • Asian-Australasian Journal of Animal Sciences / v.23 no.6 / pp.814-824 / 2010
  • The objective of this research was to assess the effect of experience, education, record keeping, labor, and decision making on monthly milk yield per farm (MYF), monthly milk yield per cow (MYC), monthly milk revenue per farm (MRF), and monthly revenue per cow (MRC) of dairy farms supported by a private organization in Central Thailand. The dataset contained 34,082 monthly milk yield and revenue records collected from January 2004 to December 2008 on 497 farms, and information on individual farmer experience and education, record keeping, and decision making obtained with a questionnaire. Farmer experience categories were: i) no experience, ii) one year, iii) two to five years, iv) six to ten years, v) eleven to fifteen years, vi) sixteen to twenty years, and vii) more than twenty years. Farmer education categories were: i) no education or primary school, ii) high school, and iii) bachelor or higher degree. Record keeping categories were: i) no records and ii) kept records. Labor categories were: i) family, ii) hired people, and iii) family and hired people. Decision making categories were: i) decisions made by farmers themselves, ii) decisions made with help from government officials, and iii) decisions made with help from organization staff. The mixed linear model contained the fixed effects of year-season, farm location-farm size subclass, experience, education, record keeping, labor, and decision making on sire selection, and the random effects of farm and residual. Results showed that longer experience increased (p<0.05) monthly milk yield (MYF and MYC) and revenue (MRF and MRC). Farms that hired people produced the highest (p<0.05) monthly milk yield (MYF and MYC) and revenue (MRF and MRC), followed by farms that used family, and the lowest values were for farms that used both family and hired people. Better educated farmers produced more MYC and MRC (p<0.05) than less educated farmers. Farms that kept records had higher MYF and MRF (p<0.05) than those without records. Although differences among farms were non-significant, farms that received help from the organization staff had higher monthly milk yield (MYF and MYC) and revenue (MRF and MRC) than those that decided by themselves or with help from government officials. These findings suggested that dairy farmers needed systematic training and continuous support to improve farm milk production and revenues in a sustainable manner.

Application of Deep Learning Method for Real-Time Traffic Analysis using UAV (UAV를 활용한 실시간 교통량 분석을 위한 딥러닝 기법의 적용)

  • Park, Honglyun;Byun, Sunghoon;Lee, Hansung
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.38 no.4 / pp.353-361 / 2020
  • Due to rapid urbanization, various traffic problems are occurring, such as commute-time traffic jams and recurring congestion. To solve these traffic problems, it is necessary to estimate and analyze traffic volume quickly and accurately. ITS (Intelligent Transportation System) is a system that performs optimal traffic management by utilizing the latest ICT (Information and Communications Technology), and research has been conducted on analyzing traffic volume quickly and accurately through various techniques. In this study, we proposed a deep learning-based vehicle detection method using UAV (Unmanned Aerial Vehicle) video for real-time traffic analysis with high accuracy. The UAV was used to capture the orthogonal videos needed for training and verification at intersections where various vehicles pass, and the model was trained with the vehicles classified as sedan, truck, or bus. The experiment on the UAV dataset was carried out using YOLOv3 (You Only Look Once V3), a deep learning-based object detection technique, and achieved an overall object detection rate of 90.21%, a precision of 95.10%, and a recall of 85.79%.
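
For reference, precision and recall of the kind reported in the abstract above are conventionally computed from detection counts as in the sketch below. The function names are hypothetical, and the abstract does not state how its "overall object detection rate" was defined, so the F1 helper is included only as one common way of summarizing the two figures.

```python
def precision(tp: int, fp: int) -> float:
    """Fraction of detections that are correct: TP / (TP + FP)."""
    return tp / (tp + fp) if (tp + fp) else 0.0


def recall(tp: int, fn: int) -> float:
    """Fraction of true objects that were detected: TP / (TP + FN)."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def f1(p: float, r: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * p * r / (p + r) if (p + r) else 0.0
```

For example, `f1(0.9510, 0.8579)` evaluates to roughly 0.902. That is close to the reported 90.21%, but the abstract does not confirm that the overall detection rate is an F1 score.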

Pattern Recognition of the Herbal Drug, Magnoliae Flos According to their Essential Oil Components

  • Jeong, Eun-Sook;Choi, Kyu-Yeol;Kim, Sun-Chun;Son, In-Seop;Cho, Hwang-Eui;Ahn, Su-Youn;Woo, Mi-Hee;Hong, Jin-Tae;Moon, Dong-Cheul
    • Bulletin of the Korean Chemical Society / v.30 no.5 / pp.1121-1126 / 2009
  • This paper describes a pattern recognition method for Magnoliae flos based on a gas chromatographic/mass spectrometric (GC/MS) analysis of the essential oil components. In Korea, the botanical drug mainly comprises four magnolia species (M. denudata, M. biondii, M. kobus, and M. liliflora), although some other species are also traded as the drug. The GC/MS separation of the volatile components, which were extracted by simultaneous distillation and extraction (SDE), was performed on a Carbowax column (Supelcowax 10; 30 m × 0.25 mm × 0.25 μm) using temperature programming. Variance in the retention times for all peaks of interest was within RSD 2% for repeated analyses (n = 9). Of the 74 essential oil components identified from the magnolia species, approximately 10 major components, namely α-pinene, β-pinene, sabinene, myrcene, d-limonene, eucalyptol (1,8-cineole), γ-terpinene, p-cymene, linalool, and α-terpineol, were commonly present in the four species. For statistical analysis, the original dataset was reduced to 13 variables by the Fisher criterion and factor analysis (FA). The essential oil patterns were processed by multivariate statistical analysis, including hierarchical cluster analysis (HCA), principal component analysis (PCA), and discriminant analysis (DA). All samples were divided into four groups with three principal components by PCA and according to their plant origins by HCA. Thirty-three samples (23 training sets and 10 test samples to be assessed) were correctly classified into the four groups predicted by PCA. This method would provide a practical strategy for assessing the authenticity or quality of the well-known herbal drug, Magnoliae flos.

Prediction of Ultimate Strength and Strain of Concrete Columns Retrofitted by FRP Using Adaptive Neuro-Fuzzy Inference System (FRP로 보강된 콘크리트 부재의 압축응력-변형률 예측을 위한 뉴로퍼지모델의 적용)

  • Park, Tae-Won;Na, Ung-Jin;Kwon, Sung-Jun
    • Journal of the Korea Concrete Institute / v.22 no.1 / pp.19-27 / 2010
  • Aging and severe environments are major causes of damage in reinforced concrete (RC) structures such as buildings and bridges. Deterioration such as concrete cracks, corrosion of steel, and deformation of structural members can significantly degrade structural performance and safety. Therefore, effective and easy-to-use methods are desired for repairing and strengthening such concrete structures. Various methods for strengthening and rehabilitating RC structures have been developed in the past several decades. Recently, FRP composite materials have emerged as a cost-effective alternative to conventional materials for repairing, strengthening, and retrofitting deteriorating/deficient concrete structures, by externally bonding FRP laminates to concrete structural members. The main purpose of this study is to investigate the effectiveness of the adaptive neuro-fuzzy inference system (ANFIS) in predicting the behavior of circular concrete columns retrofitted with FRP. To construct the training and testing datasets, experimental results for specimens with different retrofit profiles are used. Retrofit ratio, strength of existing concrete, thickness, number of layers, stiffness, ultimate strength of fiber, and size of specimens are selected as input parameters to predict strength, strain, and stiffness of post-yielding modulus. The proposed ANFIS models show reliably higher accuracy in predicting the constitutive properties of concrete retrofitted with FRP, compared to the constitutive models suggested by other researchers.

Face Identification Using a Near-Infrared Camera in a Nonrestrictive In-Vehicle Environment (적외선 카메라를 이용한 비제약적 환경에서의 얼굴 인증)

  • Ki, Min Song;Choi, Yeong Woo
    • KIPS Transactions on Software and Data Engineering / v.10 no.3 / pp.99-108 / 2021
  • The driver's face inside a vehicle is subject to unconstrained conditions, such as changes in lighting, partial occlusion, and various changes in the driver's condition. In this paper, we propose a face identification system for an unconstrained in-vehicle environment. The proposed method uses a near-infrared (NIR) camera to minimize the changes in facial images caused by illumination changes inside and outside the vehicle. To handle a face exposed to extreme light, the normal face image is converted into a simulated overexposed image using its mean and variance for training. Thus, facial classifiers are generated simultaneously for both normal and extreme illumination conditions. Our method identifies a face by detecting facial landmarks and aggregating the confidence score of each landmark for the final decision. In particular, the performance improvement is highest in the class where the driver wears glasses or sunglasses, owing to the robustness to partial occlusion gained by recognizing each landmark: the driver can be recognized using the scores of the remaining visible landmarks. We also propose a novel robust rejection method and a new evaluation method, which considers the relations between registered and unregistered drivers. The experimental results on our own dataset and on the PolyU and ORL datasets demonstrate the effectiveness of the proposed method.
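
The landmark-based decision described in the abstract above can be illustrated with a rough sketch. Everything here is an assumption made for illustration: the abstract does not specify the aggregation rule or the rejection method, so this sketch simply averages the scores of visible landmarks and rejects when the best average falls below a hypothetical threshold. The function name, the use of `None` for occluded landmarks, and the threshold value are all invented for the example.

```python
from typing import Optional


def identify(
    scores: dict[str, list[Optional[float]]],
    reject_below: float = 0.5,  # hypothetical rejection threshold
) -> Optional[str]:
    """Pick the registered identity with the best mean landmark score.

    scores maps each registered identity to per-landmark confidence
    scores; None marks a landmark that is occluded (e.g. by sunglasses)
    and is skipped. Returns None when no identity is confident enough,
    i.e. the face is treated as an unregistered driver.
    """
    best_id: Optional[str] = None
    best_score = float("-inf")
    for identity, per_landmark in scores.items():
        visible = [s for s in per_landmark if s is not None]
        if not visible:
            continue  # nothing visible to score for this identity
        mean = sum(visible) / len(visible)
        if mean > best_score:
            best_id, best_score = identity, mean
    if best_id is None or best_score < reject_below:
        return None
    return best_id
```

Because occluded landmarks are skipped rather than scored as zero, a driver wearing sunglasses can still be matched from the remaining landmarks, which is the behavior the abstract attributes to per-landmark recognition.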