• Title/Summary/Keyword: 머신러닝 자동화 (machine learning automation)

A Study on Automation of Big Data Quality Diagnosis Using Machine Learning (머신러닝을 이용한 빅데이터 품질진단 자동화에 관한 연구)

  • Lee, Jin-Hyoung
    • The Journal of Bigdata / v.2 no.2 / pp.75-86 / 2017
  • This study proposes a method for automating the quality diagnosis of big data. Automation is needed because, as the Fourth Industrial Revolution gains attention, demand is growing for ever larger volumes of data to be generated and used, and data volumes are increasing rapidly. If quality diagnosis takes a long time, use of the data is delayed or its quality may degrade, and decisions or predictions based on low-quality data lead in the wrong direction. To address this, I developed a model that uses machine learning to automate diagnosis so that data can be diagnosed and improved quickly. Machine learning automates the domain classification task, which prevents errors that can occur during domain classification and reduces working time. Building on these results, I intend to contribute to improving data quality for big data use by continuing research on the importance of data conversion, learning methods for unlearned data, and the development of classification models for each domain.

The Automated System for Location Visiting Preference Prediction with Personality Factors (사람 성격 요소에 따른 위치 방문 선호도 예측의 자동화 시스템)

  • Song, Ha Yoon;Jung, Ji Hyun
    • Annual Conference of KIPS / 2021.11a / pp.935-938 / 2021
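  • The system automatically retrieves users' location and personality data stored in a database and applies machine-learning regression to predict their preference for visiting places. BFF and other basic factors were used as the personality factors. An automated system was built for this purpose, and an ensemble method was used as the machine learning technique for predicting location visiting preference. The prediction results give visiting preference by place category and will be stored separately for each user. By resolving problems that arise as the volume of data grows, the system is expected to support future research.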

Proposal and empirical study of web shell detection system (MWSDS) applying machine learning-based supervised learning and classification (머신러닝기반의 지도학습과 분류 알고리즘을 적용한 웹쉘 탐지시스템(MWSDS)제안 연구)

  • Ki-hwan Kim;Sangdo Lee;Yongtae Shin
    • Proceedings of the Korean Society of Computer Information Conference / 2024.01a / pp.49-50 / 2024
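  • To classify web shell malware accurately and detect web shells quickly through automatic classification and analysis, this paper proposes MWSDS, an automated detection system that applies machine-learning-based supervised learning and classification algorithms, and validates it on experimental web shell data. The proposal is expected to enable more effective and sustained responses, not only by countering web shell malware attacks but also by establishing an administrative information security framework.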

A Study on Artificial Intelligence-based Automated Integrated Security Control System Model (인공지능 기반의 자동화된 통합보안관제시스템 모델 연구)

  • Wonsik Nam;Han-Jin Cho
    • Smart Media Journal / v.13 no.3 / pp.45-52 / 2024
  • In today's growing threat environment, rapid and effective detection of and response to security events is essential. To meet this need, many companies and organizations respond to security threats by introducing security control systems. However, existing security control systems struggle with the complexity and diverse characteristics of security events. In this study, we propose an automated integrated security control system model based on artificial intelligence. It is based on deep learning, an artificial intelligence technology, and provides effective detection and processing functions for various security events. To this end, the model applies various artificial intelligence algorithms and machine learning methods to overcome the limitations of existing security control systems. The proposed model reduces the operator's workload, ensures efficient operation, and supports rapid response to security threats.

Development of Dataset Collection RPA for Machine Learning (머신러닝을 위한 데이터셋 수집 RPA 개발)

  • Kim, Ki-Tae;Seo, Bo-in;Yun, Sang-Hyeok;Lee, Sei-Hoon
    • Proceedings of the Korean Society of Computer Information Conference / 2020.07a / pp.295-296 / 2020
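  • This paper describes, as part of developing an RPA (Robotic Process Automation) tool, the process of producing processed datasets using the image crawling and preprocessing functions needed for machine learning and deep learning. The RPA tool provides functions for acquiring data for machine learning and deep learning, including frequently repeated image preprocessing operations (Convert Gray, Histogram Equalization, Binary, Resize). By combining RPA automation with preprocessing, the tool improves work efficiency.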
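
The four preprocessing operations named in this abstract can be sketched in plain Python. This is only an illustration under stated assumptions: the abstract does not say which library or parameters the RPA tool uses, so the function names, the BT.601 grayscale weights, the threshold of 128, and nearest-neighbor resizing are all choices made here, and images are represented as nested lists rather than real image files.

```python
def to_gray(rgb):
    """Convert rows of (R, G, B) tuples to 0-255 luminance (BT.601 weights, an assumption)."""
    return [[round(0.299 * r + 0.587 * g + 0.114 * b) for r, g, b in row] for row in rgb]


def equalize_histogram(gray):
    """Spread intensities over 0-255 using the cumulative distribution of pixel values."""
    pixels = [p for row in gray for p in row]
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    cdf, running = [0] * 256, 0
    for value, count in enumerate(hist):
        running += count
        cdf[value] = running
    cdf_min = next(c for c in cdf if c > 0)
    total = len(pixels)
    if total == cdf_min:
        return [row[:] for row in gray]  # single-intensity image: nothing to equalize
    scale = 255 / (total - cdf_min)
    return [[round((cdf[p] - cdf_min) * scale) for p in row] for row in gray]


def binarize(gray, threshold=128):
    """Map each pixel to 0 or 255 (the threshold value is a hypothetical default)."""
    return [[255 if p >= threshold else 0 for p in row] for row in gray]


def resize_nearest(gray, new_h, new_w):
    """Resize by nearest-neighbor sampling (one of several possible interpolation choices)."""
    old_h, old_w = len(gray), len(gray[0])
    return [
        [gray[y * old_h // new_h][x * old_w // new_w] for x in range(new_w)]
        for y in range(new_h)
    ]


def preprocess(rgb, size=(4, 4)):
    """Chain the four steps in the order the abstract lists them."""
    gray = to_gray(rgb)
    return resize_nearest(binarize(equalize_histogram(gray)), *size)


# Hypothetical 2x2 test image, not data from the paper.
sample = [[(0, 0, 0), (255, 255, 255)], [(100, 100, 100), (200, 200, 200)]]
result = preprocess(sample)
```

In practice an image library would normally replace these hand-written loops; the sketch only shows what each named step does to the pixel values.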

A Study of Big Data Domain Automatic Classification Using Machine Learning (머신러닝을 이용한 빅데이터 도메인 자동 판별에 관한 연구)

  • Kong, Seongwon;Hwang, Deokyoul
    • The Journal of Bigdata / v.3 no.2 / pp.11-18 / 2018
  • This study addresses automatic domain classification for domain-based quality diagnosis, a key element of big data quality diagnosis. As the value and use of big data increase and the Fourth Industrial Revolution advances, efforts are under way worldwide to create new value by applying big data in fields converging with IT, such as law, medicine, and finance. However, analysis based on low-reliability data causes critical problems in both the process and the result, and judgments based on such analysis are difficult to trust. Although the need for highly reliable data has grown, research on data quality and its results has been insufficient. The purpose of this study is to shorten working time by using machine learning to automate domain classification, which was previously performed manually, in domain-based quality diagnosis, a key element of diagnostic evaluation for improving data quality. The method extracts information about the characteristics of data stored in a database, identifies the domain, converts that information into features, and automates domain classification using machine learning. We will apply it to big data quality diagnosis and contribute to quality improvement.
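
The featurize-then-classify idea in this abstract can be illustrated with a small sketch. Everything specific here is an assumption: the abstract names neither the features nor the learning algorithm, so the three character-level features, the 1-nearest-neighbor rule, the domain names, and the sample columns are hypothetical stand-ins.

```python
import math


def featurize(values):
    """Summarize a column's values as a small numeric feature vector (hypothetical features)."""
    texts = [str(v) for v in values]
    if not texts:
        raise ValueError("column has no values")
    chars = "".join(texts)
    n_chars = max(len(chars), 1)
    return (
        sum(c.isdigit() for c in chars) / n_chars,      # share of digit characters
        sum(c.isalpha() for c in chars) / n_chars,      # share of letter characters
        sum(len(t) for t in texts) / len(texts) / 20,   # mean length, scaled by an arbitrary 20
    )


def classify(values, labeled_columns):
    """Return the domain of the labeled column with the closest features (1-nearest neighbor)."""
    target = featurize(values)
    nearest = min(labeled_columns, key=lambda item: math.dist(target, featurize(item[1])))
    return nearest[0]


# Hypothetical labeled columns, not data from the study.
TRAINING = [
    ("date", ["2017-03-01", "2018-11-23", "2016-07-15"]),
    ("phone", ["010-1234-5678", "02-345-6789", "031-222-3333"]),
    ("name", ["Kim Minsu", "Lee Jiwoo", "Park Seoyeon"]),
    ("amount", ["15000", "230", "998000"]),
]

predicted = classify(["2020-01-31", "2019-12-25"], TRAINING)
```

A trained classifier over richer features would take the place of the nearest-neighbor lookup; the point is only that each column is reduced to features before a domain label is predicted.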

Exploratory Research on Automating the Analysis of Scientific Argumentation Using Machine Learning (머신 러닝을 활용한 과학 논변 구성 요소 코딩 자동화 가능성 탐색 연구)

  • Lee, Gyeong-Geon;Ha, Heesoo;Hong, Hun-Gi;Kim, Heui-Baik
    • Journal of The Korean Association For Science Education / v.38 no.2 / pp.219-234 / 2018
  • In this study, we explored the possibility of automating the analysis of elements of scientific argument in the context of a Korean classroom. To gather training data, we collected 990 sentences from science education journals that illustrate the results of coding elements of argumentation according to Toulmin's argumentation structure framework. We extracted 483 sentences as a test data set from transcripts of students' discourse in scientific argumentation activities. The words and morphemes of each argument were analyzed using the Python 'KoNLPy' package and the 'Kkma' module for Korean natural language processing. After constructing the 'argument-morpheme:class' matrix for 1,473 sentences, five machine learning techniques were applied to generate predictive models relating each sentence to the element of argument to which it corresponded. The accuracy of the predictive models was investigated by comparing them with the results of pre-coding by researchers and confirming the degree of agreement. The predictive model generated by the k-nearest neighbor algorithm (KNN) demonstrated the highest degree of agreement [54.04% ($\kappa = 0.22$)] when machine learning considered the morphemes of each sentence. The KNN model exhibited higher agreement [55.07% ($\kappa = 0.24$)] when the coding results of the previous sentence were added to the prediction process. In addition, the results indicated the importance of considering the context of discourse by reflecting the codes of previous sentences in the analysis. The results are significant in that they show the possibility of automating the analysis of students' argumentation activities in the Korean language by applying machine learning.
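
The two agreement figures reported in this abstract, percent agreement and $\kappa$, can be computed as in the sketch below. It assumes that $\kappa$ denotes Cohen's kappa, $\kappa = (p_o - p_e) / (1 - p_e)$, which the abstract does not state explicitly; the label lists are hypothetical and are not data from the study.

```python
from collections import Counter


def percent_agreement(a, b):
    """Fraction of items on which two coders assign the same label (p_o)."""
    if len(a) != len(b) or not a:
        raise ValueError("label lists must be non-empty and of equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def cohens_kappa(a, b):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    p_o = percent_agreement(a, b)
    n = len(a)
    count_a, count_b = Counter(a), Counter(b)
    # Chance agreement: probability both coders pick the same label independently.
    p_e = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)
    if p_e == 1.0:
        return 1.0  # both coders used one identical label throughout
    return (p_o - p_e) / (1 - p_e)


# Hypothetical codes for six sentences: researcher pre-coding vs. model prediction.
human = ["claim", "data", "claim", "warrant", "data", "claim"]
model = ["claim", "claim", "claim", "warrant", "data", "data"]

agreement = percent_agreement(human, model)
kappa = cohens_kappa(human, model)
```

Because kappa subtracts the agreement expected by chance, it can be low even when raw agreement exceeds 50%, which matches the pattern of the reported figures.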

Proposal Record Automation Service Based on AI by Using OCR and Pattern Analysis Algorithm (OCR과 패턴분석 알고리즘을 활용한 인공지능 기반 기록 자동화 서비스 제안)

  • Hwang, Yun-Young
    • Annual Conference of KIPS / 2019.10a / pp.530-532 / 2019
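  • The proposed service manages documents efficiently using OCR (Optical Character Recognition) and a deep learning pattern analysis algorithm, and provides features for users who take a lot of handwritten notes. Although machine-learning-based OCR has recently seen growing use in many fields, existing applications combine pattern analysis algorithms with statistics-based OCR, so their recognition rate for handwriting is not high. This paper therefore proposes a service that uses OCR and a pattern analysis algorithm to provide a high recognition rate for handwriting.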