• 제목/요약/키워드: Automated machine learning

검색결과 192건 처리시간 0.03초

머신 러닝 접근 방식을 통한 가짜 채용 탐지 (Detecting Fake Job Recruitment with a Machine Learning Approach)

  • 일킨 타히예프;이재흥
    • 스마트미디어저널
    • /
    • 제12권2호
    • /
    • pp.36-41
    • /
    • 2023
  • 지원자 추적 시스템의 등장으로 온라인 채용이 활성화되면서 채용 사기가 심각한 문제로 대두되고 있다. 이 연구는 온라인 채용 환경에서 채용 사기를 탐지할 수 있는 신뢰할 수 있는 모델을 개발하여 비용 손실을 줄이고 개인 사생활 보호를 강화하고자 한다. 이 연구의 주요 기여는 데이터를 탐색적으로 분석하여 얻은 통찰력을 활용하여 어떤 채용 정보가 사기인지, 아니면 합법적인지를 구분할 수 있는 자동화된 방법론을 제공하는데 있다. 캐글에서 제공하는 채용 사기 데이터 집합인 EMSCAD를 사용하여 다양한 단일 분류기 및 앙상블 분류기 기반 머신러닝 모델을 훈련하고 평가하였으며, 그 결과로 앙상블 분류기인 랜덤 포레스트 분류기가 정확도 98.67%, F1 점수 0.81로 가장 좋은 결과를 보이는 것을 알 수 있었다.

Application of machine learning for merging multiple satellite precipitation products

  • Van, Giang Nguyen;Jung, Sungho;Lee, Giha
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2021년도 학술발표회
    • /
    • pp.134-134
    • /
    • 2021
  • Precipitation is a crucial component of water cycle and play a key role in hydrological processes. Traditionally, gauge-based precipitation is the main method to achieve high accuracy of rainfall estimation, but its distribution is sparsely in mountainous areas. Recently, satellite-based precipitation products (SPPs) provide grid-based precipitation with spatio-temporal variability, but SPPs contain a lot of uncertainty in estimated precipitation, and the spatial resolution quite coarse. To overcome these limitations, this study aims to generate new grid-based daily precipitation using Automatic weather system (AWS) in Korea and multiple SPPs(i.e. CHIRPSv2, CMORPH, GSMaP, TRMMv7) during the period of 2003-2017. And this study used a machine learning based Random Forest (RF) model for generating new merging precipitation. In addition, several statistical linear merging methods are used to compare with the results of the RF model. In order to investigate the efficiency of RF, observed data from 64 observed Automated Synoptic Observation System (ASOS) were collected to evaluate the accuracy of the products through Kling-Gupta efficiency (KGE), probability of detection (POD), false alarm rate (FAR), and critical success index (CSI). As a result, the new precipitation generated through the random forest model showed higher accuracy than each satellite rainfall product and spatio-temporal variability was better reflected than other statistical merging methods. Therefore, a random forest-based ensemble satellite precipitation product can be efficiently used for hydrological simulations in ungauged basins such as the Mekong River.

  • PDF

Air-Launched Weapon Engagement Zone Development Utilizing SCG (Scaled Conjugate Gradient) Algorithm

  • Hansang JO;Rho Shin MYONG
    • 한국인공지능학회지
    • /
    • 제12권2호
    • /
    • pp.17-23
    • /
    • 2024
  • Various methods have been developed to predict the flight path of an air-launched weapon to intercept a fast-moving target in the air. However, it is also getting more challenging to predict the optimal firing zone and provide it to a pilot in real-time during engagements for advanced weapons having new complicated guidance and thrust control. In this study, a method is proposed to develop an optimized weapon engagement zone by the SCG (Scaled Conjugate Gradient) algorithm to achieve both accurate and fast estimates and provide an optimized launch display to a pilot during combat engagement. SCG algorithm is fully automated, includes no critical user-dependent parameters, and avoids an exhaustive search used repeatedly to determine the appropriate stage and size of machine learning. Compared with real data, this study showed that the development of a machine learning-based weapon aiming algorithm can provide proper output for optimum weapon launch zones that can be used for operational fighters. This study also established a process to develop one of the critical aircraft-weapon integration software, which can be commonly used for aircraft integration of air-launched weapons.

Artificial Intelligence based Tumor detection System using Computational Pathology

  • Naeem, Tayyaba;Qamar, Shamweel;Park, Peom
    • 시스템엔지니어링학술지
    • /
    • 제15권2호
    • /
    • pp.72-78
    • /
    • 2019
  • Pathology is the motor that drives healthcare to understand diseases. The way pathologists diagnose diseases, which involves manual observation of images under a microscope has been used for the last 150 years, it's time to change. This paper is specifically based on tumor detection using deep learning techniques. Pathologist examine the specimen slides from the specific portion of body (e-g liver, breast, prostate region) and then examine it under the microscope to identify the effected cells among all the normal cells. This process is time consuming and not sufficiently accurate. So, there is a need of a system that can detect tumor automatically in less time. Solution to this problem is computational pathology: an approach to examine tissue data obtained through whole slide imaging using modern image analysis algorithms and to analyze clinically relevant information from these data. Artificial Intelligence models like machine learning and deep learning are used at the molecular levels to generate diagnostic inferences and predictions; and presents this clinically actionable knowledge to pathologist through dynamic and integrated reports. Which enables physicians, laboratory personnel, and other health care system to make the best possible medical decisions. I will discuss the techniques for the automated tumor detection system within the new discipline of computational pathology, which will be useful for the future practice of pathology and, more broadly, medical practice in general.

기계학습 분류기의 예측확률과 만장일치를 이용한 한국어 서답형 문항 자동채점 시스템 (Automated Scoring System for Korean Short-Answer Questions Using Predictability and Unanimity)

  • 천민아;김창현;김재훈;노은희;성경희;송미영
    • 정보처리학회논문지:소프트웨어 및 데이터공학
    • /
    • 제5권11호
    • /
    • pp.527-534
    • /
    • 2016
  • 최근 정보화 사회에서는 단순 암기보다는 문제 해결 능력과 종합적인 사고력을 바탕으로 창의적인 생각을 할 수 있는 인재를 요구한다. 이에 따라 교육과정도 학생들의 종합적인 사고력을 판단할 수 있는 서답형 문항을 늘리는 방향으로 변하고 있다. 그러나 서답형 문항의 경우 채점자의 주관에 의존하여 채점이 진행되기 때문에, 채점 결과의 일관성을 확보하기 어렵다는 단점이 있다. 이런 점을 해결하기 위해 해외에서는 기계학습을 이용한 자동채점 시스템을 채점 도구로 사용하고 있다. 한국어는 영어와 언어학적으로 다른 분류에 속하므로 영어권에서 사용하는 자동채점 시스템을 한국어에 그대로 적용할 수 없다. 따라서 한국어 체계에 맞는 자동채점 시스템의 개발이 필요하다. 본 논문에서는 기계학습 분류기의 예측확률과 만장일치 방법을 사용한 한국어 서답형 문항 자동채점 시스템을 소개하고, 자동채점 시스템을 이용한 채점 결과와 교과 전문가의 채점 결과를 비교하여 자동채점 시스템의 실용성을 검증한다. 본 논문의 실험을 위해 2014년 국가수준 학업성취도 평가의 국어, 사회, 과학 교과의 서답형 문항을 사용했다. 평가 척도로 피어슨 상관계수와 카파계수를 사용했다. 채점자가 개입했을 때와 개입하지 않았을 때의 상관계수 모두 0.7 이상으로 강한 양의 상관관계를 보였다. 이는 자동채점 시스템이 교과 전문가가 채점한 결과와 유사한 방향으로 답안에 점수를 부여한 것이므로 자동채점 시스템을 채점 보조도구로서 충분히 사용할 수 있을 것이다.

Deep-learning based In-situ Monitoring and Prediction System for the Organic Light Emitting Diode

  • Park, Il-Hoo;Cho, Hyeran;Kim, Gyu-Tae
    • 반도체디스플레이기술학회지
    • /
    • 제19권4호
    • /
    • pp.126-129
    • /
    • 2020
  • We introduce a lifetime assessment technique using deep learning algorithm with complex electrical parameters such as resistivity, permittivity, impedance parameters as integrated indicators for predicting the degradation of the organic molecules. The evaluation system consists of fully automated in-situ measurement system and multiple layer perceptron learning system with five hidden layers and 1011 perceptra in each layer. Prediction accuracies are calculated and compared depending on the physical feature, learning hyperparameters. 62.5% of full time-series data are used for training and its prediction accuracy is estimated as r-square value of 0.99. Remaining 37.5% of the data are used for testing with prediction accuracy of 0.95. With k-fold cross-validation, the stability to the instantaneous changes in the measured data is also improved.

Automated Prioritization of Construction Project Requirements using Machine Learning and Fuzzy Logic System

  • Hassan, Fahad ul;Le, Tuyen;Le, Chau;Shrestha, K. Joseph
    • 국제학술발표논문집
    • /
    • The 9th International Conference on Construction Engineering and Project Management
    • /
    • pp.304-311
    • /
    • 2022
  • Construction inspection is a crucial stage that ensures that all contractual requirements of a construction project are verified. The construction inspection capabilities among state highway agencies have been greatly affected due to budget reduction. As a result, efficient inspection practices such as risk-based inspection are required to optimize the use of limited resources without compromising inspection quality. Automated prioritization of textual requirements according to their criticality would be extremely helpful since contractual requirements are typically presented in an unstructured natural language in voluminous text documents. The current study introduces a novel model for predicting the risk level of requirements using machine learning (ML) algorithms. The ML algorithms tested in this study included naïve Bayes, support vector machines, logistic regression, and random forest. The training data includes sequences of requirement texts which were labeled with risk levels (such as very low, low, medium, high, very high) using the fuzzy logic systems. The fuzzy model treats the three risk factors (severity, probability, detectability) as fuzzy input variables, and implements the fuzzy inference rules to determine the labels of requirements. The performance of the model was examined on labeled dataset created by fuzzy inference rules and three different membership functions. The developed requirement risk prediction model yielded a precision, recall, and f-score of 78.18%, 77.75%, and 75.82%, respectively. The proposed model is expected to provide construction inspectors with a means for the automated prioritization of voluminous requirements by their importance, thus help to maximize the effectiveness of inspection activities under resource constraints.

  • PDF

스마트 팩토리를 위한 자율주행 시뮬레이터 기반 지능형 AGV 머신러닝 시스템 (Intelligent AGV Machine-Learning System based on Self-Driving Simulator for Smart Factory)

  • 이세훈;김기철;문환복;김도균
    • 한국컴퓨터정보학회:학술대회논문집
    • /
    • 한국컴퓨터정보학회 2017년도 제56차 하계학술대회논문집 25권2호
    • /
    • pp.17-18
    • /
    • 2017
  • 본 논문은 스마트 팩토리의 중요 요소인 무인반송차(AGV)를 자율 주행시키기 위해 오픈 소스 자율 주행차 시뮬레이터인 udacity를 이용해 머신 러닝시키는 시스템을 개발하였다. 공장의 운행 루트를 자율주행 시뮬레이터의 전경으로 가공하고, 3개의 카메라를 부착시킨 AGV를 운행시키면서 머신 러닝시킨다. AGV를 주행하여 얻어진 여러 학습 데이터를 통해 도출된 결과들을 각각 비교하여 우수한 모델을 선정하고 운행시킨 결과 AGV가 정해진 운행 루트를 정확하게 주행하는 것을 확인하였다. 이를 통해, 가상 운행 환경에서 저비용으로 AGV 운행 학습이 가능하다는 것을 보였다.

  • PDF

A Study on Variant Malware Detection Techniques Using Static and Dynamic Features

  • Kang, Jinsu;Won, Yoojae
    • Journal of Information Processing Systems
    • /
    • 제16권4호
    • /
    • pp.882-895
    • /
    • 2020
  • The amount of malware increases exponentially every day and poses a threat to networks and operating systems. Most new malware is a variant of existing malware. It is difficult to deal with numerous malware variants since they bypass the existing signature-based malware detection method. Thus, research on automated methods of detecting and processing variant malware has been continuously conducted. This report proposes a method of extracting feature data from files and detecting malware using machine learning. Feature data were extracted from 7,000 malware and 3,000 benign files using static and dynamic malware analysis tools. A malware classification model was constructed using multiple DNN, XGBoost, and RandomForest layers and the performance was analyzed. The proposed method achieved up to 96.3% accuracy.

A Machine Learning Approach to Detect the Dog's Behavior using Wearable Sensors

  • Aich, Satyabrata;Chakraborty, Sabyasachi;Joo, Moon-il;Sim, Jong Seong;Kim, Hee-Cheol
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국정보통신학회 2019년도 춘계학술대회
    • /
    • pp.281-282
    • /
    • 2019
  • In recent years welfare of animals is the biggest challenge because animals, especially dogs are widely recognized as pet as well as they are using as service animals. So, for the wellbeing of the dog it is necessary to perform objective assessment to track their behavior in everyday life. In this paper, we have proposed an automatic behavior assessment system for dogs based on a neck worn and tail worn accelerometer and gyroscope platform, and data analysis techniques that recognize typical dog activities. We evaluate the system based on the analysis of 8 behavior traits in 3 dogs, incorporating 2 breeds of various sizes. Our proposed framework able to reproduce the manual assessment that is based on the video recording which is treated as gold standard that exhibits the real-life use case of automated dog behavior analysis.

  • PDF