• 제목/요약/키워드: Machine Learning

검색결과 5,492건 처리시간 0.034초

Single Antenna Based GPS Signal Reception Condition Classification Using Machine Learning Approaches

  • Sanghyun Kim;Seunghyeon Park;Jiwon Seo
    • Journal of Positioning, Navigation, and Timing
    • /
    • 제12권2호
    • /
    • pp.149-155
    • /
    • 2023
  • In urban areas it can be difficult to utilize global navigation satellite systems (GNSS) due to signal reflections and blockages. It is thus crucial to detect reflected or blocked signals because they lead to significant degradation of GNSS positioning accuracy. In a previous study, a classifier for global positioning system (GPS) signal reception conditions was developed using three features and the support vector machine (SVM) algorithm. However, this classifier had limitations in its classification performance. Therefore, in this study, we developed an improved machine learning based method of classifying GPS signal reception conditions by including an additional feature with the existing features. Furthermore, we applied various machine learning classification algorithms. As a result, when tested with datasets collected in different environments than the training environment, the classification accuracy improved by nine percentage points compared to the existing method, reaching up to 58%.

Diagnosing Reading Disorders based on Eye Movements during Natural Reading

  • Yongseok Yoo
    • Journal of information and communication convergence engineering
    • /
    • 제21권4호
    • /
    • pp.281-286
    • /
    • 2023
  • Diagnosing reading disorders involves complex procedures to evaluate complex cognitive processes. For an accurate diagnosis, a series of tests and evaluations by human experts are required. In this study, we propose a quantitative tool to diagnose reading disorders based on natural reading behaviors using minimal human input. The eye movements of the third- and fourth-grade students were recorded while they read a text at their own pace. Seven machine learning models were used to evaluate the gaze patterns of the words in the presented text and classify the students as normal or having a reading disorder. The accuracy of the machine learning-based diagnosis was measured using the diagnosis by human experts as the ground truth. The highest accuracy of 0.8 was achieved by the support vector machine and random forest classifiers. This result demonstrated that machine learning-based automated diagnosis could substitute for the traditional diagnosis of reading disorders and enable large-scale screening for students at an early age.

다중 머신러닝 알고리즘을 이용한 악성 URL 예측 시스템 설계 및 구현 (Design and Implementation of Malicious URL Prediction System based on Multiple Machine Learning Algorithms)

  • 강홍구;신삼신;김대엽;박순태
    • 한국멀티미디어학회논문지
    • /
    • 제23권11호
    • /
    • pp.1396-1405
    • /
    • 2020
  • Cyber threats such as forced personal information collection and distribution of malicious codes using malicious URLs continue to occur. In order to cope with such cyber threats, a security technologies that quickly detects malicious URLs and prevents damage are required. In a web environment, malicious URLs have various forms and are created and deleted from time to time, so there is a limit to the response as a method of detecting or filtering by signature matching. Recently, researches on detecting and predicting malicious URLs using machine learning techniques have been actively conducted. Existing studies have proposed various features and machine learning algorithms for predicting malicious URLs, but most of them are only suggesting specialized algorithms by supplementing features and preprocessing, so it is difficult to sufficiently reflect the strengths of various machine learning algorithms. In this paper, a system for predicting malicious URLs using multiple machine learning algorithms was proposed, and an experiment was performed to combine the prediction results of multiple machine learning models to increase the accuracy of predicting malicious URLs. Through experiments, it was proved that the combination of multiple models is useful in improving the prediction performance compared to a single model.

IoT센서로 수집된 균질 시간 데이터를 이용한 기계학습 기반의 품질관리 및 데이터 보정 (Machine Learning-based Quality Control and Error Correction Using Homogeneous Temporal Data Collected by IoT Sensors)

  • 김혜진;이현수;최병진;김용혁
    • 한국융합학회논문지
    • /
    • 제10권4호
    • /
    • pp.17-23
    • /
    • 2019
  • 본 논문은 온도 등 7 가지의 IoT 센서에서 수집된 기상데이터의 각 기상요소에 대하여 품질관리(Quality Control; QC)를 하였다. 또한, 우리는 측정된 값에 오류가 있는 데이터를 기계학습으로 의미있게 추정하는 방법을 제안한다. 수집된 기상데이터를 기본 QC 결과를 바탕으로 오류 데이터를 선형 보간하여 기계학습 QC를 진행하였으며, 기계학습 기법으로는 대표적인 서포트벡터회귀, 의사결정테이블, 다층퍼셉트론을 사용했다. 기본 QC의 적용 유무에 따라 비교해 보았을 때, 우리는 기본 QC를 거쳐 보간한 기계학습 모델들의 평균절대오차(MAE)가 21% 낮은 것을 확인할 수 있었다. 또한, 기계학습 기법에 따라 비교하여 서포트벡터회귀 모델을 적용하였을 때가, 모든 기상 요소에 대하여 MAE가 평균적으로 다층신경망은 24%, 의사결정테이블은 58% 낮은 것을 알 수 있었다.

Comparison of Machine Learning-Based Radioisotope Identifiers for Plastic Scintillation Detector

  • Jeon, Byoungil;Kim, Jongyul;Yu, Yonggyun;Moon, Myungkook
    • Journal of Radiation Protection and Research
    • /
    • 제46권4호
    • /
    • pp.204-212
    • /
    • 2021
  • Background: Identification of radioisotopes for plastic scintillation detectors is challenging because their spectra have poor energy resolutions and lack photo peaks. To overcome this weakness, many researchers have conducted radioisotope identification studies using machine learning algorithms; however, the effect of data normalization on radioisotope identification has not been addressed yet. Furthermore, studies on machine learning-based radioisotope identifiers for plastic scintillation detectors are limited. Materials and Methods: In this study, machine learning-based radioisotope identifiers were implemented, and their performances according to data normalization methods were compared. Eight classes of radioisotopes consisting of combinations of 22Na, 60Co, and 137Cs, and the background, were defined. The training set was generated by the random sampling technique based on probabilistic density functions acquired by experiments and simulations, and test set was acquired by experiments. Support vector machine (SVM), artificial neural network (ANN), and convolutional neural network (CNN) were implemented as radioisotope identifiers with six data normalization methods, and trained using the generated training set. Results and Discussion: The implemented identifiers were evaluated by test sets acquired by experiments with and without gain shifts to confirm the robustness of the identifiers against the gain shift effect. Among the three machine learning-based radioisotope identifiers, prediction accuracy followed the order SVM > ANN > CNN, while the training time followed the order SVM > ANN > CNN. Conclusion: The prediction accuracy for the combined test sets was highest with the SVM. The CNN exhibited a minimum variation in prediction accuracy for each class, even though it had the lowest prediction accuracy for the combined test sets among three identifiers. The SVM exhibited the highest prediction accuracy for the combined test sets, and its training time was the shortest among three identifiers.

머신러닝을 이용한 다공형 GDI 인젝터의 플래시 보일링 분무 예측 모델 개발 (Development of Flash Boiling Spray Prediction Model of Multi-hole GDI Injector Using Machine Learning)

  • 상몽소;신달호;;박수한
    • 한국분무공학회지
    • /
    • 제27권2호
    • /
    • pp.57-65
    • /
    • 2022
  • The purpose of this study is to use machine learning to build a model capable of predicting the flash boiling spray characteristics. In this study, the flash boiling spray was visualized using Shadowgraph visualization technology, and then the spray image was processed with MATLAB to obtain quantitative data of spray characteristics. The experimental conditions were used as input, and the spray characteristics were used as output to train the machine learning model. For the machine learning model, the XGB (extreme gradient boosting) algorithm was used. Finally, the performance of machine learning model was evaluated using R2 and RMSE (root mean square error). In order to have enough data to train the machine learning model, this study used 12 injectors with different design parameters, and set various fuel temperatures and ambient pressures, resulting in about 12,000 data. By comparing the performance of the model with different amounts of training data, it was found that the number of training data must reach at least 7,000 before the model can show optimal performance. The model showed different prediction performances for different spray characteristics. Compared with the upstream spray angle and the downstream spray angle, the model had the best prediction performance for the spray tip penetration. In addition, the prediction performance of the model showed a relatively poor trend in the initial stage of injection and the final stage of injection. The model performance is expired to be further enhanced by optimizing the hyper-parameters input into the model.

Parameterization of the Company's Business Model for Machine Learning-Based Marketing Stress Testing

  • Menkova, Krystyna;Zozulov, Oleksandr
    • International Journal of Computer Science & Network Security
    • /
    • 제22권2호
    • /
    • pp.318-326
    • /
    • 2022
  • Marketing stress testing is a new method of identifying the company's strengths and weaknesses in a turbulent environment. Technically, this is a complex procedure, so it involves artificial intelligence and machine learning. The main problem is currently the development of methodological approaches to the development of the company's digital model, which will provide a framework for machine learning. The aim of the study was to identify and develop an author's approach to the parameterization of the company's business processes for machine learning-based marketing stress testing. This aim provided the company's activities to be considered as a set of elements (business processes, products) and factors that affect them (marketing environment). The article proposes an author's approach to the parameterization of the company's business processes for machine learning-based marketing stress testing. The proposed approach includes four main elements that are subject to parameterization: elements of the company's internal environment, factors of the marketing environment, the company' core competency and factors impacting the company. Matrices for evaluating the results of the work of expert groups to determine the degree of influence of the marketing environment factors were developed. It is proposed to distinguish between mega-level, macro-level, meso-level and micro-level factors depending on the degree of impact on the company. The methodological limitation of the study is that it involves the modelling method as the only one possible at this stage of the study. The implementation limitation is that the proposed approach can only be used if the company plans to use machine learning for marketing stress testing.

머신러닝 애플리케이션 구현 비용 평가를 위한 확장형 기능 포인트 모델 (An Extended Function Point Model for Estimating the Implementing Cost of Machine Learning Applications )

  • 임석진
    • 문화기술의 융합
    • /
    • 제9권2호
    • /
    • pp.475-481
    • /
    • 2023
  • 머신러닝과 같은 소프트웨어가 일상생활에 매우 큰 영향력을 발휘하고 있는 상황에서, 소프트웨어의 개발비용을 평가하는 비용 모델의 중요성이 지속적으로 증가하고 있다. 비용 모델로서 LOC(Line of Code)와 M/M(Man-Month) 모델은 소프트웨어의 양적인 요소들을 측정하는 비용모델이다. 이와는 달리, FP(Function Point)는 소프트웨어의 기능적 특징들을 평가하는 비용모델로서 소프트웨어의 질적인 요소를 평가한다는 점에서 효과적이다. 그러나 FP는 머신러닝 소프트웨어의 주요한 요소들을 평가하지 않기 때문에 머신러닝 소프트웨어를 평가하는데 한계를 가진다. 본 논문은 확장형 FP(Extended Function Point, ExFP)를 제안한다. 확장형 FP는 머신러닝의 주요 특징인 하이퍼 파라미터와 그것의 최적화에 대한 복잡도를 반영하여 소프트웨어의 기능적 요소를 평가하도록 확장하였기 때문에 머신러닝과 같은 최신 소프트웨어에의 비용 평가에 적합하다. 머신러닝 소프트웨어의 특징을 반영한 평가를 통해 제안된 확장형 FP의 효용성을 보였다.

Machine Learning Prediction for the Recurrence After Electrical Cardioversion of Patients With Persistent Atrial Fibrillation

  • Soonil Kwon;Eunjung Lee;Hojin Ju;Hyo-Jeong Ahn;So-Ryoung Lee;Eue-Keun Choi;Jangwon Suh;Seil Oh;Wonjong Rhee
    • Korean Circulation Journal
    • /
    • 제53권10호
    • /
    • pp.677-689
    • /
    • 2023
  • Background and Objectives: There is limited evidence regarding machine-learning prediction for the recurrence of atrial fibrillation (AF) after electrical cardioversion (ECV). This study aimed to predict the recurrence of AF after ECV using machine learning of clinical features and electrocardiograms (ECGs) in persistent AF patients. Methods: We analyzed patients who underwent successful ECV for persistent AF. Machine learning was designed to predict patients with 1-month recurrence. Individual 12-lead ECGs were collected before and after ECV. Various clinical features were collected and trained the extreme gradient boost (XGBoost)-based model. Ten-fold cross-validation was used to evaluate the performance of the model. The performance was compared to the C-statistics of the selected clinical features. Results: Among 718 patients (mean age 63.5±9.3 years, men 78.8%), AF recurred in 435 (60.6%) patients after 1 month. With the XGBoost-based model, the areas under the receiver operating characteristic curves (AUROCs) were 0.57, 0.60, and 0.63 if the model was trained by clinical features, ECGs, and both (the final model), respectively. For the final model, the sensitivity, specificity, and F1-score were 84.7%, 28.2%, and 0.73, respectively. Although the AF duration showed the best predictive performance (AUROC, 0.58) among the clinical features, it was significantly lower than that of the final machine-learning model (p<0.001). Additional training of extended monitoring data of 15-minute single-lead ECG and photoplethysmography in available patients (n=261) did not significantly improve the model's performance. Conclusions: Machine learning showed modest performance in predicting AF recurrence after ECV in persistent AF patients, warranting further validation studies.

혼합 기계 학습 기반 소변 스펙트럼 분석 앙상블 모델 (Ensemble Model for Urine Spectrum Analysis Based on Hybrid Machine Learning)

  • 최재혁;정목동
    • 한국멀티미디어학회논문지
    • /
    • 제23권8호
    • /
    • pp.1059-1065
    • /
    • 2020
  • In hospitals, nurses are subjectively determining the urine status to check the kidneys and circulatory system of patients whose statuses are related to patients with kidney disease, critically ill patients, and nursing homes before and after surgery. To improve this problem, this paper proposes a urine spectrum analysis system which clusters urine test results based on a hybrid machine learning model consists of unsupervised learning and supervised learning. The proposed system clusters the spectral data using unsupervised learning in the first part, and classifies them using supervised learning in the second part. The results of the proposed urine spectrum analysis system using a mixed model are evaluated with the results of pure supervised learning. This paper is expected to provide better services than existing medical services to patients by solving the shortage of nurses, shortening of examination time, and subjective evaluation in hospitals.