• Title/Summary/Keyword: Machine Learning Techniques

Search Result 1,051, Processing Time 0.031 seconds

Application of Machine Learning Techniques for the Classification of Source Code Vulnerability (소스코드 취약성 분류를 위한 기계학습 기법의 적용)

  • Lee, Won-Kyung;Lee, Min-Ju;Seo, DongSu
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.30 no.4
    • /
    • pp.735-743
    • /
    • 2020
  • Secure coding is a technique that detects malicious attack or unexpected errors to make software systems resilient against such circumstances. In many cases secure coding relies on static analysis tools to find vulnerable patterns and contaminated data in advance. However, secure coding has the disadvantage of being dependent on rule-sets, and accurate diagnosis is difficult as the complexity of static analysis tools increases. In order to support secure coding, we apply machine learning techniques, such as DNN, CNN and RNN to investigate into finding major weakness patterns shown in secure development coding guides and present machine learning models and experimental results. We believe that machine learning techniques can support detecting security weakness along with static analysis techniques.

Modeling on Expansion Behavior of Gwangan Bridge using Machine Learning Techniques and Structural Monitoring Data (머신러닝 기법과 계측 모니터링 데이터를 이용한 광안대교 신축거동 모델링)

  • Park, Ji Hyun;Shin, Sung Woo;Kim, Soo Yong
    • Journal of the Korean Society of Safety
    • /
    • v.33 no.6
    • /
    • pp.42-49
    • /
    • 2018
  • In this study, we have developed a prediction model for expansion and contraction behaviors of expansion joint in Gwangan Bridge using machine learning techniques and bridge monitoring data. In the development of the prediction model, two famous machine learning techniques, multiple regression analysis (MRA) and artificial neural network (ANN), were employed. Structural monitoring data obtained from bridge monitoring system of Gwangan Bridge were used to train and validate the developed models. From the results, it was found that the expansion and contraction behaviors predicted by the developed models are matched well with actual expansion and contraction behaviors of Gwangan Bridge. Therefore, it can be concluded that both MRA and ANN models can be used to predict the expansion and contraction behaviors of Gwangan Bridge without actual measurements of those behaviors.

Detection of E.coli biofilms with hyperspectral imaging and machine learning techniques

  • Lee, Ahyeong;Seo, Youngwook;Lim, Jongguk;Park, Saetbyeol;Yoo, Jinyoung;Kim, Balgeum;Kim, Giyoung
    • Korean Journal of Agricultural Science
    • /
    • v.47 no.3
    • /
    • pp.645-655
    • /
    • 2020
  • Bacteria are a very common cause of food poisoning. Moreover, bacteria form biofilms to protect themselves from harsh environments. Conventional detection methods for foodborne bacterial pathogens including the plate count method, enzyme-linked immunosorbent assays (ELISA), and polymerase chain reaction (PCR) assays require a lot of time and effort. Hyperspectral imaging has been used for food safety because of its non-destructive and real-time detection capability. This study assessed the feasibility of using hyperspectral imaging and machine learning techniques to detect biofilms formed by Escherichia coli. E. coli was cultured on a high-density polyethylene (HDPE) coupon, which is a main material of food processing facilities. Hyperspectral fluorescence images were acquired from 420 to 730 nm and analyzed by a single wavelength method and machine learning techniques to determine whether an E. coli culture was present. The prediction accuracy of a biofilm by the single wavelength method was 84.69%. The prediction accuracy by the machine learning techniques were 87.49, 91.16, 86.61, and 86.80% for decision tree (DT), k-nearest neighbor (k-NN), linear discriminant analysis (LDA), and partial least squares-discriminant analysis (PLS-DA), respectively. This result shows the possibility of using machine learning techniques, especially the k-NN model, to effectively detect bacterial pathogens and confirm food poisoning through hyperspectral images.

Forecasting Sow's Productivity using the Machine Learning Models (머신러닝을 활용한 모돈의 생산성 예측모델)

  • Lee, Min-Soo;Choe, Young-Chan
    • Journal of Agricultural Extension & Community Development
    • /
    • v.16 no.4
    • /
    • pp.939-965
    • /
    • 2009
  • The Machine Learning has been identified as a promising approach to knowledge-based system development. This study aims to examine the ability of machine learning techniques for farmer's decision making and to develop the reference model for using pig farm data. We compared five machine learning techniques: logistic regression, decision tree, artificial neural network, k-nearest neighbor, and ensemble. All models are well performed to predict the sow's productivity in all parity, showing over 87.6% predictability. The model predictability of total litter size are highest at 91.3% in third parity and decreasing as parity increases. The ensemble is well performed to predict the sow's productivity. The neural network and logistic regression is excellent classifier for all parity. The decision tree and the k-nearest neighbor was not good classifier for all parity. Performance of models varies over models used, showing up to 104% difference in lift values. Artificial Neural network and ensemble models have resulted in highest lift values implying best performance among models.

  • PDF

Comparison of Machine Learning Techniques for Cyberbullying Detection on YouTube Arabic Comments

  • Alsubait, Tahani;Alfageh, Danyah
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.1
    • /
    • pp.1-5
    • /
    • 2021
  • Cyberbullying is a problem that is faced in many cultures. Due to their popularity and interactive nature, social media platforms have also been affected by cyberbullying. Social media users from Arab countries have also reported being a target of cyberbullying. Machine learning techniques have been a prominent approach used by scientists to detect and battle this phenomenon. In this paper, we compare different machine learning algorithms for their performance in cyberbullying detection based on a labeled dataset of Arabic YouTube comments. Three machine learning models are considered, namely: Multinomial Naïve Bayes (MNB), Complement Naïve Bayes (CNB), and Linear Regression (LR). In addition, we experiment with two feature extraction methods, namely: Count Vectorizer and Tfidf Vectorizer. Our results show that, using count vectroizer feature extraction, the Logistic Regression model can outperform both Multinomial and Complement Naïve Bayes models. However, when using Tfidf vectorizer feature extraction, Complement Naive Bayes model can outperform the other two models.

Comparative Application of Various Machine Learning Techniques for Lithology Predictions (다양한 기계학습 기법의 암상예측 적용성 비교 분석)

  • Jeong, Jina;Park, Eungyu
    • Journal of Soil and Groundwater Environment
    • /
    • v.21 no.3
    • /
    • pp.21-34
    • /
    • 2016
  • In the present study, we applied various machine learning techniques comparatively for prediction of subsurface structures based on multiple secondary information (i.e., well-logging data). The machine learning techniques employed in this study are Naive Bayes classification (NB), artificial neural network (ANN), support vector machine (SVM) and logistic regression classification (LR). As an alternative model, conventional hidden Markov model (HMM) and modified hidden Markov model (mHMM) are used where additional information of transition probability between primary properties is incorporated in the predictions. In the comparisons, 16 boreholes consisted with four different materials are synthesized, which show directional non-stationarity in upward and downward directions. Futhermore, two types of the secondary information that is statistically related to each material are generated. From the comparative analysis with various case studies, the accuracies of the techniques become degenerated with inclusion of additive errors and small amount of the training data. For HMM predictions, the conventional HMM shows the similar accuracies with the models that does not relies on transition probability. However, the mHMM consistently shows the highest prediction accuracy among the test cases, which can be attributed to the consideration of geological nature in the training of the model.

A Study on Predicting Cryptocurrency Distribution Prices Using Machine Learning Techniques (머신러닝 기법을 활용한 암호화폐 유통 가격 예측 연구)

  • KIM, Han-Min;KIM, Hoik
    • Journal of Distribution Science
    • /
    • v.17 no.11
    • /
    • pp.93-101
    • /
    • 2019
  • Purpose: Blockchain technology suggests ways to solve the problems in the existing industry. Among them, Cryptocurrency system, which is an element of Blockchain technology, is a very important factor for operating Blockchain. While Blockchain cryptocurrency has attracted attention, studies on cryptocurrency prices have been mainly conducted, however previous studies mainly conducted on Bitcoin prices. On the other hand, in the context of the creation and trading of various cryptocurrencies based on the Blockchain system, little research has been done on cryptocurrencies other than Bitcoin. Hence, this study attempts to find variables related to the prices of Dash, Litecoin, and Monero cryptocurrencies using machine learning techniques. We also attempt to find differences in the variables related to the prices for each cryptocurrencies and to examine machine learning techniques that can provide better performance. Research design, data, and methodology: This study performed Dash, Litecoin, and Monero price prediction analysis of cryptocurrency using Blockchain information and machine learning techniques. We employed number of transactions in Blockchain, amount of generated cryptocurrency, transaction fees, number of activity accounts in Blockchain, Block creation difficulty, block size, umber of created blocks as independent variables. This study tried to ensure the reliability of the analysis results through 10-fold cross validation. Blockchain information was hierarchically added for price prediction, and the analysis result was measured as RMSE and MAPE. Results: The analysis shows that the prices of Dash, Litecoin and Monero cryptocurrency are related to Blockchain information. Also, we found that different Blockchain information improves the analysis results for each cryptocurrency. In addition, this study found that the neural network machine learning technique provides better analysis results than support-vector machine in predicting cryptocurrency prices. Conclusion: This study concludes that the information of Blockchain should be considered for the prediction of the price of Dash, Litecoin, and Monero cryptocurrency. It also suggests that Blockchain information related to the price of cryptocurrency differs depending on the type of cryptocurrency. We suggest that future research on various types of cryptocurrencies is needed. The findings of this study can provide a theoretical basis for future cryptocurrency research in distribution management.

A Pragmatic Framework for Predicting Change Prone Files Using Machine Learning Techniques with Java-based Software

  • Loveleen Kaur;Ashutosh Mishra
    • Asia pacific journal of information systems
    • /
    • v.30 no.3
    • /
    • pp.457-496
    • /
    • 2020
  • This study aims to extensively analyze the performance of various Machine Learning (ML) techniques for predicting version to version change-proneness of source code Java files. 17 object-oriented metrics have been utilized in this work for predicting change-prone files using 31 ML techniques and the framework proposed has been implemented on various consecutive releases of two Java-based software projects available as plug-ins. 10-fold and inter-release validation methods have been employed to validate the models and statistical tests provide supplementary information regarding the reliability and significance of the results. The results of experiments conducted in this article indicate that the ML techniques perform differently under the different validation settings. The results also confirm the proficiency of the selected ML techniques in lieu of developing change-proneness prediction models which could aid the software engineers in the initial stages of software development for classifying change-prone Java files of a software, in turn aiding in the trend estimation of change-proneness over future versions.

A Study on a Wearable Smart Airbag Using Machine Learning Algorithm (머신러닝 알고리즘을 사용한 웨어러블 스마트 에어백에 관한 연구)

  • Kim, Hyun Sik;Baek, Won Cheol;Baek, Woon Kyung
    • Journal of the Korean Society of Safety
    • /
    • v.35 no.2
    • /
    • pp.94-99
    • /
    • 2020
  • Bikers can be subjected to injuries from unexpected accidents even if they wear basic helmets. A properly designed airbag can efficiently protect the critical areas of the human body. This study introduces a wearable smart airbag system using machine learning techniques to protect human neck and shoulders. When a bicycle accident happens, a microprocessor analyzes the biker's motion data to recognize if it is a critical accident by comparing with accident classification models. These models are trained by a variety of possible accidents through machine learning techniques, like k-means and SVM methods. When the microprocessor decides it is a critical accident, it issues an actuation signal for the gas inflater to inflate the airbag. A protype of the wearable smart airbag with the machine learning techniques is developed and its performance is tested using a human dummy mounted on a moving cart.

An improvement of LEM2 algorithm

  • The, Anh-Pham;Lee, Young-Koo;Lee, Sung-Young
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2011.06a
    • /
    • pp.302-304
    • /
    • 2011
  • Rule based machine learning techniques are very important in our real world now. We can list out some important application which we can apply rule based machine learning algorithm such as medical data mining, business transaction mining. The different between rules based machine learning and model based machine learning is that model based machine learning out put some models, which often are very difficult to understand by expert or human. But rule based techniques output are the rule sets which is in IF THEN format. For example IF blood pressure=90 and kidney problem=yes then take this drug. By this way, medical doctor can easy modify and update some usable rule. This is the scenario in medical decision support system. Currently, Rough set is one of the most famous theory which can be used for produce the rule. LEM2 is the algorithm use this theory and can produce the small set of rule on the database. In this paper, we present an improvement of LEM2 algorithm which incorporates the variable precision techniques.