• Title/Summary/Keyword: machine learning

A Study on the Idol Survivability Prediction Using Machine Learning Techniques : Focused on the Industrial Competitiveness (머신러닝 기법을 활용한 아이돌 생존 가능성 예측 연구 : 산업 경쟁력 증진을 중심으로)

  • Kim, Seul-ah;Ahn, Ju Hyuk;Cui, Fuquan
    • The Journal of the Korea Contents Association / v.20 no.5 / pp.291-302 / 2020
  • The Korean popular music industry, which is led by "idol groups", has built fandoms all over the world. As a result, idol groups have become not only artists but also some of the most influential figures in the Korean economy. A global idol group with a strong fandom can earn more than a trillion dollars by drawing its global fans' interest to Korea. It is therefore important to manage idol groups toward success. This study predicts whether idols survive at a given point after their debut using an ANN, a decision tree, and a random forest. We set those points at three and eight years after debut, which correspond to the break-even year and the year after the average contract renewal, respectively. In addition, this study explains which features matter most to survival by using feature importance and logistic regression. In conclusion, features such as the number of competing idols, the number of members at debut, and the number of genres are significant. These results shed light on how to manage K-Pop idols efficiently to improve industrial competitiveness.

k-NN Query Optimization Scheme Based on Machine Learning Using a DNN Model (DNN 모델을 이용한 기계 학습 기반 k-최근접 질의 처리 최적화 기법)

  • We, Ji-Won;Choi, Do-Jin;Lee, Hyeon-Byeong;Lim, Jong-Tae;Lim, Hun-Jin;Bok, Kyoung-Soo;Yoo, Jae-Soo
    • The Journal of the Korea Contents Association / v.20 no.10 / pp.715-725 / 2020
  • In this paper, we propose an optimization scheme for the k-Nearest Neighbor (k-NN) query, which finds the k objects closest to the query among high-dimensional feature vectors. The k-NN query is converted into a range query and processed, using a range that is likely to contain k data objects. The proposed scheme uses a DNN model to derive an optimal range that reduces processing cost and accelerates search. The overall system consists of an online module and an offline module. In the online module, a query is processed when it is issued by a client. In the offline module, an optimal range for the query is derived with the DNN model and delivered to the online module. Various performance evaluations show that the proposed scheme outperforms existing schemes.
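
The conversion of a k-NN query into a range query can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names `range_query` and `knn_via_range`, the `growth` factor, and the linear scan are all hypothetical, and `initial_radius` simply stands in for the range that the abstract's DNN model would predict.

```python
import math


def range_query(points, query, radius):
    """Return (distance, point) pairs for all points within `radius` of `query`."""
    hits = []
    for p in points:
        d = math.dist(p, query)
        if d <= radius:
            hits.append((d, p))
    return hits


def knn_via_range(points, query, k, initial_radius, growth=2.0):
    """Answer a k-NN query by issuing range queries with a growing radius.

    `initial_radius` is a placeholder for a predicted range (assumption);
    a good prediction means the loop finishes on the first range query.
    """
    if not 0 < k <= len(points):
        raise ValueError("k must be between 1 and the number of points")
    if initial_radius <= 0 or growth <= 1:
        raise ValueError("initial_radius must be > 0 and growth must be > 1")
    radius = initial_radius
    while True:
        hits = range_query(points, query, radius)
        if len(hits) >= k:
            # Every point outside the radius is farther than every hit,
            # so the k closest hits are the true k nearest neighbors.
            hits.sort(key=lambda pair: pair[0])
            return [p for _, p in hits[:k]]
        radius *= growth  # range was too small: enlarge and retry


points = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0), (5.0, 5.0)]
print(knn_via_range(points, (0.0, 0.0), k=2, initial_radius=0.5))
```

A range that is too small costs extra range queries, and one that is too large returns more candidates than needed, which is why the choice of range drives the processing cost.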

Bug Report Quality Prediction for Enhancing Performance of Information Retrieval-based Bug Localization (정보검색기반 결함위치식별 기술의 성능 향상을 위한 버그리포트 품질 예측)

  • Kim, Misoo;Ahn, June;Lee, Eunseok
    • Journal of KIISE / v.44 no.8 / pp.832-841 / 2017
  • Bug reports are essential documents for developers to localize and fix bugs. These reports contain information about software bugs or failures that occur during the software operation and maintenance phases. Information Retrieval-based Bug Localization (IR-BL) techniques have been proposed to reduce the time and cost developers spend resolving bug reports. However, if a low-quality bug report is submitted, the performance of such techniques can degrade significantly. To address this problem, we propose a quality prediction method that identifies low-quality bug reports. The method defines a Quality property of a Bug report as a Query (Q4BaQ) and predicts the quality of bug reports using machine learning. We evaluated the proposed method on three open-source projects. The experimental results show that the proposed method achieved an average F-measure of 87.31% and outperformed previous prediction techniques by up to 6.62% in F-measure. Finally, combining the proposed method with a traditional automatic query reformulation method improved MRR and MAP by 0.9% and 1.3%, respectively.

An Effective Two-Step Model for Speech Act Analysis in a Schedule Management Domain (일정 관리 영역에서의 화행 분석을 위한 효과적인 2단계 모델)

  • Lee, Hyun-Jung;Kim, Hark-Soo;Seo, Jung-Yun
    • Korean Journal of Cognitive Science / v.19 no.3 / pp.297-310 / 2008
  • Since speech acts imply speakers' intentions, determining speakers' speech acts is essential for implementing an intelligent dialogue system. We propose a two-step model for effectively determining speakers' speech acts. In the first step, the proposed model obtains speech act candidates from two sources: a neural network model based on machine learning and a predictivity model based on statistics. In the second step, the proposed model uses the candidates returned by the predictivity model to filter the candidates returned by the neural network model. It then selects the speech act with the maximum output value among the remaining candidates. In an experiment on a schedule management domain, the proposed two-step method showed better precision than previous methods that use only a machine learning model or only a probability model.
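
The second step of the two-step model can be sketched as a simple filter followed by an argmax. This is only one reading of the abstract: the sketch assumes that a neural network candidate is kept when it also appears among the statistical model's candidates, and that the neural network's best candidate is used when nothing survives the filter. The function name `select_speech_act`, the label names, and the scores are hypothetical.

```python
def select_speech_act(nn_scores, stat_candidates):
    """Pick a speech act from neural network scores, filtered by a second model.

    nn_scores:       dict mapping speech act label -> neural network output value
    stat_candidates: set of labels returned by the statistical model
    """
    # Step 2a: keep only the neural network candidates that the statistical
    # model also proposed (assumed interpretation of the filtering step).
    remaining = {act: s for act, s in nn_scores.items() if act in stat_candidates}
    # Assumed fallback: if the filter removes everything, use all NN candidates.
    pool = remaining or nn_scores
    # Step 2b: choose the remaining candidate with the maximum output value.
    return max(pool, key=pool.get)


nn_scores = {"request": 0.6, "inform": 0.3, "ask-ref": 0.1}
print(select_speech_act(nn_scores, {"inform", "ask-ref"}))
```

In this example the filter removes the top neural network candidate, so the combined decision differs from what the neural network alone would return.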

Machine-Learning Based on Relevance Feedback: A Powerful Engine to Enhance the Performance of SDI System (기계학습 기반 피드백 과정을 통한 SDI 시스템의 성능향상에 관한 연구)

  • Noh, Young-Hee
    • Journal of the Korean Society for Information Management / v.21 no.4 s.54 / pp.133-152 / 2004
  • As the Internet rapidly increases the availability of information, research on SDI services, which provide users with relevant documents in a timely manner, has advanced. However, practical use of these services has remained low. This study analyzes the reasons for this and develops relevance-feedback-based SDI systems to improve on the performance of existing SDI systems. Three experimental systems were developed: an SDI system based on minimum user-intervention feedback, one based on fully automated feedback, and one based on maximum user-intervention feedback. A fourth, traditional SDI system was also studied as a baseline for measuring the performance improvement of the three new systems. The system based on maximum user-intervention feedback showed the greatest performance improvement, followed in order by the fully automated feedback system, the minimum user-intervention feedback system, and the traditional SDI system. Feedback-based systems showed greater performance improvement as they went through more feedback rounds.

Fuzzy Decision Tree Induction to Obliquely Partitioning a Feature Space (특징공간을 사선 분할하는 퍼지 결정트리 유도)

  • Lee, Woo-Hang;Lee, Keon-Myung
    • Journal of KIISE:Software and Applications / v.29 no.3 / pp.156-166 / 2002
  • Decision tree induction is a useful machine learning approach for extracting classification rules from a set of feature-based examples. Depending on how they partition the feature space, decision trees are categorized as univariate or multivariate. Because of observation error, uncertainty, subjective judgment, and similar factors, real-world data are prone to contain errors in their feature values. To make decision trees robust against such errors, there have been various attempts to incorporate fuzzy techniques into decision tree construction. Several studies have incorporated fuzzy techniques into univariate decision trees. For multivariate decision trees, however, little such research has been done. This paper proposes a fuzzy decision tree induction method that builds fuzzy multivariate decision trees, named fuzzy oblique decision trees. To show the effectiveness of the proposed method, it also presents some experimental results.

Evaluation of Interpretability for Generated Rules from ANFIS (ANFIS에서 생성된 규칙의 해석용이성 평가)

  • Song, Hee-Seok;Kim, Jae-Kyeong
    • Journal of Intelligence and Information Systems / v.15 no.4 / pp.123-140 / 2009
  • A fuzzy neural network is an integrated model of an artificial neural network and a fuzzy system, and it has been applied successfully in control and forecasting. Recently, ANFIS (Adaptive Network-based Fuzzy Inference System) has attracted wide attention among fuzzy neural network models because of its outstanding control performance and forecasting accuracy. ANFIS can refine its fuzzy rules interactively with a human expert. In particular, when an initial rule structure generated by a human expert is used for machine learning, it is highly probable that a global optimum is reached and that the time to convergence is shortened. We propose metrics for evaluating the interpretability of generated rules as a means of acquiring domain knowledge, and we compare the interpretability of ANFIS fuzzy rules with that of C5.0 classification rules. The proposed metrics can also be used to evaluate the rule-generation capability of various machine learning methods.

Middle Ear Disease Automatic Decision Scheme using HoG Descriptor (HoG 기술자를 이용한 중이염 자동 판별 방법)

  • Jung, Na-ra;Song, Jae-wook;Choi, Ho-Hyoung;Kang, Hyun-soo
    • Journal of the Korea Institute of Information and Communication Engineering / v.20 no.3 / pp.621-629 / 2016
  • This paper presents a method for detecting middle ear disease, which occurs in both children and adults. In the proposed method, features are extracted from images of diseased and normal middle ears using the HoG (histogram of oriented gradients) descriptor, and the extracted features are learned by an SVM (support vector machine) classifier. To obtain an input vector for the SVM, an input image is resized to a predefined size and then partitioned into 16 blocks, each of which is partitioned into 4 sub-blocks (cells). Applying HoG with 9 bins then gives a feature vector with 576 components, which is used for SVM training and classification. Input images are classified by the SVM classifier based on the model learned from these features. Experimental results show that the proposed method achieves a precision of over 90%.
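
The 576-component feature vector follows from the layout given in the abstract: 16 blocks × 4 cells × 9 bins = 576. The sketch below checks that arithmetic and shows one way a single cell's histogram could be computed. The function names, the use of unsigned gradients over 0 to 180 degrees, the central-difference gradient, and the magnitude-weighted hard binning are all assumptions for illustration; the abstract does not specify them.

```python
import math


def hog_vector_length(blocks=16, cells_per_block=4, bins=9):
    """Length of the concatenated HoG feature vector for the stated layout."""
    return blocks * cells_per_block * bins


def cell_histogram(cell, bins=9):
    """Orientation histogram of one cell, given as a 2-D list of intensities.

    Assumptions: unsigned gradients in [0, 180) degrees, central differences
    on interior pixels only, and each pixel votes with its gradient magnitude.
    """
    hist = [0.0] * bins
    bin_width = 180.0 / bins
    for y in range(1, len(cell) - 1):
        for x in range(1, len(cell[0]) - 1):
            gx = cell[y][x + 1] - cell[y][x - 1]
            gy = cell[y + 1][x] - cell[y - 1][x]
            magnitude = math.hypot(gx, gy)
            angle = math.degrees(math.atan2(gy, gx)) % 180.0
            # Clamp guards against a rounding result of exactly 180.0.
            index = min(int(angle / bin_width), bins - 1)
            hist[index] += magnitude
    return hist


print(hog_vector_length())
# A cell whose intensity increases left to right has a purely horizontal gradient.
print(cell_histogram([[0, 1, 2], [0, 1, 2], [0, 1, 2]]))
```

Concatenating one such 9-bin histogram per cell, over all 64 cells, would give the vector passed to the classifier.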

Photovoltaic Generation Forecasting Using Weather Forecast and Predictive Sunshine and Radiation (일기 예보와 예측 일사 및 일조를 이용한 태양광 발전 예측)

  • Shin, Dong-Ha;Park, Jun-Ho;Kim, Chang-Bok
    • Journal of Advanced Navigation Technology / v.21 no.6 / pp.643-650 / 2017
  • Photovoltaic generation, which draws on an unlimited energy source, is highly intermittent because it depends on the weather. Accurate generation forecasts are therefore needed to reduce the uncertainty of photovoltaic generation and improve its economics. The Meteorological Agency forecasts weather factors three days ahead, but it does not forecast sunshine and solar radiation, the factors most strongly correlated with photovoltaic generation. In this study, we predict sunshine and solar radiation from the weather, precipitation, wind direction, wind speed, humidity, and cloudiness that the Meteorological Agency forecasts three days ahead. We then propose a photovoltaic generation forecasting model that uses the predicted solar radiation and sunshine. The proposed model showed better results on error indexes such as MAE, RMSE, and MAPE than a model that predicts photovoltaic generation without radiation and sunshine. In addition, the DNN showed lower error indexes than the SVM, which is a type of machine learning model.
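
The three error indexes named in the abstract have standard definitions, sketched below. The sample values are made up for illustration and are not results from the paper; the function names are likewise only illustrative. Note that MAPE is undefined when an actual value is zero, which matters for photovoltaic output at night.

```python
import math


def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean squared error."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))


def mape(actual, predicted):
    """Mean absolute percentage error, in percent. Requires nonzero actual values."""
    return 100.0 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual)


# Illustrative values only (not data from the paper).
actual = [100.0, 200.0, 400.0]
predicted = [110.0, 180.0, 400.0]
print(mae(actual, predicted), rmse(actual, predicted), mape(actual, predicted))
```

RMSE penalizes large individual errors more heavily than MAE, while MAPE expresses error relative to the size of each actual value.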

A Study on Object Classification Using IR-UWB (IR-UWB를 이용한 물체 분류에 관한 연구)

  • Gam, Ji-Hyeon;Jeong, Jae-Hoon;Byun, Gi-Sig;Kim, Gwan-Hyung
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2018.05a / pp.88-90 / 2018
  • There are many studies on IR-UWB radar, most of which use it to count people or to measure the distance to a person. In this paper, by contrast, we use IR-UWB radar to distinguish objects. The radar is operated with the object positioned at a certain distance, and the object is classified by the size and shape of the wave it reflects. To distinguish objects using only the size and shape of these waveforms, an SVM (Support Vector Machine) was trained on the waveform shape and size. We show that objects can be identified from the size and shape of the waveform received by the IR-UWB radar through SVM pattern learning.
