• Title/Summary/Keyword: k-means clustering algorithm

Sequence-based Similar Music Retrieval Scheme (시퀀스 기반의 유사 음악 검색 기법)

  • Jun, Sang-Hoon;Hwang, Een-Jun
    • Journal of IKEEE / v.13 no.2 / pp.167-174 / 2009
  • Music evokes human emotions or creates moods through various low-level musical features. A typical music clip contains one or more moods, and this can serve as an important criterion for determining the similarity between music clips. In this paper, we propose a new music retrieval scheme based on the mood change patterns of music clips. For this, we first divide music clips into segments based on low-level musical features. Then, we apply the K-means clustering algorithm to group the segments into clusters with similar features. By assigning a unique mood symbol to each cluster, we can represent each music clip as a sequence of mood symbols. Finally, to estimate the similarity of music clips, we measure the similarity of their mood sequences using the Longest Common Subsequence (LCS) algorithm. To evaluate the performance of our scheme, we carried out various experiments and collected user evaluations. We report some of the results.
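The final step of this abstract, comparing mood-symbol sequences with LCS, can be sketched as follows. This is a minimal illustration, not the authors' implementation: the symbol strings are invented, and normalizing by the longer sequence length is an assumption, since the abstract does not say how the LCS length is turned into a similarity score.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two sequences."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def mood_similarity(seq_a, seq_b):
    """LCS length divided by the longer sequence length (assumed normalization)."""
    if not seq_a or not seq_b:
        return 0.0
    return lcs_length(seq_a, seq_b) / max(len(seq_a), len(seq_b))


# Hypothetical clips: each letter is the mood symbol of one segment's cluster.
clip_a = "AABCCD"
clip_b = "ABBCD"
print(lcs_length(clip_a, clip_b))       # common subsequence "ABCD"
print(mood_similarity(clip_a, clip_b))
```

Because LCS tolerates insertions and deletions, two clips that pass through the same moods in the same order score highly even when their segment counts differ.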

Comparative Analysis for Clustering Based Optimal Vehicle Routes Planning (클러스터링 기반의 최적 차량 운행 계획 수립을 위한 비교연구)

  • Kim, Jae-Won;Shin, KwangSup
    • The Journal of Bigdata / v.5 no.1 / pp.155-180 / 2020
  • Assigning vehicles and designing optimal routes for each vehicle play the most important role in enhancing the logistics service level. In solving this problem, various cost factors, such as the number of vehicles, vehicle capacity, and total travelling distance, should be considered at the same time. Although most logistics service providers have introduced a Transportation Management System (TMS), such systems are limited in that they cannot consider practical constraints. To make a TMS solution applicable, experts must revise it based on their own experience and intuition. Unlike previous research, which has focused on minimizing total cost, this research proposes a methodology that enhances the efficiency and fairness of asset utilization simultaneously. First, the Cluster-First Route-Second (CFRS) approach is adopted. Based on customer locations, customers are grouped into clusters using four clustering algorithms (K-Means, K-Medoids, DBSCAN, and model-based clustering) and a procedural approach, the Fisher & Jaikumar algorithm. After clustering, the optimal vehicle routes within each cluster are developed. The numerical experiments suggest that the proposed CFRS-based approach may guarantee better performance in terms of total travelling time and distance. At the same time, based on the variance among vehicles in travelling distance and in the number of customers visited, the proposed approach can also guarantee better performance in assigning tasks in terms of fairness.
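A rough sketch of the Cluster-First Route-Second idea, under several assumptions that are not in the abstract: one vehicle per cluster, K-Means with a deterministic farthest-first seeding, a nearest-neighbor heuristic standing in for the within-cluster route optimization, and toy coordinates. The variance of route lengths is printed as one possible fairness indicator.

```python
import math
import statistics


def farthest_first_centers(points, k):
    """Deterministic seeding (an assumption): start at points[0], then
    repeatedly add the point farthest from the centers chosen so far."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    return centers


def kmeans(points, k, max_iters=100):
    """Plain Lloyd iterations; returns one cluster label per point."""
    centers = farthest_first_centers(points, k)
    labels = []
    for _ in range(max_iters):
        labels = [min(range(k), key=lambda c: math.dist(p, centers[c])) for p in points]
        new_centers = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            # Keep the old center if a cluster happens to be empty.
            new_centers.append(
                tuple(sum(axis) / len(members) for axis in zip(*members)) if members else centers[c]
            )
        if new_centers == centers:
            break
        centers = new_centers
    return labels


def nearest_neighbor_route(depot, stops):
    """Greedy route: always drive to the closest unvisited stop."""
    route, current, remaining = [], depot, list(stops)
    while remaining:
        nxt = min(remaining, key=lambda s: math.dist(current, s))
        remaining.remove(nxt)
        route.append(nxt)
        current = nxt
    return route


def route_length(depot, route):
    """Total distance of depot -> stops in order -> depot."""
    path = [depot, *route, depot]
    return sum(math.dist(a, b) for a, b in zip(path, path[1:]))


def cluster_first_route_second(depot, customers, n_vehicles):
    """Cluster customers first, then build one route per cluster."""
    labels = kmeans(customers, n_vehicles)
    routes = []
    for v in range(n_vehicles):
        stops = [c for c, lab in zip(customers, labels) if lab == v]
        routes.append(nearest_neighbor_route(depot, stops))
    return routes


# Toy data (hypothetical): a depot and two spatial groups of customers.
depot = (5.0, 5.0)
customers = [(1.0, 1.0), (1.0, 2.0), (2.0, 1.0), (9.0, 9.0), (9.0, 8.0), (8.0, 9.0)]
routes = cluster_first_route_second(depot, customers, n_vehicles=2)
lengths = [route_length(depot, r) for r in routes]
print(routes)
print("distance variance across vehicles:", statistics.pvariance(lengths))
```

Swapping `kmeans` for another clustering routine leaves the routing step unchanged, which is what makes a comparison across clustering algorithms straightforward in this design.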

Study on Hand Gestures Recognition Algorithm of Millimeter Wave (밀리미터파의 손동작 인식 알고리즘에 관한 연구)

  • Nam, Myung Woo;Hong, Soon Kwan
    • Journal of the Korea Academia-Industrial cooperation Society / v.21 no.7 / pp.685-691 / 2020
  • In this study, an algorithm that recognizes numbers from 0 to 9 was developed using the data obtained after tracking hand movements using the echo signal of a millimeter-wave radar sensor at 77 GHz. The echo signals obtained from the radar sensor by detecting the motion of a hand gesture revealed a cluster of irregular dots due to the difference in scattering cross-sectional area. A valid center point was obtained from them by applying a K-Means algorithm using 3D coordinate values. In addition, the obtained center points were connected to produce a numeric image. The recognition rate was compared by inputting the obtained image and an image similar to human handwriting by applying the smoothing technique to a CNN (Convolutional Neural Network) model trained with MNIST (Modified National Institute of Standards and Technology database). The experiment was conducted in two ways. First, in the recognition experiments using images with and without smoothing, average recognition rates of 77.0% and 81.0% were obtained, respectively. In the experiment of the CNN model with augmentation of learning data, a recognition rate of 97.5% and 99.0% on average was obtained in the recognition experiment using the image with and without smoothing technique, respectively. This study can be applied to various non-contact recognition technologies using radar sensors.

Design of Classifier for Sorting of Black Plastics by Type Using Intelligent Algorithm (지능형 알고리즘을 이용한 재질별 검정색 플라스틱 분류기 설계)

  • Park, Sang Beom;Roh, Seok Beom;Oh, Sung Kwun;Park, Eun Kyu;Choi, Woo Zin
    • Resources Recycling / v.26 no.2 / pp.46-55 / 2017
  • In this study, a design methodology for Radial Basis Function Neural Networks (RBFNNs) is developed with the aid of Laser Induced Breakdown Spectroscopy and applied to a practical plastics sorting system. To identify black plastics such as ABS, PP, and PS, an RBFNN classifier, a kind of intelligent algorithm, is designed. The dimensionality of the obtained input variables is reduced using PCA, and the data are divided into several groups using K-means clustering. The entire data set is split into training data and test data at a ratio of 4:1. The 5-fold cross validation method is used to evaluate the performance and reliability of the proposed classifier. With the number of input variables and the number of clusters both equal to 5, the proposed classifier achieves a classification performance of 96.78%. The proposed classifier also showed superior classification performance compared to other classifiers.
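The 4:1 split and 5-fold cross validation mentioned above fit together naturally: each of the five folds holds out one fifth of the data for testing and trains on the remaining four fifths. The sketch below illustrates only that evaluation loop. The nearest-centroid classifier is a simple stand-in for the RBFNN, and the two-feature samples are invented toy data, so none of this reproduces the reported 96.78%.

```python
import math


def k_fold_indices(n_samples, n_folds=5):
    """Yield (train_idx, test_idx) pairs over contiguous folds (no shuffling)."""
    indices = list(range(n_samples))
    base, extra = divmod(n_samples, n_folds)
    start = 0
    for fold in range(n_folds):
        size = base + (1 if fold < extra else 0)
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, test_idx
        start += size


def fit_nearest_centroid(samples, labels):
    """Stand-in classifier: store the mean feature vector of each class."""
    grouped = {}
    for x, label in zip(samples, labels):
        grouped.setdefault(label, []).append(x)
    return {
        label: tuple(sum(axis) / len(rows) for axis in zip(*rows))
        for label, rows in grouped.items()
    }


def predict_nearest_centroid(model, x):
    """Return the class whose centroid is closest to x."""
    return min(model, key=lambda label: math.dist(model[label], x))


def cross_validate(samples, labels, n_folds=5):
    """Mean test accuracy over the folds."""
    accuracies = []
    for train_idx, test_idx in k_fold_indices(len(samples), n_folds):
        model = fit_nearest_centroid(
            [samples[i] for i in train_idx], [labels[i] for i in train_idx]
        )
        hits = sum(predict_nearest_centroid(model, samples[i]) == labels[i] for i in test_idx)
        accuracies.append(hits / len(test_idx))
    return sum(accuracies) / len(accuracies)


# Toy data (hypothetical): classes interleaved so every fold contains both.
samples = [(0.1 * i, 0.0) if i % 2 == 0 else (5.0 + 0.1 * i, 5.0) for i in range(10)]
labels = ["ABS" if i % 2 == 0 else "PP" for i in range(10)]
print(cross_validate(samples, labels))
```

With real data the samples would normally be shuffled (or stratified by class) before folding; the contiguous folds here only work because the toy classes are interleaved.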

Property-based Hierarchical Clustering of Peers using Mobile Agent for Unstructured P2P Systems (비구조화 P2P 시스템에서 이동에이전트를 이용한 Peer의 속성기반 계층적 클러스터링)

  • Salvo, Michael Angel G.;Mateo, Romeo Mark A.;Lee, Jae-Wan
    • Journal of Internet Computing and Services / v.10 no.4 / pp.189-198 / 2009
  • Unstructured peer-to-peer systems are most commonly used in today's internet. But file placement is random in these systems and no correlation exists between peers and their contents. There is no guarantee that flooding queries will find the desired data. In this paper, we propose to cluster nodes in unstructured P2P systems using the agglomerative hierarchical clustering algorithm to improve the search method. We compared the delay time of clustering the nodes between our proposed algorithm and the k-means clustering algorithm. We also simulated the delay time of locating data in a network topology and recorded the overhead of the system using our proposed algorithm, k-means clustering, and without clustering. Simulation results show that the delay time of our proposed algorithm is shorter compared to other methods and resource overhead is also reduced.
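Agglomerative hierarchical clustering, as named in this abstract, starts with every node in its own cluster and repeatedly merges the two closest clusters. The sketch below is a naive version under assumptions the abstract does not state: peers are described by numeric property vectors, distance is Euclidean, and single linkage (closest pair of members) decides which clusters merge. It does not model the mobile agents or the network simulation.

```python
import math


def agglomerative(points, n_clusters):
    """Naive single-linkage agglomerative clustering.

    Returns clusters as lists of indices into `points`.
    """
    clusters = [[i] for i in range(len(points))]

    def linkage(a, b):
        # Single linkage: distance between the closest pair of members.
        return min(math.dist(points[i], points[j]) for i in a for j in b)

    while len(clusters) > n_clusters:
        # Find the pair of clusters with the smallest linkage distance.
        _, i, j = min(
            (linkage(clusters[i], clusters[j]), i, j)
            for i in range(len(clusters))
            for j in range(i + 1, len(clusters))
        )
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters


# Hypothetical peer properties, e.g. (bandwidth score, shared-content score).
peers = [(1.0, 1.0), (1.2, 0.9), (5.0, 5.0), (5.3, 4.8), (9.0, 1.0)]
print(agglomerative(peers, n_clusters=3))
```

Unlike k-means, this procedure needs no initial centers and yields a full merge hierarchy if run down to one cluster, at the cost of recomputing pairwise linkages on every merge in this naive form.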

Analysis of Research Trends Related to drug Repositioning Based on Machine Learning (머신러닝 기반의 신약 재창출 관련 연구 동향 분석)

  • So Yeon Yoo;Gyoo Gun Lim
    • Information Systems Review / v.24 no.1 / pp.21-37 / 2022
  • Drug repositioning, one of the methods of developing new drugs, is a useful way to discover new indications by allowing drugs that have already been approved for use in people to be used for other purposes. Recently, with the development of machine learning technology, cases of analyzing vast amounts of biological information and using it to develop new drugs are increasing. Applying machine learning technology to drug repositioning will help quickly find effective treatments. Currently, the world is having a difficult time due to COVID-19, a new severe acute respiratory disease caused by a coronavirus. Drug repositioning that repurposes drugs that have already been clinically approved could offer an alternative way to find therapeutics for COVID-19 patients. This study examines research trends in the field of drug repositioning using machine learning techniques. A total of 4,821 papers were collected from PubMed with the keyword 'Drug Repositioning' using web scraping. After data preprocessing, frequency analysis, LDA-based topic modeling, random forest classification analysis, and prediction performance evaluation were performed on 4,419 papers. Associated words were analyzed based on the Word2vec model; after dimensionality reduction with PCA, K-Means clustering was applied to generate labels, and the structured organization of the literature was then visualized using the t-SNE algorithm. Hierarchical clustering was applied to the LDA results and visualized as a heat map. This study identified the research topics related to drug repositioning and presented a method to derive and visualize meaningful topics from a large amount of literature using machine learning algorithms. The results are expected to serve as basic data for establishing research or development strategies in the field of drug repositioning in the future.

Development of a data analysis system for preventing school violence based on AI unsupervised learning (AI 비지도 학습 기반의 학교폭력 예방 데이터 분석 시스템 개발)

  • Jung, Soyeong;Ma, Youngji;Koo, Dukhoi
    • Journal of The Korean Association of Information Education / v.25 no.5 / pp.741-750 / 2021
  • School violence has long been recognized as a social problem, and various efforts have been made to prevent it. In this study, we propose a system that can prevent school violence by analyzing data on the frequency of conversations between students, friendship and preference to be in the same group. This data was quantified using a Likert scale questionnaire, and also grouped into the appropriate number of clusters using the K-means algorithm. Additionally, the homeroom teacher observed the frequency and nature of conversations between students, and targeted specific individuals or groups for counseling and intervention, with the aim of reducing school violence. Data analysis revealed that the teachers' qualitative observations were consistent with the quantified data based on student questionnaires, and therefore applicable as quantitative data towards the identification and understanding of student relationships within the classroom. The study has potential limitations. The data used is subjective and based on peer evaluations which can be inconsistent as the students may use different criteria to evaluate one another. It is expected that this study will help homeroom teachers in their efforts to prevent school violence by understanding the relationships between students within the classroom.

Analysis of data on prevention of school violence based on AI unsupervised learning (AI 비지도 학습 기반의 학교폭력 예방 데이터 분석)

  • Jung, Soyeong;Ma, Youngji;Koo, Dukhoi
    • Proceedings of the Korean Association of Information Education Conference (한국정보교육학회:학술대회논문집) / 2021.08a / pp.85-91 / 2021
  • School violence has long been recognized as a social problem, and various efforts have been made to prevent it. In this study, we propose a system that can prevent school violence by analyzing data on the frequency of conversations between students, and identify peer relationships. The frequency of conversations between students in the class was quantified using a rating scale questionnaire, and this data was grouped into the appropriate number of clusters using the K-means algorithm. Additionally, the homeroom teacher observed the frequency and nature of conversations between students, and targeted specific individuals or groups for counseling and intervention, with the aim of reducing school violence. Data analysis revealed that the teachers' qualitative observations were consistent with the quantified data based on student questionnaires, and therefore applicable as quantitative data towards the identification and understanding of student relationships within the classroom. The study has potential limitations. The data used is subjective and based on peer evaluations which can be inconsistent as the students may use different criteria to evaluate one another. It is expected that this study will help homeroom teachers in their efforts to prevent school violence by understanding the relationships between students within the classroom.

Short-term Power Load Forecasting using Time Pattern for u-City Application (u-City응용에서의 시간 패턴을 이용한 단기 전력 부하 예측)

  • Park, Seong-Seung;Shon, Ho-Sun;Lee, Dong-Gyu;Ji, Eun-Mi;Kim, Hi-Seok;Ryu, Keun-Ho
    • Journal of Korea Spatial Information System Society / v.11 no.2 / pp.177-181 / 2009
  • Developing u-Public facilities for u-City applications combines state-of-the-art construction with ubiquitous computing; such facilities must flexibly comprise the equipment for a building's basic services, such as air conditioning, heating, lighting, and electrical equipment, to realize a new form of spatial planning and public facilities inside or outside. Accordingly, in this paper we propose a time pattern system for predicting the power system loads of these basic services. To apply the time pattern, we applied the SOM algorithm and the k-means method and then clustered the data by weekday and by time of day, respectively. The performance evaluation of the proposed system showed that the forecasting system performed better with the ARIMA model than with the exponential smoothing method. Such power load forecasting is expected to allow power supply planning according to demand, and system operation, to be performed efficiently.

SWAT Direct Runoff and Baseflow Evaluation using Web-based Flow Clustering EI Estimation System (웹기반의 유량 군집화 EI 평가시스템을 이용한 SWAT 직접유출과 기저유출 평가)

  • Jang, Won Seok;Moon, Jong Pil;Kim, Nam Won;Yoo, Dong Sun;Kum, Dong Hyuk;Kim, Ik Jae;Mun, Yuri;Lim, Kyoung Jae
    • Journal of Korean Society on Water Environment / v.27 no.1 / pp.61-72 / 2011
  • To assess hydrologic and nonpoint source pollutant behaviors in a watershed with the Soil and Water Assessment Tool (SWAT) model, the accuracy of the SWAT model should be evaluated before it is applied to the watershed. When calibrating and validating the hydrological components of the SWAT model, the Nash-Sutcliffe efficiency coefficient (EI) has been widely used. However, the EI value is known to be sensitive to large values within the range of the data. In this study, a Web-based flow clustering EI estimation system using the K-means clustering algorithm was developed and used for SWAT hydrology evaluation. Even though the EI of total streamflow was high, the EI values of the hydrologic components (i.e., direct runoff and baseflow) were not high. Also, when the EI values of flow groups I and II (i.e., the low and high value groups) clustered from direct runoff and baseflow were computed, they were much lower, with negative EI values for some flow group comparisons. The values estimated by the SWAT auto-calibration tool also showed negative EI values for most of flow groups I and II of direct runoff and baseflow, although the EI value of total streamflow was high. The results obtained in this study indicate that the SWAT hydrology component should be calibrated until all four EI values for the flow groups of direct runoff and baseflow are positive, for better accuracy in both direct runoff and baseflow.
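The effect described in this abstract, a high EI over the whole series hiding poor EI within flow groups, can be reproduced with a small sketch. The Nash-Sutcliffe formula is standard: one minus the ratio of the squared simulation error to the variance of the observations about their mean. Everything else here is an assumption for illustration: the flow values are invented, a single series is used rather than separate direct runoff and baseflow series, and the two groups come from a one-dimensional two-cluster k-means seeded at the minimum and maximum observed flow.

```python
def nash_sutcliffe(observed, simulated):
    """EI = 1 - sum((obs - sim)^2) / sum((obs - mean(obs))^2)."""
    mean_obs = sum(observed) / len(observed)
    error = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    spread = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - error / spread


def two_means_1d(values, max_iters=100):
    """1-D k-means with k=2; label 0 is the low group, 1 the high group."""
    low_c, high_c = min(values), max(values)
    if low_c == high_c:
        raise ValueError("need at least two distinct values")
    labels = []
    for _ in range(max_iters):
        labels = [0 if abs(v - low_c) <= abs(v - high_c) else 1 for v in values]
        low = [v for v, lab in zip(values, labels) if lab == 0]
        high = [v for v, lab in zip(values, labels) if lab == 1]
        new_low, new_high = sum(low) / len(low), sum(high) / len(high)
        if (new_low, new_high) == (low_c, high_c):
            break
        low_c, high_c = new_low, new_high
    return labels


def grouped_ei(observed, simulated):
    """EI of the whole series and of the low and high flow groups."""
    labels = two_means_1d(observed)
    result = {"total": nash_sutcliffe(observed, simulated)}
    for group, name in ((0, "group I (low)"), (1, "group II (high)")):
        obs = [o for o, lab in zip(observed, labels) if lab == group]
        sim = [s for s, lab in zip(simulated, labels) if lab == group]
        result[name] = nash_sutcliffe(obs, sim)
    return result


# Toy flows (hypothetical): simulation tracks high flows, misses low flows.
observed = [1.0, 2.0, 1.5, 2.5, 50.0, 60.0, 55.0, 65.0]
simulated = [2.5, 1.0, 3.0, 1.0, 52.0, 58.0, 57.0, 63.0]
for name, value in grouped_ei(observed, simulated).items():
    print(f"{name}: {value:.3f}")
```

In this toy series the total EI is close to 1 because the large spread between low and high flows dominates the denominator, while the low-flow group on its own has a negative EI.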