• Title/Summary/Keyword: K-means++ algorithm

Search Result 1,363, Processing Time 0.031 seconds

Soft sensor design based on PLS with hybrid inner model (내적 조합 모델 PLS를 이용한 소프트 센서 설계)

  • Hong Sun Ju;Han Chong Hun
    • Journal of the Korean Institute of Gas
    • /
    • v.2 no.3
    • /
    • pp.49-53
    • /
    • 1998
  • It takes quite a long time for an analyzer, such as gas chromatography, to measure a bulk property of a system, which prevents on-line measurements. Also, the cost of installation and maintenance is very high. Consequently, some other means is needed for on-line measurements of properties and the development of soft sensors based on process variables like temperature and pressure is of great interest. In the field of gas industry, the development of a soft sensor which makes indirect on-line measurements of gas compositions and flow rate, is in progress. In this paper, we proposed a hybrid inner model PLS which improved the prediction performance by taking into account the data structure, as an empirical modeling algorithm. When applied to a design of a soft sensor of a distillation tower, the hybrid inner model PLS showed better prediction performance than other methods.

  • PDF

A Study on Clustering of Core Competencies to Deploy in and Develop Courseworks for New Digital Technology (카드소팅을 활용한 디지털 신기술 과정 핵심역량 군집화에 관한 연구)

  • Ji-Woon Lee;Ho Lee;Joung-Huem Kwon
    • Journal of Practical Engineering Education
    • /
    • v.14 no.3
    • /
    • pp.565-572
    • /
    • 2022
  • Card sorting is a useful data collection method for understanding users' perceptions of relationships between items. In general, card sorting is an intuitive and cost-effective technique that is very useful for user research and evaluation. In this study, the core competencies of each field were used as competency cards used in the next stage of card sorting for course development, and the clustering results were derived by applying the K-means algorithm to cluster the results. As a result of card sorting, competency clustering for core competencies for each occupation in each field was verified based on Participant-Centric Analysis (PCA). For the number of core competency cards for each occupation, the number of participants who agreed appropriately for clustering and the degree of card similarity were derived compared to the number of sorting participants.

The Analysis of CT Number Rate of Change of Applying The Iterative Metallic Artifact Reduction Algorithm for CT Reconstruction Image (Iterative Metallic Artifact Reduction 알고리즘 적용 CT 재구성영상의 CT Number 변화율 분석)

  • Kim, Hyeonju;Yoon, Joon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.18 no.7
    • /
    • pp.84-91
    • /
    • 2017
  • This study was performed using Somatom Definition Flash (Siemens, Enlarge, Germany) and GE 64-MDCT (Discovery 750 HD, GE HEALTHCARE, Milwaukee, USA.) using high-density medical materials that (are indispensable to?) computed tomography. We analyzed quantitatively the rate of change of the CT number of the CT reconstruction images by means of the IMAR and MAR algorithms using the phantom images acquired after scanning and previously captured raw data images. As a result, it was shown that the IMAR and MAR algorithms provided if ferent phantom images in the case of all medical high-density materials (p <0.05). The black streak artifacts were analyzed using the MAR and IMAR algorithms to determine if they corresponded to stainless steel materials (p>0.05). Also, it was found that the application of the IMAR algorithm affects the attenuation deviation, because there is a change in the image CT number compared to that before. The results suggest that, in the future, after the implant procedure, it would be useful to observe the surgical site and surrounding tissues during follow-up CT scans.

Recognition of Flat Type Signboard using Deep Learning (딥러닝을 이용한 판류형 간판의 인식)

  • Kwon, Sang Il;Kim, Eui Myoung
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.37 no.4
    • /
    • pp.219-231
    • /
    • 2019
  • The specifications of signboards are set for each type of signboards, but the shape and size of the signboard actually installed are not uniform. In addition, because the colors of the signboard are not defined, so various colors are applied to the signboard. Methods for recognizing signboards can be thought of as similar methods of recognizing road signs and license plates, but due to the nature of the signboards, there are limitations in that the signboards can not be recognized in a way similar to road signs and license plates. In this study, we proposed a methodology for recognizing plate-type signboards, which are the main targets of illegal and old signboards, and automatically extracting areas of signboards, using the deep learning-based Faster R-CNN algorithm. The process of recognizing flat type signboards through signboard images captured by using smartphone cameras is divided into two sequences. First, the type of signboard was recognized using deep learning to recognize flat type signboards in various types of signboard images, and the result showed an accuracy of about 71%. Next, when the boundary recognition algorithm for the signboards was applied to recognize the boundary area of the flat type signboard, the boundary of flat type signboard was recognized with an accuracy of 85%.

Personalized insurance product based on similarity (유사도를 활용한 맞춤형 보험 추천 시스템)

  • Kim, Joon-Sung;Cho, A-Ra;Oh, Hayong
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.11
    • /
    • pp.1599-1607
    • /
    • 2022
  • The data mainly used for the model are as follows: the personal information, the information of insurance product, etc. With the data, we suggest three types of models: content-based filtering model, collaborative filtering model and classification models-based model. The content-based filtering model finds the cosine of the angle between the users and items, and recommends items based on the cosine similarity; however, before finding the cosine similarity, we divide into several groups by their features. Segmentation is executed by K-means clustering algorithm and manually operated algorithm. The collaborative filtering model uses interactions that users have with items. The classification models-based model uses decision tree and random forest classifier to recommend items. According to the results of the research, the contents-based filtering model provides the best result. Since the model recommends the item based on the demographic and user features, it indicates that demographic and user features are keys to offer more appropriate items.

A Study on the Method of Producing the 1 km Resolution Seasonal Prediction of Temperature Over South Korea for Boreal Winter Using Genetic Algorithm and Global Elevation Data Based on Remote Sensing (위성고도자료와 유전자 알고리즘을 이용한 남한의 겨울철 기온의 1 km 격자형 계절예측자료 생산 기법 연구)

  • Lee, Joonlee;Ahn, Joong-Bae;Jung, Myung-Pyo;Shim, Kyo-Moon
    • Korean Journal of Remote Sensing
    • /
    • v.33 no.5_2
    • /
    • pp.661-676
    • /
    • 2017
  • This study suggests a new method not only to produce the 1 km-resolution seasonal prediction but also to improve the seasonal prediction skill of temperature over South Korea. This method consists of four stages of experiments. The first stage, EXP1, is a low-resolution seasonal prediction of temperature obtained from Pusan National University Coupled General Circulation Model, and EXP2 is to produce 1 km-resolution seasonal prediction of temperature over South Korea by applying statistical downscaling to the results of EXP1. EXP3 is a seasonal prediction which considers the effect of temperature changes according to the altitude on the result of EXP2. Here, we use altitude information from ASTER GDEM, satellite observation. EXP4 is a bias corrected seasonal prediction using genetic algorithm in EXP3. EXP1 and EXP2 show poorer prediction skill than other experiments because the topographical characteristic of South Korea is not considered at all. Especially, the prediction skills of two experiments are lower at the high altitude observation site. On the other hand, EXP3 and EXP4 applying the high resolution elevation data based on remote sensing have higher prediction skill than other experiments by effectively reflecting the topographical characteristics such as temperature decrease as altitude increases. In addition, EXP4 reduced the systematic bias of seasonal prediction using genetic algorithm shows the superior performance for temporal variability such as temporal correlation, normalized standard deviation, hit rate and false alarm rate. It means that the method proposed in this study can produces high-resolution and high-quality seasonal prediction effectively.

Recommender Systems using Structural Hole and Collaborative Filtering (구조적 공백과 협업필터링을 이용한 추천시스템)

  • Kim, Mingun;Kim, Kyoung-Jae
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.4
    • /
    • pp.107-120
    • /
    • 2014
  • This study proposes a novel recommender system using the structural hole analysis to reflect qualitative and emotional information in recommendation process. Although collaborative filtering (CF) is known as the most popular recommendation algorithm, it has some limitations including scalability and sparsity problems. The scalability problem arises when the volume of users and items become quite large. It means that CF cannot scale up due to large computation time for finding neighbors from the user-item matrix as the number of users and items increases in real-world e-commerce sites. Sparsity is a common problem of most recommender systems due to the fact that users generally evaluate only a small portion of the whole items. In addition, the cold-start problem is the special case of the sparsity problem when users or items newly added to the system with no ratings at all. When the user's preference evaluation data is sparse, two users or items are unlikely to have common ratings, and finally, CF will predict ratings using a very limited number of similar users. Moreover, it may produces biased recommendations because similarity weights may be estimated using only a small portion of rating data. In this study, we suggest a novel limitation of the conventional CF. The limitation is that CF does not consider qualitative and emotional information about users in the recommendation process because it only utilizes user's preference scores of the user-item matrix. To address this novel limitation, this study proposes cluster-indexing CF model with the structural hole analysis for recommendations. In general, the structural hole means a location which connects two separate actors without any redundant connections in the network. The actor who occupies the structural hole can easily access to non-redundant, various and fresh information. Therefore, the actor who occupies the structural hole may be a important person in the focal network and he or she may be the representative person in the focal subgroup in the network. Thus, his or her characteristics may represent the general characteristics of the users in the focal subgroup. In this sense, we can distinguish friends and strangers of the focal user utilizing the structural hole analysis. This study uses the structural hole analysis to select structural holes in subgroups as an initial seeds for a cluster analysis. First, we gather data about users' preference ratings for items and their social network information. For gathering research data, we develop a data collection system. Then, we perform structural hole analysis and find structural holes of social network. Next, we use these structural holes as cluster centroids for the clustering algorithm. Finally, this study makes recommendations using CF within user's cluster, and compare the recommendation performances of comparative models. For implementing experiments of the proposed model, we composite the experimental results from two experiments. The first experiment is the structural hole analysis. For the first one, this study employs a software package for the analysis of social network data - UCINET version 6. The second one is for performing modified clustering, and CF using the result of the cluster analysis. We develop an experimental system using VBA (Visual Basic for Application) of Microsoft Excel 2007 for the second one. This study designs to analyzing clustering based on a novel similarity measure - Pearson correlation between user preference rating vectors for the modified clustering experiment. In addition, this study uses 'all-but-one' approach for the CF experiment. In order to validate the effectiveness of our proposed model, we apply three comparative types of CF models to the same dataset. The experimental results show that the proposed model outperforms the other comparative models. In especial, the proposed model significantly performs better than two comparative modes with the cluster analysis from the statistical significance test. However, the difference between the proposed model and the naive model does not have statistical significance.

Feature-based Image Analysis for Object Recognition on Satellite Photograph (인공위성 영상의 객체인식을 위한 영상 특징 분석)

  • Lee, Seok-Jun;Jung, Soon-Ki
    • Journal of the HCI Society of Korea
    • /
    • v.2 no.2
    • /
    • pp.35-43
    • /
    • 2007
  • This paper presents a system for image matching and recognition based on image feature detection and description techniques from artificial satellite photographs. We propose some kind of parameters from the varied environmental elements happen by image handling process. The essential point of this experiment is analyzes that affects match rate and recognition accuracy when to change of state of each parameter. The proposed system is basically inspired by Lowe's SIFT(Scale-Invariant Transform Feature) algorithm. The descriptors extracted from local affine invariant regions are saved into database, which are defined by k-means performed on the 128-dimensional descriptor vectors on an artificial satellite photographs from Google earth. And then, a label is attached to each cluster of the feature database and acts as guidance for an appeared building's information in the scene from camera. This experiment shows the various parameters and compares the affected results by changing parameters for the process of image matching and recognition. Finally, the implementation and the experimental results for several requests are shown.

  • PDF

Protocol converting method for the Real-time Safety Supervision System in Railway (실시간 철도안전 관제를 위한 프로토콜 변환 방안 연구)

  • Ahn, Jin;Kim, Sung-Min
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.20 no.7
    • /
    • pp.1335-1341
    • /
    • 2016
  • For the safety of train operation, monitoring & supervisory systems for train, signal, power, communication and facilities is operating independently in another place, so, its sensors are interdependently connected from each other to transfer gathering datas of sensing to control center. A Goal of Real-time railway safety supervision system is to improve the safety oversight efficiency and to prevent accidents by means of hazard prediction based on big data by integrating all of safety sensing data in wayside of railway, and the System is requested acquisition of all of sensing data of safety. So, we need special method of protocol converting for the purpose of integrating all of detecting data concerning safety without any changing application. In this paper we investigate the existing converting method in communication field, and propose a new progress to converting protocol adding function of transfer using XML file, and implemented this algorithm, and tested with example packets, finally.

Computer Adaptive Testing Method for Measuring Disability in Patients With Back Pain

  • Choi, Bongsam
    • Physical Therapy Korea
    • /
    • v.19 no.3
    • /
    • pp.124-131
    • /
    • 2012
  • Most conventional instruments measuring disability rely on total score by simply adding individual item responses, which is dependent on the items chosen to represent the underlying construct (test-dependent) and a test statistic, such as coefficient alpha for the estimate of reliability, varying from sample to sample (sample-dependent). By contrast, item response theory (IRT) method focuses on the psychometric properties of the test items instead of the instrument as a whole. By estimating probability that a respondent will select a particular rating for an item, item difficulty and person ability (or disability) can be placed on same linear continuum. These estimates are invariant regardless of the item used (test-free measurement) and the ability of sample applied (sample-free measurement). These advantages of IRT allow the creation of invariantly calibrated large item banks that precisely discriminate the disability levels of individuals. Computer adaptive testing (CAT) method often requiring a testing algorithm promise a means for administering items in a way that is both efficient and precise. This method permits selectively administering items that are closely matched to the ability level of individuals (measurement precision) and measuring the ability without the loss of precision provided by the full item bank (measurement efficiency). These measurement properties can reasonably be achieved using IRT and CAT method. This article aims to investigate comprehensive overview of the existing disability instrument for back pain and to inform physical therapists of an alternative innovative way overcoming the shortcomings of conventional disability instruments. An understanding of IRT and CAT method will equip physical therapist with skills in interpreting the measurement properties of disability instruments developed using the methods.