• Title/Summary/Keyword: k-means clustering analysis

Analysis of dimensions and shapes of maxillary and mandibular dental arch in Korean young adults

  • Park, Su-Jung;Leesungbok, Richard;Song, Jae-Won;Chang, Se Hun;Lee, Suk-Won;Ahn, Su-Jin
    • The Journal of Advanced Prosthodontics
    • /
    • v.9 no.5
    • /
    • pp.321-327
    • /
    • 2017
  • PURPOSE. The aim of this study was to investigate dental arch dimensions and to classify arch shape in Korean young adults. MATERIALS AND METHODS. The sample included 50 Koreans with age ranging from 24 to 32 years. Maxillary and mandibular casts were fabricated using irreversible hydrocolloid and type III dental stones. Incisor-canine distance, incisor-1st molar distance, incisor-2nd molar distance, intercanine distance, inter-1st molar distance, and inter-2nd molar distance in both the maxillary and mandibular arch were measured using a three-dimensional measuring device. The dental arch was classified into three groups using five ratios from the measured values by the K-means clustering method. The data were analyzed with one-way analysis of variance. RESULTS. Arch lengths (IM2D, incisal-2nd molar distance) were 44.13 mm in the maxilla and 40.40 mm in the mandible. Arch widths (M2W, inter-2nd molar width) were 64.12 mm in the maxilla and 56.37 mm in the mandible. Distribution of the dental arch form was mostly ovoid shape (maxilla 52% and mandible 56%), followed by the V-shape and the U-shape. The arch width for the U-shape was broader than for the other forms. CONCLUSION. This study establishes new reference data for dental arch dimensions for young Korean adults. The most common arch form is the ovoid type in the maxilla and mandible of Koreans. Clinicians should be aware of these references and classify arch type before and during their dental treatment for effective and harmonized results in Koreans.

Development of Customer Sentiment Pattern Map for Webtoon Content Recommendation (웹툰 콘텐츠 추천을 위한 소비자 감성 패턴 맵 개발)

  • Lee, Junsik;Park, Do-Hyung
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.4
    • /
    • pp.67-88
    • /
    • 2019
  • Webtoon is a Korean-style digital comics platform that distributes comics content produced using the characteristic elements of the Internet in a form that can be consumed online. With the recent rapid growth of the webtoon industry and the exponential increase in the supply of webtoon content, the need for effective webtoon content recommendation measures is growing. Webtoons are digital content products that combine pictorial, literary and digital elements. Therefore, webtoons stimulate consumer sentiment by entertaining readers and having them engage and empathize with the situations in which webtoons are produced. In this context, it can be expected that the sentiment that webtoons evoke in consumers will serve as an important criterion for consumers' choice of webtoons. However, there is a lack of research on improving webtoon recommendation performance by utilizing consumer sentiment. This study aims to develop consumer sentiment pattern maps that can support effective recommendations of webtoon content, focusing on consumer sentiments that have not been fully discussed previously. Metadata and consumer sentiment data were collected for 200 works serviced on the Korean webtoon platform 'Naver Webtoon'. After excluding works that did not meet the purpose of the analysis, 488 sentiment terms were collected for the remaining 127 works. Next, similar or duplicate terms were combined or abstracted in accordance with a bottom-up approach. As a result, we built a webtoon-specific sentiment index, reduced to a total of 63 emotive adjectives. By performing exploratory factor analysis on the constructed sentiment index, we derived three important dimensions for classifying webtoon types. The exploratory factor analysis was performed through Principal Component Analysis (PCA) using varimax factor rotation. The three dimensions were named 'Immersion', 'Touch' and 'Irritant' respectively. 
Based on this, K-means clustering was performed and all webtoons were classified into four types, named 'Snack', 'Drama', 'Irritant', and 'Romance'. For each type of webtoon, we drew webtoon-sentiment 2-mode network graphs and examined the characteristics of the sentiment pattern appearing for each type. In addition, through profiling analysis, we were able to derive meaningful strategic implications for each type of webtoon. First, the 'Snack' cluster is a collection of webtoons that are fast-paced and highly entertaining. Many consumers are interested in these webtoons, but they don't rate them well. Also, consumers mostly use simple expressions of sentiment when talking about these webtoons. Webtoons belonging to 'Snack' are expected to appeal to modern people who want to consume content easily and quickly during short travel time, such as commuting time. Second, webtoons belonging to 'Drama' are expected to evoke realistic and everyday sentiments rather than exaggerated and light comic ones. When consumers talk online about webtoons belonging to the 'Drama' cluster, they are found to express a variety of sentiments. It is appropriate to establish an OSMU (one source multi-use) strategy to extend these webtoons to other content such as movies and TV series. Third, the sentiment pattern map of 'Irritant' shows the sentiments that discourage customer interest by stimulating discomfort. Webtoons that evoke these sentiments struggle to attract public attention. Artists should pay attention to these sentiments that cause inconvenience to consumers when creating webtoons. Finally, webtoons belonging to 'Romance' do not evoke a variety of consumer sentiments, but they are interpreted as touching consumers. They are expected to be consumed as 'healing content' targeted at consumers with high levels of stress or mental fatigue in their lives. 
The results of this study are meaningful in that they identify the applicability of consumer sentiment in the areas of recommendation and classification of webtoons, and provide guidelines to help members of the webtoon ecosystem better understand consumers and formulate strategies.

Analysis of Utilization Characteristics, Health Behaviors and Health Management Level of Participants in Private Health Examination in a General Hospital (일개 종합병원의 민간 건강검진 수검자의 검진이용 특성, 건강행태 및 건강관리 수준 분석)

  • Kim, Yoo-Mi;Park, Jong-Ho;Kim, Won-Joong
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.14 no.1
    • /
    • pp.301-311
    • /
    • 2013
  • This study analyzes the utilization characteristics, health behaviors and health management level of private health examination recipients in one general hospital. To this end, we analyzed 150,501 cases of private health examination data covering the 11 years from 2001 to 2011 for the 20,696 participants in 2011 at the health examination center of a general hospital in Dae-Jeon. To classify private health examination groups, cluster analysis was performed using the K-means clustering method with z-score standardization. Logistic regression, decision tree and neural network analyses were used to build a model classifying periodic and non-periodic private health examination participants. From the new private health examination participants, 1,000 people with a high probability of becoming non-periodic participants were selected as the target group for customer management. According to the results of this study, private health examination participants were categorized into new, periodic and non-periodic groups. New participants included more people aged 30~39 than other age groups and more patients suspected of having renal disease. Periodic participants included more men and more patients suspected of having hyperlipidemia. Non-periodic participants included more smokers and sedentary people and more patients suspected of having anemia and diabetes mellitus. In the decision tree, the variables related to non-periodic participants were sex, age, residence, exercise, anemia, hyperlipidemia, diabetes mellitus, obesity and liver disease. In particular, 71.4% of non-periodic participants were female, non-anemic, non-exercising, and suspected of obesity. Operating a customized customer management program for private health examinations will contribute to the efficiency of the health examination center.
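
The abstract above mentions K-means clustering on z-score standardized data. As a rough illustration of that combination only, here is a minimal pure-Python sketch. Everything in it is an assumption made for the example: the two features, the nine made-up records, the choice of three clusters, and the deterministic farthest-first seeding. None of it is taken from the study, whose actual variables and software are not described in the abstract.

```python
import math


def zscore(rows):
    """Rescale every column to mean 0 and population standard deviation 1."""
    columns = list(zip(*rows))
    means = [sum(col) / len(col) for col in columns]
    stds = [
        math.sqrt(sum((v - m) ** 2 for v in col) / len(col)) or 1.0  # guard constant columns
        for col, m in zip(columns, means)
    ]
    return [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in rows]


def farthest_first(points, k):
    """Deterministic seeding (an assumption, not from the study): start at the
    first point, then repeatedly add the point farthest from the chosen seeds."""
    seeds = [points[0]]
    while len(seeds) < k:
        seeds.append(max(points, key=lambda p: min(math.dist(p, s) for s in seeds)))
    return [list(s) for s in seeds]


def kmeans(points, k, max_iter=100):
    """Lloyd's algorithm: assign each point to its nearest centroid, then move
    each centroid to the mean of its members, until assignments stop changing."""
    centroids = farthest_first(points, k)
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j])) for p in points
        ]
        if new_labels == labels:
            break
        labels = new_labels
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Made-up rows: [number of exams in 11 years, days since the last exam].
records = [
    [1, 30], [1, 60], [1, 45],        # first-time visitors
    [10, 200], [11, 350], [9, 300],   # frequent, recent visitors
    [4, 1500], [3, 1800], [5, 1650],  # visitors who stopped returning
]
labels, centroids = kmeans(zscore(records), k=3)
print(labels)
```

Standardizing first puts the exam count and the day count on a comparable scale, so the Euclidean distance used by K-means is not dominated by whichever feature happens to have the larger raw range.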

Clustering according to Inpatients' Opinion on Hospital Foodservice and Analyzing Inpatient Response to Foodservice Quality and Revisit Intention by the Cluster: In Case of S Hospital (입원환자의 급식서비스 인식에 따른 고객 군집화 및 군집별 급식서비스 질 평가, 재이용 의도 분석: S병원을 대상으로)

  • Lee, Hae-Young;Chang, Seung-Hee
    • Journal of the Korean Society of Food Science and Nutrition
    • /
    • v.35 no.10
    • /
    • pp.1491-1497
    • /
    • 2006
  • The purpose of this study was to analyze the relationship among inpatients' perceptions of foodservice quality, satisfaction and revisit intention. Questionnaires were hand-delivered to 350 inpatients and a total of 230 questionnaires were usable (response rate 65.7%). Statistical data analysis was completed using SPSS Win 11.0 for descriptive analysis, independent t-test, $\chi^2$ test and k-means cluster analysis. The results of this study can be summarized as follows: The average score of overall importance of meal service in medical service was 4.25 out of 5.0, yet the scores for overall quality and value of meal service were lower than the importance score. Helpfulness to medical treatment (3.48), bringing customer happiness (3.18), overall satisfaction with foodservice (3.66), satisfaction based on expectation before discharge (3.53) and offering foodservice befitting the hospital's reputation (3.40) were measured as expressions of satisfaction. As a result of the cluster analysis, two clusters were identified and named the affirmative opinion group and the negative opinion group. Expectations for the four factors of foodservice quality did not differ significantly between the two groups, but the affirmative opinion group scored significantly higher than the negative one in perception and satisfaction. Affirmative customers' intention to revisit in the near future was rated high both when considering general medical service (4.04) and when reflecting meal service level (3.84).

Selecting Climate Change Scenarios Reflecting Uncertainties (불확실성을 고려한 기후변화 시나리오의 선정)

  • Lee, Jae-Kyoung;Kim, Young-Oh
    • Atmosphere
    • /
    • v.22 no.2
    • /
    • pp.149-161
    • /
    • 2012
  • According to past research, of all the uncertainties arising in climate change research, the uncertainty caused by the choice of climate change scenario is the largest. Therefore, projections of future water resources differ significantly depending on which climate change scenario is adopted. In principle, it is highly recommended to utilize all the GCM scenarios offered by the IPCC. However, this can be impractical when a decision has to be made at an action officer's level. Hence, as an alternative, it is necessary to select several scenarios that express as many of the possible cases as possible. Objective standards for selecting climate change scenarios have not been properly established, and scenarios have been selected either at random or at the researcher's discretion. This research proposes a new scenario selection process that uses only a few principal scenarios while maintaining some of the uncertainties, so as to achieve the effect of having utilized all the possible scenarios. In this research, cluster analysis and the selection of a representative scenario in each cluster efficiently reduced the number of climate change scenarios. For the cluster analysis, the K-means clustering method, which takes advantage of the statistical features of scenarios, was employed; for the selection of a representative scenario in each cluster, the selection method was analyzed and reviewed, and the PDF method was used to select the best scenarios, which have the closest simulation accuracy, and the principal scenarios suggested by this research. 
In the selection of the best scenarios, it has been shown that the GCM scenario which demonstrated high level of simulation accuracy in the past need not necessarily demonstrate the similarly high level of simulation accuracy in the future and various GCM scenarios were selected for the principal scenarios. Secondly, the "Maximum entropy" which can quantify the uncertainties of the climate change scenario has been used to both quantify and compare the uncertainties associated with all the scenarios, best scenarios and the principal scenarios. Comparison has shown that the principal scenarios do maintain and are able to better explain the uncertainties of all the scenarios than the best scenarios. Therefore, through the scenario selection process, it has been proven that the principal scenarios have the effect of having utilized all the scenarios and retaining the uncertainties associated with the climate change to the maximum extent possible, while reducing the number of scenarios at the same time. Lastly, the climate change scenario most suitable for the climate on the Korean peninsula has been suggested. Through the scenario selection process, of all the scenarios found in the 4th IPCC report, principal climate change scenarios, which are suitable for the Korean peninsula and maintain most of the uncertainties, have been suggested. Therefore, it is assessed that the use of the scenario most suitable for the future projection of water resources on the Korean peninsula will be able to provide the projection of the water resources management that maintains more than 70~80% level of uncertainties of all the scenarios.

The Effects of Sidecar on Index Arbitrage Trading and Non-index Arbitrage Trading:Evidence from the Korean Stock Market (한국주식시장에서 사이드카의 역할과 재설계: 차익거래와 비차익거래에 미치는 효과를 중심으로)

  • Park, Jong-Won;Eom, Yun-Sung;Chang, Uk
    • The Korean Journal of Financial Management
    • /
    • v.24 no.3
    • /
    • pp.91-131
    • /
    • 2007
  • This paper examines the effects of the sidecar on index arbitrage trading and non-index arbitrage trading in the Korean stock market. The analyses of return, volatility, and liquidity dynamics show no distinct differences between the index arbitrage group and the non-index arbitrage group around sidecar events. For further analysis, we construct a pseudo-sidecar sample and analyse the effects of the actual sidecar and the pseudo-sidecar on the arbitrage sample and the non-index arbitrage sample. The analysis using the pseudo-sidecar shows that the differences between the index arbitrage group and the non-index arbitrage group are larger in the pseudo-sidecar sample than in the actual sidecar sample. This means that the former results can be explained by temporary order clustering on one side before and after the event. The sidecar has little effect on the non-index arbitrage group; however, it has a relatively large effect on the arbitrage group. These results imply that the sidecar system of the Korean stock market, which applies to all program trading including arbitrage and non-index arbitrage trading, needs to be redesigned.

Interpretation of Soil Catena for Agricultural Soils derived from Sedimentary Rocks (퇴적암 유래 농경지 토양에 대한 카테나 해석)

  • SONN, Yeon-Kyu;LEE, Dong-Sung;KIM, Keun-Tae;HYUN, Byung-Keun;JUN, Hye-Weon;JEON, Sang-Ho
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.20 no.4
    • /
    • pp.1-14
    • /
    • 2017
  • In Korea, the soil series derived from sedimentary rocks are classified into seven soil series of coarse loamy soil such as Dain, Danbug, Dongam, Imdong, Jeomgog, Maryeong, and Yonggog; seventeen soil series of fine loamy soil such as Angye, Anmi, Banho, Bigog, Deoggog, Dogye, Dojeon, Gamgog, Gugog, Jincheon, Maji, Mungyeong, Oggye, Samam, Yanggog, Yeongwol, and Yulgog; six soil series of fine silty soil such as Goryeong, Bonggog, Juggog, Gyeongsan, Yuga, and Yugog; and four soil series of clayey soil such as Mitan, Pyeongan, Pyeongjeon, and Uji. All thirty-four soil series have different drainage rates and topography. However, the soil texture depends on the parent rock. The buffer functions in GIS (Geographic Information System) techniques were used to calculate adjacent soil series from a soil series. The length of the adjacent soil series was adjusted because a side of the buffer area was one meter long. The cluster analysis was conducted using the CCC (Cubic Clustering Criterion) method, in which the number of clusters is calculated based on the individual soil series ratio. Soil survey has been carried out since 1964 as "The reconnaissance soil survey", and 1:5,000 detailed soil survey was completed in 1999 with a five-years plan in Korea. Today, all the soil survey information has been computerized. GIS techniques were used to establish a digital soil map; however, there have not been any studies to interpret pedogenesis using the GIS technique. In this study, the area of the adjacent soil series were obtained using the GIS technique. The area of the adjacent soil series can be calculated based on the information area. The similarities of soil originated from sedimentary rocks were estimated using the length. As a result, the distribution of grain size was different based on the types of sedimentary rocks and the location. The clusters were distinguished into limestone, sandstone, and shale. 
In addition, the soil derived from shale was divided into red shale and gray shale. This means that quantitative interpretation of the catena and this established method can be used to interpret the relationship between soil series.

A Study on the Satisfaction of Self-Employed (만족도를 이용한 자영업에 관한 연구)

  • Oh, Yu-Jin
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.2
    • /
    • pp.281-296
    • /
    • 2009
  • This study examines the job and life satisfaction of the self-employed. It uses the Korean Labour and Income Panel Study (KLIPS, hereafter) data for 1998 and 2004. We examine the phases of satisfaction and which variables influence satisfaction in both years, and compare the results in order to see what changed between the two regimes. We use k-means clustering to divide the self-employed into groups with similar degrees of satisfaction. As a result, we are able to classify the self-employed into three groups (low, medium and high) in both regimes. The high group is relatively younger and better educated, works fewer days, and has a higher proportion of women than the other groups. Regression analysis provides some evidence that women are more satisfied with their jobs than men, and that for life satisfaction the existence of income is more important than the amount of income. Age, education, satisfaction with the workplace, and health are significant for both satisfactions.

Identification of Employee Experience Factors and Their Influence on Job Satisfaction (직원경험 요인 파악 및 직무 만족도에 끼치는 영향력 분석)

  • Juhyeon Lee;So-Hyun Lee;Hee-Woong Kim
    • Information Systems Review
    • /
    • v.25 no.2
    • /
    • pp.181-203
    • /
    • 2023
  • With the fierce competition of companies for the attraction of outstanding individuals, job satisfaction of employees has been of importance. In this circumstance, many companies try to invest in job satisfaction improvement by finding employees' everyday experiences and difficulties. However, due to a lack of understanding of the employee experience, their investments are not paying off. This study examined the relationship between employee experience and job satisfaction using employee reviews and company ratings from Glassdoor, one of the largest employee communities worldwide. We use text mining techniques such as K-means clustering and LDA topic-based sentiment analysis to extract key experience factors by job level, and DistilBERT sentiment analysis to measure the sentiment score of each employee experience factor. The drawn employee experience factors and each sentiment score were analyzed quantitatively, and thereby relations between each employee experience factor and job satisfaction were analyzed. As a result, this study found that there is a significant difference between the workplace experiences of managers and general employees. In addition, employee experiences that affect job satisfaction also differed between positions, such as customer relationship and autonomy, which did not affect the satisfaction of managers. This study used text mining and quantitative modeling method based on theory of work adjustment so as to find and verify main factors of employee experience, and thus expanded research literature. In addition, the results of this study are applicable to the personnel management strategy for improving employees' job satisfaction, and are expected to improve corporate productivity ultimately.

Study about Library and Information Center's Image of Library and Information Science Students as Workplace (문헌정보학과 학생의 직장으로서의 도서관·정보센터 이미지 분석)

  • Cho, Jane;Lee, Jiwon
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.50 no.3
    • /
    • pp.113-132
    • /
    • 2016
  • The positioning technique, which has been widely used to develop marketing strategy by analyzing customers' images, has also been used to analyze the images that the public and test-takers hold of public facilities, entrepreneurs, and universities. This study uses the positioning technique to analyze the images that library and information science students seeking jobs in the library field hold of diverse types of libraries and information centers. Similarity cognition analysis by multidimensional scaling and K-means clustering showed that students perceive public, national, university, and school libraries as similar, whereas portal companies and special libraries are perceived as different from those types. Among jobs, user service jobs and technical service jobs are perceived as separate clusters, and cultural program jobs are also perceived as dissimilar from those clusters. In terms of image, work satisfaction and stability of employment are rated high for national libraries, high wages for portal companies, employee growth potential for special libraries, job importance for reference service jobs, and difficulty for content jobs. In workplace selection, most students regard stability of employment as the top priority, and accordingly they prefer public libraries the most. According to Pearson's chi-square test, this concentration of preference appears more strongly among local university students than among metropolitan area students.