• Title/Summary/Keyword: data clustering

Search Result 2,742, Processing Time 0.025 seconds

Location Classification and Its Utilization for Illegal Parking Enforcement: Focusing on the Case of Gyeonggi (불법주정차 단속을 위한 지역(장소) 분류 및 활용 방안: 경기도를 중심으로)

  • Hyeon Han;So-yeon Choe;So-Hyun Lee
    • Information Systems Review
    • /
    • v.25 no.4
    • /
    • pp.113-130
    • /
    • 2023
  • Due to economic development and increasing gross national income, the number of automobiles continues to rise, leading to a serious issue of illegal parking due to limited road conditions and insufficient parking facilities. Illegal parking causes significant inconvenience and displeasure to people and can even result in accidents and loss of lives. The severity of accidents and their consequences, related to the growing number of vehicles and illegal parking, is escalating, particularly in the metropolitan areas. Consequently, efforts are being made to address this problem as a cause of social issues and come up with measures to reduce illegal parking. In particular, half of the public complaints in the metropolitan area are related to illegal parking, and the highest physical and human damage occurs in Gyeonggi. Thus, this study aims to use machine learning techniques based on data related to illegal parking in Suwon city, Gyeonggi, to categorize regional characteristics and propose effective measures to crack down on illegal parking. Additionally, practical, social, policy, and legal measures to decrease illegal parking in the metropolitan area are suggested. This study has academic significance in that it solved the problem of illegal parking, which is mentioned as one of the social problems that cause traffic congestion, by classifying regional characteristics using K-prototype, a machine learning algorithm. Furthermore, the results of this study contribute to practical and social aspects by providing measures to decrease illegal parking in the metropolitan area.

Tumor Habitat Analysis Using Longitudinal Physiological MRI to Predict Tumor Recurrence After Stereotactic Radiosurgery for Brain Metastasis

  • Da Hyun Lee;Ji Eun Park;NakYoung Kim;Seo Young Park;Young-Hoon Kim;Young Hyun Cho;Jeong Hoon Kim;Ho Sung Kim
    • Korean Journal of Radiology
    • /
    • v.24 no.3
    • /
    • pp.235-246
    • /
    • 2023
  • Objective: It is difficult to predict the treatment response of tissue after stereotactic radiosurgery (SRS) because radiation necrosis (RN) and tumor recurrence can coexist. Our study aimed to predict tumor recurrence, including the recurrence site, after SRS of brain metastasis by performing a longitudinal tumor habitat analysis. Materials and Methods: Two consecutive multiparametric MRI examinations were performed for 83 adults (mean age, 59.0 years; range, 27-82 years; 44 male and 39 female) with 103 SRS-treated brain metastases. Tumor habitats based on contrast-enhanced T1- and T2-weighted images (structural habitats) and those based on the apparent diffusion coefficient (ADC) and cerebral blood volume (CBV) images (physiological habitats) were defined using k-means voxel-wise clustering. The reference standard was based on the pathology or Response Assessment in Neuro-Oncologycriteria for brain metastases (RANO-BM). The association between parameters of single-time or longitudinal tumor habitat and the time to recurrence and the site of recurrence were evaluated using the Cox proportional hazards regression analysis and Dice similarity coefficient, respectively. Results: The mean interval between the two MRI examinations was 99 days. The longitudinal analysis showed that an increase in the hypovascular cellular habitat (low ADC and low CBV) was associated with the risk of recurrence (hazard ratio [HR], 2.68; 95% confidence interval [CI], 1.46-4.91; P = 0.001). During the single-time analysis, a solid low-enhancing habitat (low T2 and low contrast-enhanced T1 signal) was associated with the risk of recurrence (HR, 1.54; 95% CI, 1.01-2.35; P = 0.045). A hypovascular cellular habitat was indicative of the future recurrence site (Dice similarity coefficient = 0.423). Conclusion: After SRS of brain metastases, an increased hypovascular cellular habitat observed using a longitudinal MRI analysis was associated with the risk of recurrence (i.e., treatment resistance) and was indicative of recurrence site. A tumor habitat analysis may help guide future treatments for patients with brain metastases.

Cluster exploration of water pipe leak and complaints surveillance using a spatio-temporal statistical analysis (스캔통계량 분석을 통한 상수도 누수 및 수질 민원 발생 클러스터 탐색)

  • Juwon Lee;Eunju Kim;Sookhyun Nam;Tae-Mun Hwang
    • Journal of Korean Society of Water and Wastewater
    • /
    • v.37 no.5
    • /
    • pp.261-269
    • /
    • 2023
  • In light of recent social concerns related to issues such as water supply pipe deterioration leading to problems like leaks and degraded water quality, the significance of maintenance efforts to enhance water source quality and ensure a stable water supply has grown substantially. In this study, scan statistic was applied to analyze water quality complaints and water leakage accidents from 2015 to 2021 to present a reasonable method to identify areas requiring improvement in water management. SaTScan, a spatio-temporal statistical analysis program, and ArcGIS were used for spatial information analysis, and clusters with high relative risk (RR) were determined using the maximum log-likelihood ratio, relative risk, and Monte Carlo hypothesis test for I city, the target area. Specifically, in the case of water quality complaints, the analysis results were compared by distinguishing cases occurring before and after the onset of "red water." The period between 2015 and 2019 revealed that preceding the occurrence of red water, the leak cluster at location L2 posed a significantly higher risk (RR: 2.45) than other regions. As for water quality complaints, cluster C2 exhibited a notably elevated RR (RR: 2.21) and appeared concentrated in areas D and S, respectively. On the other hand, post-red water incidents of water quality complaints were predominantly concentrated in area S. The analysis found that the locations of complaint clusters were similar to those of red water incidents. Of these, cluster C7 exhibited a substantial RR of 4.58, signifying more than a twofold increase compared to pre-incident levels. A kernel density map analysis was performed using GIS to identify priority areas for waterworks management based on the central location of clusters and complaint cluster RR data.

A Framework for Identifying and Analyzing IT Project Risk Factors (IT프로젝트 위험 요인 식별 및 분석 프레임워크 연구)

  • Jangho Choi;Chanhee Kwak;Heeseok Lee
    • Information Systems Review
    • /
    • v.19 no.4
    • /
    • pp.87-110
    • /
    • 2017
  • Analyzing and finding the risk factors in information technology (IT) projects have been discussed because risk management is an important issue in IT project management. This study obtained the risk factor checklists with priorities, analyzed the causal relationship of risk factors, and determined their influences on IT project management. However, only few studies systematically classified IT project risk factors in terms of risk exposure. These studies considered both the probability of occurrence and the degree of risk simultaneously. The present study determined 53 IT project risk factors on the basis of literature and expert group discussions. Additionally, this study presented clustering analysis based on the data of 140 project managers. The IT project risk factor classification framework was divided into four areas (HIHF, HILF, LIHF, and LILF). The present results can be used to help IT project managers establish effective risk management strategies and reduce IT project failures. This study also provides academic implication because it considers both the probability of occurrence and the degree of influence of risk factors.

Linking growth performance and carcass traits with enterotypes in Muscovy ducks

  • Qian Fan;Yini Xu;Yingping Xiao;Caimei Yang;Wentao Lyu;Hua Yang
    • Animal Bioscience
    • /
    • v.37 no.7
    • /
    • pp.1213-1224
    • /
    • 2024
  • Objective: Enterotypes (ETs) are the clustering of gut microbial community structures, which could serve as indicators of growth performance and carcass traits. However, ETs have been sparsely investigated in waterfowl. The objective of this study was to identify the ileal ETs and explore the correlation of the ETs with growth performance and carcass traits in Muscovy ducks. Methods: A total of 200 Muscovy ducks were randomly selected from a population of 5,000 ducks at 70-day old, weighed and slaughtered. The growth performance and carcass traits, including body weight, dressed weight and evidenced weight, dressed percentage, percentage of apparent yield, breast muscle weight, leg muscle weight, percentage of leg muscle and percentage of breast muscle, were determined. The contents of ileum were collected for the isolation of DNA and 16S rRNA gene sequencing. The ETs were identified based on the 16S rRNA gene sequencing data and the correlation of the ETs with growth performance and carcass traits was performed by Spearman correlation analysis. Results: Three ETs (ET1, ET2, and ET3) were observed in the ileal microbiota of Muscovy ducks with significant differences in number of features and α-diversity among these ETs (p<0.05). Streptococcus, Candida Arthritis, and Bacteroidetes were the presentative genus in ET1 to ET3, respectively. Correlation analysis revealed that Lactococcus and Bradyrhizobium were significantly correlated with percentage of eviscerated yield and leg muscle weight (p<0.05) while ETs were found to have a close association with percentage of eviscerated yield, leg muscle weight, and percentage of leg muscle in Muscovy ducks. However, the growth performance of ducks with different ETs did not show significant difference (p>0.05). Lactococcus were found to be significantly correlated with leg muscle weight, dressed weight, and percentage of eviscerated yield. Conclusion: Our findings revealed a substantial variation in carcass traits associated with ETs in Muscovy ducks. It is implied that ETs might have the potential to serve as a valuable biomarker for assessing duck carcass traits. It would provide novel insights into the interaction of gut microbiota with growth performance and carcass traits of ducks.

Analysis of Interactions in Multiple Genes using IFSA(Independent Feature Subspace Analysis) (IFSA 알고리즘을 이용한 유전자 상호 관계 분석)

  • Kim, Hye-Jin;Choi, Seung-Jin;Bang, Sung-Yang
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.33 no.3
    • /
    • pp.157-165
    • /
    • 2006
  • The change of external/internal factors of the cell rquires specific biological functions to maintain life. Such functions encourage particular genes to jnteract/regulate each other in multiple ways. Accordingly, we applied a linear decomposition model IFSA, which derives hidden variables, called the 'expression mode' that corresponds to the functions. To interpret gene interaction/regulation, we used a cross-correlation method given an expression mode. Linear decomposition models such as principal component analysis (PCA) and independent component analysis (ICA) were shown to be useful in analyzing high dimensional DNA microarray data, compared to clustering methods. These methods assume that gene expression is controlled by a linear combination of uncorrelated/indepdendent latent variables. However these methods have some difficulty in grouping similar patterns which are slightly time-delayed or asymmetric since only exactly matched Patterns are considered. In order to overcome this, we employ the (IFSA) method of [1] to locate phase- and shut-invariant features. Membership scoring functions play an important role to classify genes since linear decomposition models basically aim at data reduction not but at grouping data. We address a new function essential to the IFSA method. In this paper we stress that IFSA is useful in grouping functionally-related genes in the presence of time-shift and expression phase variance. Ultimately, we propose a new approach to investigate the multiple interaction information of genes.

Determination of Tumor Boundaries on CT Images Using Unsupervised Clustering Algorithm (비교사적 군집화 알고리즘을 이용한 전산화 단층영상의 병소부위 결정에 관한 연구)

  • Lee, Kyung-Hoo;Ji, Young-Hoon;Lee, Dong-Han;Yoo, Seoung-Yul;Cho, Chul-Koo;Kim, Mi-Sook;Yoo, Hyung-Jun;Kwon, Soo-Il;Chun, Jun-Chul
    • Journal of Radiation Protection and Research
    • /
    • v.26 no.2
    • /
    • pp.59-66
    • /
    • 2001
  • It is a hot issue to determine the spatial location and shape of tumor boundary in fractionated stereotactic radiotherapy (FSRT). We could get consecutive transaxial plane images from the phantom (paraffin) and 4 patients with brain tumor using helical computed tomography(HCT). K-means classification algorithm was adjusted to change raw data pixel value in CT images into classified average pixel value. The classified images consists of 5 regions that ate tumor region (TR), normal region (NR), combination region (CR), uncommitted region (UR) and artifact region (AR). The major concern was how to separate the normal region from tumor region in the combination area. Relative average deviation analysis was adjusted to alter average pixel values of 5 regions into 2 regions of normal and tumor region to define maximum point among average deviation pixel values. And then we drawn gross tumor volume (GTV) boundary by connecting maximum points in images using semi-automatic contour method by IDL(Interactive Data Language) program. The error limit of the ROI boundary in homogeneous phantom is estimated within ${\pm}1%$. In case of 4 patients, we could confirm that the tumor lesions described by physician and the lesions described automatically by the K-mean classification algorithm and relative average deviation analyses were similar. These methods can make uncertain boundary between normal and tumor region into clear boundary. Therefore it will be useful in the CT images-based treatment planning especially to use above procedure apply prescribed method when CT images intermittently fail to visualize tumor volume comparing to MRI images.

  • PDF

Recommending Core and Connecting Keywords of Research Area Using Social Network and Data Mining Techniques (소셜 네트워크와 데이터 마이닝 기법을 활용한 학문 분야 중심 및 융합 키워드 추천 서비스)

  • Cho, In-Dong;Kim, Nam-Gyu
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.1
    • /
    • pp.127-138
    • /
    • 2011
  • The core service of most research portal sites is providing relevant research papers to various researchers that match their research interests. This kind of service may only be effective and easy to use when a user can provide correct and concrete information about a paper such as the title, authors, and keywords. However, unfortunately, most users of this service are not acquainted with concrete bibliographic information. It implies that most users inevitably experience repeated trial and error attempts of keyword-based search. Especially, retrieving a relevant research paper is more difficult when a user is novice in the research domain and does not know appropriate keywords. In this case, a user should perform iterative searches as follows : i) perform an initial search with an arbitrary keyword, ii) acquire related keywords from the retrieved papers, and iii) perform another search again with the acquired keywords. This usage pattern implies that the level of service quality and user satisfaction of a portal site are strongly affected by the level of keyword management and searching mechanism. To overcome this kind of inefficiency, some leading research portal sites adopt the association rule mining-based keyword recommendation service that is similar to the product recommendation of online shopping malls. However, keyword recommendation only based on association analysis has limitation that it can show only a simple and direct relationship between two keywords. In other words, the association analysis itself is unable to present the complex relationships among many keywords in some adjacent research areas. To overcome this limitation, we propose the hybrid approach for establishing association network among keywords used in research papers. The keyword association network can be established by the following phases : i) a set of keywords specified in a certain paper are regarded as co-purchased items, ii) perform association analysis for the keywords and extract frequent patterns of keywords that satisfy predefined thresholds of confidence, support, and lift, and iii) schematize the frequent keyword patterns as a network to show the core keywords of each research area and connecting keywords among two or more research areas. To estimate the practical application of our approach, we performed a simple experiment with 600 keywords. The keywords are extracted from 131 research papers published in five prominent Korean journals in 2009. In the experiment, we used the SAS Enterprise Miner for association analysis and the R software for social network analysis. As the final outcome, we presented a network diagram and a cluster dendrogram for the keyword association network. We summarized the results in Section 4 of this paper. The main contribution of our proposed approach can be found in the following aspects : i) the keyword network can provide an initial roadmap of a research area to researchers who are novice in the domain, ii) a researcher can grasp the distribution of many keywords neighboring to a certain keyword, and iii) researchers can get some idea for converging different research areas by observing connecting keywords in the keyword association network. Further studies should include the following. First, the current version of our approach does not implement a standard meta-dictionary. For practical use, homonyms, synonyms, and multilingual problems should be resolved with a standard meta-dictionary. Additionally, more clear guidelines for clustering research areas and defining core and connecting keywords should be provided. Finally, intensive experiments not only on Korean research papers but also on international papers should be performed in further studies.

Big Five Personality in Discriminating the Groups by the Level of Social Sims (심리학적 도구 '5요인 성격 특성'에 의한 소셜 게임 연구: <심즈 소셜> 게임의 분석사례를 중심으로)

  • Lee, Dong-Yeop
    • Cartoon and Animation Studies
    • /
    • s.29
    • /
    • pp.129-149
    • /
    • 2012
  • The purpose of this study was to investigate the clustering and Big Five Personality domains in discriminating groups by level of school-related adjustment, as experienced by Social Sims game users. Social Games are based on web that has simple rules to play in fictional time and space background. This paper is to analyze the relationships between social networks and user behaviors through the social games . In general, characteristics of social games are simple, fun and easy to play, popular to the public, and based on personal connections in reality. These features of social games make themselves different from video games with one player or MMORPG with many unspecific players. Especially Social Game show a noticeable characteristic related to social learning. The object of this research is to provide a possibility that game that its social perspective can be strengthened in social game environment and analyze whether it actually influences on problem solving of real life problems, therefore suggesting its direction of alternative play means and positive simulation game. Data was collected by administering 4 questionnaires (the short version of BFI, Satisfaction with life, Career Decision-.Making Self-.Efficacy, Depression) to the participants who were 20 people in Seoul and Daejeon. For the purposes of the data analysis, both Stepwise Discriminant analysis and Cluster analysis was employed. Neuroticism, Openness, Conscientiousness within the Big Five Personality domains were seen to be significant variables when it came to discriminating the groups. These findings indicated that the short version of the BFI may be useful in understanding for game user behaviors When it comes to cultural research, digital game takes up a significant role. We can see that from the fact that game, which has only been considered as a leisure activity or commercial means, is being actively research for its methodological, social role and function. Among digital game's several meanings, one of the most noticeable ones is the research on its critical, social participating function. According to Jame Paul gee, the most important merit of game is 'projected identity'. This means that experiences from various perspectives is possible.[1] In his recent autobiography , he described gamer as an active problem solver. In addition, Gonzalo Francesca also suggested an alternative game developing method through 'game that conveys critical messages by strengthening critical reasons'. [2] They all provided evidences showing game can be a strong academic tool. Not only does a genre called social game exist in the field of media and Social Network Game, but there are also some efforts to positively evaluate its value Through these kinds of researches, we can study how game can give positive influence along with the change in its general perception, which would eventually lead to spreading healthy game culture and enabling fresh life experience. This would better bring out the educative side of the game and become a social communicative tool. The object of this game is to provide a possibility that the social aspect can be strengthened within the game environment and analyze whether it actually influences the problem solving of real life problems. Therefore suggesting it's direction of alternative play means positive game simulation.

A Study on the Consumer's Service Quality Perception Based on the Types of Life-style (소비자의 라이프스타일에 따른 서비스품질 지각 차이에 관한 연구)

  • Park, Yoon-Seo;Lee, Seung-In;Choi, In
    • Journal of Global Scholars of Marketing Science
    • /
    • v.19 no.2
    • /
    • pp.53-67
    • /
    • 2009
  • For the last decades, service quality has been studied as one of the most important tools for a service company to compete with the other companies. Based on these past researches, it has been agreed that the service quality is a basic and powerful tool to create the competitive advantage. Due to similar reason, many service marketing practitioners have been also focused on the service quality to retain the existing consumers and collect the new consumers. However, service quality is subjectively perceived by individual consumers. Consumer evaluation of service quality can be different from each other. Especially consumers with one life-style may evaluate the service quality differently from the consumers with the other life-styles. Therefore we need to know whether there are differences in service quality perception on the categories of life-style. Life-style refers to a distinctive mode of living in its aggregate and broadest sense. It embodies the patterns that were developed and emerged from the dynamics of living in a society. Since the concept of life-style and its relationship to marketing was introduced in 1963 by William Lazer, methods of measuring the life-style and their application have been developed. Life-style has been usually used to segment the marketplace because it offers marketers a unique and important view of the market. When Life-style is combined with clustering methods, life-style segmentation can generate identifiable whole persons rather than isolated fragment. Life-style segmentation begins with people instead of products and classifies them into different life-style types, each characterized by a unique style of living based on a wide range of activities, interests, and opinions(Plummer, 1974). In this study we applies the life-style segmentation based on the AIO(Activities, Interests, and Opinions) to the consumers of the large discount stores. In Korea, the large discount store market has entered into maturity stage so that the market differentiation strategy is becoming a more critical issue to the marketing practitioners. One of the most important tools to differentiate from the competitors in large discount store market is continuously to provide service of better quality than competitors. This study tries to find answers about the following questions: 1) How can we categorize the consumer life-styles in the large discount store? 2) What are the characteristics of the categorized groups? 3) Are there any differences in service quality perception among the consumers with different life-styles 4) Are there any differences in consumer behavior among them in the large discount store? For the purpose, we collected survey data from consumers and analyzed the data with the SPSS package where we had $X^2$-test, factor analysis, ANOVA, MANOVA, and cluster analysis. The survey was made during one month in the April of 2008. Among the collected 306 copies of questionnaires, 281 copies were chosen as the effective samples for empirical analysis except 25 copies with wrong responses. To identify the life-style patterns, we used the measures employed by Kim and Kwon(1999), where 44 items on a seven-point scale were used to measure factors of the life-style patterns. The Principal Component Method was used for factor extraction, and the VARIMAX orthogonal factor rotation was employed. The 7 items showing low factor loading were eliminated. The results of the factor analysis suggested that nine factors of the life-style patterns were identified as follows: 1) the equality-of-sexes and pursuit-of-independence tendency 2) self-management tendency 3) sociable tendency 4) self-display tendency 5) degree of a dilettante life 6) pursuit-of-information tendency 7) bargain hunter tendency 8) TV preference tendency 9) pursuit-of-leisure tendency. Next, after the K-means cluster analysis was performed with nine factors of the life-style patterns, the life-styles of the respondents were classified into four groups which are named as the 'progressive practicality-oriented group', 'positive success-oriented group', 'sociable ostentation-oriented group', 'stable conservation-oriented group'. The analysis results for usage behavior between the market segments showed statistically significant differences in the frequency of usage, duration time in the store, consumer satisfaction, and loyalty. Also, we tried to investigate whether the large discount store consumers differently perceive the quality of service based upon the types of life-style. To measure the service quality of large discount store, we adapted several measurement models measuring the service quality such as SERVPERF, BCP, R-SERVPERF, R-BCP. MANOVA and One-Way ANOVA were performed to confirm the difference in service quality perception based on the market segments. The results have also shown significant differences between life-style types in service quality perception. These findings show that the large discount store marketers should consider consumer life-style as one of the most important market segments for marketing and understand the difference in service quality perception between life-style types. Our findings give important implications to marketers of large discount stores as well as life-style researchers. First, this study showed there were significant differences in consumer's service quality perception and usage behavior between the types of life-style. It provides evidence that the life-style approach can be a important basis in segmenting the large discount store market and will make consumers perceive the service quality high. Second, most previous researches on service quality have been in aggregate level. However, our results imply that the future research on service quality have to focus on segment level.

  • PDF