• Title/Summary/Keyword: rank cluster analysis

Search Result 42, Processing Time 0.03 seconds

Recommendation of Personalized Surveillance Interval of Colonoscopy via Survival Analysis (생존분석을 이용한 맞춤형 대장내시경 검진주기 추천)

  • Gu, Jayeon;Kim, Eun Sun;Kim, Seoung Bum
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.42 no.2
    • /
    • pp.129-137
    • /
    • 2016
  • A colonoscopy is important because it detects the presence of polyps in the colon that can lead to colon cancer. How often one needs to repeat a colonoscopy may depend on various factors. The main purpose of this study is to determine personalized surveillance interval of colonoscopy based on characteristics of patients including their clinical information. The clustering analysis using a partitioning around medoids algorithm was conducted on 625 patients who had a medical examination at Korea University Anam Hospital and found several subgroups of patients. For each cluster, we then performed survival analysis that provides the probability of having polyps according to the number of days until next visit. The results of survival analysis indicated that different survival distributions exist among different patients' groups. We believe that the procedure proposed in this study can provide the patients with personalized medical information about how often they need to repeat a colonoscopy.

Assessing Spatial Disparities and Spatial-Temporal Dynamic of Urban Green Spaces: a Case Study of City of Chicago

  • Yang, Byungyun
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.38 no.5
    • /
    • pp.487-496
    • /
    • 2020
  • This study introduces how GISs (Geographic Information Systems) are used to assess spatial disparities in urban green spaces in the Chicago. Green spaces provide us with a variety of benefits, namely environmental, economic, and physical benefits. This study seeks to explore socioeconomic relationships between green spaces and their surrounding communities and to evaluate spatial disparities from a variety of perspectives, such as health-related, socioeconomic, and physical environment factors. To achieve this goal, this study used spatial statistics, such as optimized hotspot analysis, network analysis, and space-time cluster analysis, which enable conclusions to be drawn from the geographic data. In particular, 12 variables within the three factors are used to assess spatial disparities in the benefits of the use of green spaces. Finally, the variables are standardized to rank the community areas and identify where the most vulnerable community areas or parks are. To evaluate the benefits given to the community areas, this study used the z- and composite scores, which are compared in the three different combinations. After identifying the most vulnerable community area, crime data is used to spatially understand when and where crimes occur near the parks selected. This work contributes to the work of urban planners who need to spatially evaluate community areas in considering the benefits of the uses of green spaces.

Trends of Annual and Monthly FAO Penman-Monteith Reference Evapotranspiration (연별 및 월별 FAO Penman-Monteith 기준증발산 추세 분석)

  • Rim, Chang-Soo
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.28 no.1B
    • /
    • pp.65-77
    • /
    • 2008
  • The effects of climatic changes owing to urbanization, geographical and topographical conditions on annual and monthly FAO Penman-Monteith (FAO P-M) reference evapotranspiration, and energy and aerodynamic terms of FAO P-M reference evapotranspiration were studied. In this study, 21 climatological stations were selected. The statistical methods applied for trend analysis are Spearman rank test, Sen's test, linear regression analysis and analysis of actual variation ratio. Furthermore, the cluster analysis was applied to cluster 21 study stations by considering the geographical and topographical characteristics of study area. The study results indicate that urbanization affects the trend and amount of FAO P-M reference evapotranspiration, energy term and aerodynamic term; however, the result of Sen's test indicates that urbanization does not significantly affect the magnitude of trend (Sen's slope). The energy term increased at study stations located in coastal area; however, decreased at study stations located in inland area. The topographical slope of study area did not significantly influence on the trend of energy term. The aerodynamic term increased in both coastal area and inland area, indicating much significantly increasing trend in inland area, and it was also affected by the topographical slope of the study area.

Exploring the Feasibility of 16S rRNA Short Amplicon Sequencing-Based Microbiota Analysis for Microbiological Safety Assessment of Raw Oyster

  • Jaeeun Kim;Byoung Sik Kim
    • Journal of Microbiology and Biotechnology
    • /
    • v.33 no.9
    • /
    • pp.1162-1169
    • /
    • 2023
  • 16S rRNA short amplicon sequencing-based microbiota profiling has been thought of and suggested as a feasible method to assess food safety. However, even if a comprehensive microbial information can be obtained by microbiota profiling, it would not be necessarily sufficient for all circumstances. To prove this, the feasibility of the most widely used V3-V4 amplicon sequencing method for food safety assessment was examined here. We designed a pathogen (Vibrio parahaemolyticus) contamination and/or V. parahaemolyticus-specific phage treatment model of raw oysters under improper storage temperature and monitored their microbial structure changes. The samples stored at refrigerator temperature (negative control, NC) and those that were stored at room temperature without any treatment (no treatment, NT) were included as control groups. The profiling results revealed that no statistical difference exists between the NT group and the pathogen spiked- and/or phage treated-groups even when the bacterial composition was compared at the possible lowest-rank taxa, family/genus level. In the beta-diversity analysis, all the samples except the NC group formed one distinct cluster. Notably, the samples with pathogen and/or phage addition did not form each cluster even though the enumerated number of V. parahaemolyticus in those samples were extremely different. These discrepant results indicate that the feasibility of 16S rRNA short amplicon sequencing should not be overgeneralized in microbiological safety assessment of food samples, such as raw oyster.

Cure Rate Model with Clustered Interval Censored Data (군집화된 구간 중도절단자료에 대한 치유율 모형의 적용)

  • Kim, Yang-Jin
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.1
    • /
    • pp.21-30
    • /
    • 2014
  • Ordinary survival analysis cannot be applied when a significant fraction of patients may be cured. A cure rate model is the combination of cure fraction and survival model and can be applied to several types of cancer. In this article, the cure rate model is considered in the interval censored data with a cluster effect. A shared frailty model is introduced to characterize the cluster effect and an EM algorithm is used to estimate parameters. A simulation study is done to evaluate the performance of estimates. The proposed approach is applied to the smoking cessation study in which the event of interest is a smoking relapse. Several covariates (including intensive care) are evaluated to be effective for both the occurrence of relapse and the smoke quitting duration.

The Study of Soil Chemical Properties and Soil Bacterial Communities on the Cultivation Systems of Cnidium officinale Makino (일천궁의 연작재배에 따른 토양 이화학성 및 토양세균군집 연구)

  • Kim, Kiyoon;Han, Kyeung Min;Kim, Hyun-Jun;Jeon, Kwon Seok;Kim, Chung Woo;Jung, Chung Ryul
    • Korean Journal of Environmental Agriculture
    • /
    • v.39 no.1
    • /
    • pp.1-9
    • /
    • 2020
  • BACKGROUND: The aim of this study was to investigate the soil chemical properties and soil bacterial community of the cropping system for Cnidium officinale Makino. METHODS AND RESULTS: The bacterial community was analyzed for the relative abundance and principal coordinated analysis (PCoA analysis) by using by Illumina Miseq sequencing. The correlation analysis between soil chemical properties and soil bacterial community were analyzed by Spearman's rank correlation and DISTLM analysis. Soil bacterial community (phylum and class) showed two distinct clusters consisting of cluster 1 (first cropping) and cluster 2 (continuous cropping) from 2 different cultivation methods of Cnidium officinale Makino. PCoA and DISTLM analyses showed that soil pH and Ca significantly affected soil bacterial community in cultivation area of Cnidium officinale Makino. In addition, Spearman's rank correlation showed significant correlation between relative abundance (Acidobacteria and Actinobacteria) and soil factors (soil pH and Ca). CONCLUSION: The results of this study were considered to be important for determining the correlation between soil properties and soil bacterial community of the cropping method for Cnidium officinale Makino. Furthermore, the results will be helpful to investigate the cause of continuous cropping injury of the Cnidium officinale Makino by examining the changes of soil properties and soil bacterial communities.

A Preliminary Study on the Co-author Network Analysis of Korean Library & Information Science Research Community (공저 네트워크 분석에 관한 기초연구 - 문헌정보학 분야 4개 학술지를 중심으로 -)

  • Lee, Soo-Sang
    • Journal of Korean Library and Information Science Society
    • /
    • v.41 no.2
    • /
    • pp.297-315
    • /
    • 2010
  • This study investigates the various statistical data and measures of coauthorship network in the Korean LIS Research Community such as patterns of coauthorship, structural properties, types of cluster, centrality & impact analysis. This issues are mostly addressed through a Social Network Analysis of articles published from 2000 to 2009(10 years) in Korean Library & Information Science major four Journals. The coauthorship network was constructed and various measures of four centralities, PageRank, Effect size were calculated. The results show three implications. 1) There presents a phenomenon of Pareto's law in the articles publishing counts. 2) The top authors based on publishing counts prefer co-work publishing than solo-publishing. 3) The counts of article publishing are significantly correlated with five measures of network and not correlated with the case of power centrality.

  • PDF

Analysis of a Large-scale Protein Structural Interactome: Ageing Protein structures and the most important protein domain

  • Bolser, Dan;Dafas, Panos;Harrington, Richard;Schroeder, Michael;Park, Jong
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2003.10a
    • /
    • pp.26-51
    • /
    • 2003
  • Large scale protein interaction maps provide a new, global perspective with which to analyse protein function. PSIMAP, the Protein Structural Interactome Map, is a database of all the structurally observed interactions between superfamilies of protein domains with known three-dimensional structure in thePDB. PSIMAP incorporates both functional and evolutionary information into a single network. It makes it possible to age protein domains in terms of taxonomic diversity, interaction and function. One consequence of it is to predict the most important protein domain structure in evolution. We present a global analysis of PSIMAP using several distinct network measures relating to centrality, interactivity, fault-tolerance, and taxonomic diversity. We found the following results: ${\bullet}$ Centrality: we show that the center and barycenter of PSIMAP do not coincide, and that the superfamilies forming the barycenter relate to very general functions, while those constituting the center relate to enzymatic activity. ${\bullet}$ Interactivity: we identify the P-loop and immunoglobulin superfamilies as the most highly interactive. We successfully use connectivity and cluster index, which characterise the connectivity of a superfamily's neighbourhood, to discover superfamilies of complex I and II. This is particularly significant as the structure of complex I is not yet solved. ${\bullet}$ Taxonomic diversity: we found that highly interactive superfamilies are in general taxonomically very diverse and are thus amongst the oldest. This led to the prediction of the oldest and most important protein domain in evolution of lift. ${\bullet}$ Fault-tolerance: we found that the network is very robust as for the majority of superfamilies removal from the network will not break up the network. Overall, we can single out the P-loop containing nucleotide triphosphate hydrolases superfamily as it is the most highly connected and has the highest taxonomic diversity. In addition, this superfamily has the highest interaction rank, is the barycenter of the network (it has the shortest average path to every other superfamily in the network), and is an articulation vertex, whose removal will disconnect the network. More generally, we conclude that the graph-theoretic and taxonomic analysis of PSIMAP is an important step towards the understanding of protein function and could be an important tool for tracing the evolution of life at the molecular level.

  • PDF

Educational Needs and Self-Assessment for Competency of Newly Employed Therapists Using Sensory Integration Intervention (감각통합 중재를 사용하는 초임치료사의 교육요구도 및 역량에 대한 자기평가)

  • Lee, Ji-Hyun;Jung, Hyerim
    • The Journal of Korean Academy of Sensory Integration
    • /
    • v.20 no.2
    • /
    • pp.1-10
    • /
    • 2022
  • Objective : This study aimed to investigate the importance, performance, and educational needs of sensory integration intervention competency for newly employed therapists who use sensory integration intervention. Methods : The general characteristics, importance, performance, and educational needs of sensory integration intervention competency were investigated for therapists with less than three years of experience in sensory integration intervention. Educational needs and rankings were identified through Borich needs analysis. Results : The competency cluster that newly employed therapists perceived as the most important but with the lowest performance level was "Expertise," and the demand of the "Expertise" competency cluster was also the highest in the analysis of educational needs. The difference in importance and performance in all sub-competencies was statistically significant. In the Borich needs analysis, the rank of educational needs was derived as follows: "Evaluation skill" (5.56), "Analysis skill" (5.50), and "Overall knowledge of occupational therapy" (5.47). Conclusion : It was found that the newly employed therapist using sensory integration intervention recognized professional competency as the most important, while also recognizing that their professional competency was low. Accordingly, education to enhance professional competency was most needed. This study presented basic data for the direction of education to strengthen competency in consideration of the educational needs of newly employed therapists.

The Evaluation of Web Contents by User 'Likes' Count: An Usefulness of hT-index for Topic Preference Measurement

  • Song, Yeseul;Park, Ji-Hong;Shim, Jiyoung
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.49 no.2
    • /
    • pp.27-49
    • /
    • 2015
  • The purpose of this study is to suggest an appropriate index for evaluating preferences of Web contents by examining the h-index and its variants. It focuses on how successfully each index represents relative user preference towards topical subjects. Based on data obtained from a popular IT blog (engadget.com), subject values of the h-index and its variants were calculated using 53 subject categories, article counts and the 'Likes' counts aggregated in each category. These values were compared through critical analysis of the indices and Spearman rank correlation analysis. A PFNet (Pathfinder Network) of subjects weighted by $h_T$ values was drawn and cluster analysis was conducted. Based on the four criteria suggested for the evaluation of Web contents, we concluded that the $h_T$-index is a relatively appropriate tool for the Web contents preference evaluation. The $h_T$-index was applied to visually represent the relative weight (topic preference by user 'Likes' count) for each subject category of the real online contents after suggesting the relative appropriateness of the $h_T$-index. Applying scientometric indicators to Web information could provide new insights into, and potential methods for, Web contents evaluation. In addition, information on the focus of users' attention would help online informants to plan more effective content strategies. The study tries to expand the application area of the h-type indices to non-academic online environments. The research procedure enables examination of the appropriateness of the index and highlights considerations for applying the indicators to Web contents.