• Title/Summary/Keyword: and clustering

Search Result 5,619, Processing Time 0.038 seconds

Characteristics and Survival of Genus Vibrio Isolated in the Intertidal Zone of the Yellow Sea near Kunsan (군산인근해역에서 분리동정된 Vibrio 속의 특성과 해수에서의 생존)

  • 왕혜영;이건형
    • Korean Journal of Environmental Biology
    • /
    • v.17 no.4
    • /
    • pp.439-448
    • /
    • 1999
  • To investigate the population dynamics and survival of Genus Vibrio, population densities of aerobic saprophytic bacteria and Vibrio groups were measured 4 times in the intertidal waters of the Yellow Sea near Kunsan from November, 1997 to June, 1998. The distribution of heterotrophic bacteria during the survey periods by plate count and direct count method ranged from 1.2$\pm$0.6$\times$10$^3$~2.0$\pm$1.5$\times$10$^4$CFU ml­$^1$and from 6.0$\pm$4.0$\times$10$^{5}$ ~1.9$\pm$1.5$\times$10$^{7}$ cells ml­$^1$, respectively. Vibrio groups were distributed in the range of 1$\times$10 and 6$\pm$2.2$\times$10$^2$CFU ml­$^1$. The proportion of Vibrio groups to total heterotrophic bacteria was between 0.1 and 6% during the survey periods. A total of 51 isolates was obtained from TCBS agar plates and identified to species level by Biolog Identification System$^{TM}$. As a result, dominant genera were V, mediterranei, V aitguillarum, tr metschnikovii, and V. parahaemolyticus, and isolates were clustered into 26 groups based on the relatedness of average linkage clustering method at 70% level. As for the susceptibility of 51 isolates to 7 kinds of antibacterial agents (gentamicin, ampicillin, chlorarnphenicol, streptomycin, kanamycin, tetracycline, carbenicillin), 96% of isolates showed high resistance to more than one antibiotics and 65% of isolates contained a plasmid, of which size was observed greater than 12 kb, The number of cells of 3 tested strains (V. anguillarum, V. vulnificus, and V. metschnikovii) in filtered aged seawater decreased by approximately 1 to 5 orders of magnitude during 30-d incubation. In most cases, the numbers of cells decreased rapidly until day 3, then decreased slowly by day 30. The number of cells incubated at 15$^{\circ}C$ showed higher survival than those at 4$^{\circ}C$ and $25^{\circ}C$. These results may be considered for the basic supporting data in the risk assessment of vibriosis in summer.r.

  • PDF

Usefulness of Data Mining in Criminal Investigation (데이터 마이닝의 범죄수사 적용 가능성)

  • Kim, Joon-Woo;Sohn, Joong-Kweon;Lee, Sang-Han
    • Journal of forensic and investigative science
    • /
    • v.1 no.2
    • /
    • pp.5-19
    • /
    • 2006
  • Data mining is an information extraction activity to discover hidden facts contained in databases. Using a combination of machine learning, statistical analysis, modeling techniques and database technology, data mining finds patterns and subtle relationships in data and infers rules that allow the prediction of future results. Typical applications include market segmentation, customer profiling, fraud detection, evaluation of retail promotions, and credit risk analysis. Law enforcement agencies deal with mass data to investigate the crime and its amount is increasing due to the development of processing the data by using computer. Now new challenge to discover knowledge in that data is confronted to us. It can be applied in criminal investigation to find offenders by analysis of complex and relational data structures and free texts using their criminal records or statement texts. This study was aimed to evaluate possibile application of data mining and its limitation in practical criminal investigation. Clustering of the criminal cases will be possible in habitual crimes such as fraud and burglary when using data mining to identify the crime pattern. Neural network modelling, one of tools in data mining, can be applied to differentiating suspect's photograph or handwriting with that of convict or criminal profiling. A case study of in practical insurance fraud showed that data mining was useful in organized crimes such as gang, terrorism and money laundering. But the products of data mining in criminal investigation should be cautious for evaluating because data mining just offer a clue instead of conclusion. The legal regulation is needed to control the abuse of law enforcement agencies and to protect personal privacy or human rights.

  • PDF

Difference in Electrophoretic Phenotypes of rice Cultivars Selected to Bensulfuron (Bensulfuron에 대(對)한 내성(耐性) 및 감수성(感受性) 수도품종(水稻品種)의 전기영동(電氣泳動) 표현형(表現型) 차이(差異))

  • Kuk, Y.I.;Guh, J.O.;Kim, Y.J.;Lee, D.J.
    • Korean Journal of Weed Science
    • /
    • v.8 no.3
    • /
    • pp.250-257
    • /
    • 1988
  • The study was intended to know any relations between the rice tolerance to bensulfuron and varietal speciation in seed protein composition or any enzymatical allelies with or without chemical treatment. Rice varieties used were UCP-28, Chinsurah Boro II, Fukunohama, Fadehpur-2, IR 14252-13-2-2-5 as the tolerant group, and HP 93(3) FA, HP94(9) FA, Padilabou Alumbis, KH-17854, and IR 1846-2841-1 as the susceptible, respectively. Electrophoretic methods used were SDS-PAGE for seed protein, 7% PAGE for isozymes (acid phosphatase, peroxidase, malate dehydrogenase, and esterase from rice seedling) and variation in isoenzyme profiles (malate dehydrogenase, peroxidase, and esterase) as affected by different concentrations of bensulfuron(0, $10^{-6}$, $10^{-5}$ and $3{\times}10^{-5}M$) was also studied. The results are summarized as follows. -Among 16 bands separated in seed proteins, two different rice groups selected in terms of tolerance to bensulfuron were clustered in dissimilarity, which was based on relatively larger area in whole peaks and higher activities in N, O, P bands for the tolerant group. -Among isozymes obtained from rice seedlings without chemical treatments, the following specificities were obtained. The tolerant varieties had the relatively higher activity in D band out of 4 peroxidase bands. Malate dehydrogenase was separated into 3 bands and only tolerant varieties had A band and higher activities in Band C bands. Esterase was separated into 3-4 bands with higher activities in A and B bands for tolerant varieties. There were one major band accompanied by 2-3 minor bands for acid phosphatase in which only tolerant varieties had the B band. -The effect of Bensulfuron concentration on the isozyme activities showed that the activity of C band in peroxidase was not present in tolerant varieties which was contrary to the increased activities in susceptible varieties. However, D band was gradually disappeared only in susceptible varieties as the concentration of bensulfuron was increased. For malate dehydrogenase in the susceptible varieties, major bands D, E and F kept consistantly higher activities while minor bands A, B and C disappeared sensitively. Among 5 bands of esterase separated, D band was present only in the tolerant varieties while E band only in the susceptible. The activities in A, C, E bands were sharply decreased in the susceptible varieties as the concentration of bensulfuron was increased.

  • PDF

Health Lifestyle Patterns of Seoul Adults (서울 일부지역 성인의 건강생활양식 유형연구)

  • Lee, Hwa-Kyung;Lee, In-Young;Kim, Eun-Mi;Lee, Hun-Jae;Bae, Sang-Soo
    • Journal of agricultural medicine and community health
    • /
    • v.31 no.2
    • /
    • pp.145-156
    • /
    • 2006
  • Objectives: Health behaviors are related to each other, or they may be essentially dependent upon each other. Hence the overall health behaviors of a given population could be better described in terms of health lifestyle patterns. This paper tried to classify such patterns in a sample population and suggest the socioeconomic and demographic characteristics of each groups. Methods: A sample population comprised of 2,775 adults who reported their health behaviors in a public health survey were classified according to their smoking, drinking, diet, and exercise related pattern of behaviors. Clustering analysis was used to classify them. Results: Six health lifestyle patterns were identified. Individuals in the passive lifestyle cluster (48.3%) had no active health promoting activities, but did avoid risk taking health behaviors. 24.8% of the sample (Health promoting lifestyle) had an overall healthy lifestyle. 13.5% of the sample were in the smoking cluster, and 8.4% were in the alcohol drinking cluster. The hedonic lifestyle (4.5%) was characterized by heavy smoking, alcohol drinking and poor diet and exercise. 0.7% of the sample (Smoking-Drinking lifestyle) had heavy smoking and drinking, but good diet and exercise. Each group could be characterized by sex, age, and income. Conclusions: A population sample of Seoul adults were successfully clustered into six health lifestyles. The socioeconomic and demographic characteristics were suggested for the characterization of the each health lifestyle groups. We can approach to a certain target population with specific strategy.

  • PDF

The Need for Paradigm Shift in Semantic Similarity and Semantic Relatedness : From Cognitive Semantics Perspective (의미간의 유사도 연구의 패러다임 변화의 필요성-인지 의미론적 관점에서의 고찰)

  • Choi, Youngseok;Park, Jinsoo
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.1
    • /
    • pp.111-123
    • /
    • 2013
  • Semantic similarity/relatedness measure between two concepts plays an important role in research on system integration and database integration. Moreover, current research on keyword recommendation or tag clustering strongly depends on this kind of semantic measure. For this reason, many researchers in various fields including computer science and computational linguistics have tried to improve methods to calculating semantic similarity/relatedness measure. This study of similarity between concepts is meant to discover how a computational process can model the action of a human to determine the relationship between two concepts. Most research on calculating semantic similarity usually uses ready-made reference knowledge such as semantic network and dictionary to measure concept similarity. The topological method is used to calculated relatedness or similarity between concepts based on various forms of a semantic network including a hierarchical taxonomy. This approach assumes that the semantic network reflects the human knowledge well. The nodes in a network represent concepts, and way to measure the conceptual similarity between two nodes are also regarded as ways to determine the conceptual similarity of two words(i.e,. two nodes in a network). Topological method can be categorized as node-based or edge-based, which are also called the information content approach and the conceptual distance approach, respectively. The node-based approach is used to calculate similarity between concepts based on how much information the two concepts share in terms of a semantic network or taxonomy while edge-based approach estimates the distance between the nodes that correspond to the concepts being compared. Both of two approaches have assumed that the semantic network is static. That means topological approach has not considered the change of semantic relation between concepts in semantic network. However, as information communication technologies make advantage in sharing knowledge among people, semantic relation between concepts in semantic network may change. To explain the change in semantic relation, we adopt the cognitive semantics. The basic assumption of cognitive semantics is that humans judge the semantic relation based on their cognition and understanding of concepts. This cognition and understanding is called 'World Knowledge.' World knowledge can be categorized as personal knowledge and cultural knowledge. Personal knowledge means the knowledge from personal experience. Everyone can have different Personal Knowledge of same concept. Cultural Knowledge is the knowledge shared by people who are living in the same culture or using the same language. People in the same culture have common understanding of specific concepts. Cultural knowledge can be the starting point of discussion about the change of semantic relation. If the culture shared by people changes for some reasons, the human's cultural knowledge may also change. Today's society and culture are changing at a past face, and the change of cultural knowledge is not negligible issues in the research on semantic relationship between concepts. In this paper, we propose the future directions of research on semantic similarity. In other words, we discuss that how the research on semantic similarity can reflect the change of semantic relation caused by the change of cultural knowledge. We suggest three direction of future research on semantic similarity. First, the research should include the versioning and update methodology for semantic network. Second, semantic network which is dynamically generated can be used for the calculation of semantic similarity between concepts. If the researcher can develop the methodology to extract the semantic network from given knowledge base in real time, this approach can solve many problems related to the change of semantic relation. Third, the statistical approach based on corpus analysis can be an alternative for the method using semantic network. We believe that these proposed research direction can be the milestone of the research on semantic relation.

Internal Structure and Movement History of the Keumwang Fault (금왕단층의 내부구조 및 단층발달사)

  • Kim, Man-Jae;Lee, Hee-Kwon
    • The Journal of the Petrological Society of Korea
    • /
    • v.25 no.3
    • /
    • pp.211-230
    • /
    • 2016
  • Detailed mapping along the Keumwang fault reveals a complex history of multiple brittle reactivations following late Jurassic and early Cretaceous ductile shearing. The fault core consists of a 10~50 m thick fault gouge layer bounded by a 30~100 m thick damaged zone. The Pre-cambrian gneiss and Jurassic granite underwent at least six distinct stages of fault movements based on deformation environment, time and mechanism. Each stage characterized by fault kinematics and dynamics at different deformation environment. Stage 1 generated mylonite series along the Keumwang shear zone by sinistral ductile shearing during late Jurassic and early Cretaceous. Stage 2 was a mostly brittle event generating cataclasite series superimposed on the mylonite series of the Keumwang shear zone. The roundness of pophyroclastes and the amount of matrix increase from host rocks to ultracataclasite indicating stronger cataclastic flow toward the fault core. At stage 3, fault gouge layer superimposed on the cataclasite generated during stage 2 and the sedimentary basins (Umsung and Pungam) formed along the fault by sinistral strike-slip movement. Fragments of older cataclasite suspended in the fault gouge suggest extensive reworking of fault rocks at brittle deformation environments. At stage 4, systematic en-echelon folds, joints and faults were formed in the sedimentary basins by sinistral strike-slip reactivation of the Keumwang fault. Most of the shearing is accommodated by slip along foliations and on discrete shear surfaces, while shear deformation tends to be relatively uniformly distributed within the fault damage zone developed in the mudrocks in the sedimentary basins. Fine-grained andesitic rocks intruded during stage 4. Stage 5 dextral strike-slip activity produced shear planes and bands in the andesitic rocks. ESR(Electron Spin Resonance) dates of fault gouge show temporal clustering within active period and migrating along the strike of the Keumwang fault during the stage 6 at the Quaternary period.

Digital Archives of Cultural Archetype Contents: Its Problems and Direction (디지털 아카이브즈의 문제점과 방향 - 문화원형 콘텐츠를 중심으로 -)

  • Hahm, Han-Hee;Park, Soon-Cheol
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.17 no.2
    • /
    • pp.23-42
    • /
    • 2006
  • This is a study of the digital archives of Culturecontent.com where 'Cultural Archetype Contents' are currently in service. One of the major purposes of our study is to point out problems in the current system and eventually propose improvements to the digital archives. The government launched a four-year project for developing the cultural archetype content sources and establishing its related business with the hope of enhancing the nation's competitiveness. More specifically, the project focuses on the production of source materials of cultural archetype contents in the subjects of Korea's history. tradition, everyday life. arts and general geographical books. In addition, through this project, the government also intends to establish a proper distribution system of digitalized culture contents and to control copyright issues. This paper analyzes the digital archives system that stores the culture content data that have been produced from 2002 to 2005 and evaluates the current system's weaknesses and strengths. The summary of our findings is as follows. First. the digital archives system does not contain a semantic search engine and therefore its full function is 1agged. Second, similar data is not classified into the same categories but into the different ones, thereby confusing and inconveniencing users. Users who want to find source materials could be disappointed by the current distributive system. Our paper suggests a better system of digital archives with text mining technology which consists of five significant intelligent process-keyword searches, summarization, clustering, classification and topic tracking. Our paper endeavors to develop the best technical environment for preserving and using culture contents data. With the new digitalized upgraded settings, users of culture contents data will discover a world of new knowledge. The technology we introduce in this paper will lead to the highest achievable digital intelligence through a new framework.

A Study on Analysis of consumer perception of YouTube advertising using text mining (텍스트 마이닝을 활용한 Youtube 광고에 대한 소비자 인식 분석)

  • Eum, Seong-Won
    • Management & Information Systems Review
    • /
    • v.39 no.2
    • /
    • pp.181-193
    • /
    • 2020
  • This study is a study that analyzes consumer perception by utilizing text mining, which is a recent issue. we analyzed the consumer's perception of Samsung Galaxy by analyzing consumer reviews of Samsung Galaxy YouTube ads. for analysis, 1,819 consumer reviews of YouTube ads were extracted. through this data pre-processing, keywords for advertisements were classified and extracted into nouns, adjectives, and adverbs. after that, frequency analysis and emotional analysis were performed. Finally, clustering was performed through CONCOR. the summary of this study is as follows. the first most frequently mentioned words were Galaxy Note (n = 217), Good (n = 135), Pen (n = 40), and Function (n = 29). it can be judged through the advertisement that consumers "Galaxy Note", "Good", "Pen", and "Features" have good functional aspects for Samsung mobile phone products and positively recognize the Note Pen. in addition, the recognition of "Samsung Pay", "Innovation", "Design", and "iPhone" shows that Samsung's mobile phone is highly regarded for its innovative design and functional aspects of Samsung Pay. second, it is the result of sentiment analysis on YouTube advertising. As a result of emotional analysis, the ratio of emotional intensity was positive (75.95%) and higher than negative (24.05%). this means that consumers are positively aware of Samsung Galaxy mobile phones. As a result of the emotional keyword analysis, positive keywords were "good", "good", "innovative", "highest", "fast", "pretty", etc., negative keywords were "frightening", "I want to cry", "discomfort", "sorry", "no", etc. were extracted. the implication of this study is that most of the studies by quantitative analysis methods were considered when looking at the consumer perception study of existing advertisements. In this study, we deviated from quantitative research methods for advertising and attempted to analyze consumer perception through qualitative research. this is expected to have a great influence on future research, and I am sure that it will be a starting point for consumer awareness research through qualitative research.

Development of Multiplex Microsatellite Marker Set for Identification of Korean Potato Cultivars (국내 감자 품종 판별을 위한 다중 초위성체 마커 세트 개발)

  • Cho, Kwang-Soo;Won, Hong-Sik;Jeong, Hee-Jin;Cho, Ji-Hong;Park, Young-Eun;Hong, Su-Young
    • Horticultural Science & Technology
    • /
    • v.29 no.4
    • /
    • pp.366-373
    • /
    • 2011
  • To analyze the genetic relationships among Korean potato cultivars and to develop cultivar identification method using DNA markers, we carried out genotyping using simple sequence repeats (SSR) analysis and developed multiplex-SSR set. Initially, we designed 92 SSR primer combinations reported previously and applied them to twenty four Korean potato cultivars. Among the 92 SSR markers, we selected 14 SSR markers based on polymorphism information contents (PIC) values. PIC values of the selected 14 markers ranged from 0.48 to 0.89 with an average of 0.76. PIC value of PSSR-29 was the lowest with 0.48 and PSSR-191 was the highest with 0.89. UPGMA clustering analysis based on genetic distances using 14 SSR markers classified 21 potato cultivars into 2 clusters. Cluster I and II included 16 and 5 cultivars, respectively. And 3 cultivars were not classified into major cluster group I and II. These 14 SSR markers generated a total of 121 alleles and the average number of alleles per SSR marker was 10.8 with a range from 3 to 34. Among the selected markers, we combined three SSR markers, PSSR-17, PSSR-24 and PSSR-24, as a multiplex-SSR set. This multiplex-SSR set used in the study can distinguish all the cultivars with one time PCR and PAGE (Polyacrylamide gel electrophoresis) analysis and PIC value of multiplex-SSR set was 0.95.

Analysis of Grain Quality Properties in Korea-bred Japonica Rice Cultivars (우리나라 자포니카 벼 품종의 식미관련 미질특성 분석)

  • Choi, Yong-Hwan;Kim, Kwang-Ho;Choi, Hae-Chun;Hwang, Hung-Goo;Kim, Yeon-Gyu;Kim, Kee-Jong;Lee, Young-Tae
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.51 no.7
    • /
    • pp.624-631
    • /
    • 2006
  • This study was conducted to make clustering analysis based on major physicochemical characteristics related to palatability of cooked rice. 89 Korea-bred japonica rice cultivars could be largely classified into two groups, that is, Dongjinbyeo and Ilpumbyeo groups. The Ilpumbyeo group was divided into two subgroups; Ilpumbyeo and Chucheongbyeo groups. The two major rice groups showed significant difference in viscogram properties of rice flour. Ilpumbyeo group revealed slightly higher estimates of viscogram traits as compared with Dongiinbyeo group in average. Early-maturing rice group showed slighly lower estimates of taste meter and higher protein content compared with medium or medium late maturing ones. Also, early and medium-maturing groups exhibited slightly higher estimates of peak, hot and breakdown viscosities but lower estimates of consistenency and setback viscosities compared with medium-late-maturing one. The rice cultivars developed in 2000's revealed slightly higher estimates of peak, hot, cool and consistency viscosities compared with those in $1980's{\sim}1990's$. The grain quality properties significantly associated with the esimates of Toyo taste meter were protein and amylose contents and hot viscosity. The lower protein content and hot viscosity and the higher amylose content, the higher estimates of the taster meter. The protein content was highly negatively correlated with amylose content of milled rice. The important quality components contributed to multiple regression formula for estimating the Toyo taster meter values were protein content, alkali digestion value, and hot viscosity. The fittness of this formula was about 49% along with the coefficients of determination.