• 제목/요약/키워드: Clustered-tree

검색결과 128건 처리시간 0.023초

독립적인 벡터 근사에 의한 분산 벡터 근사 트리의 성능 강화 (Performance Enhancement of a DVA-tree by the Independent Vector Approximation)

  • 최현화;이규철
    • 정보처리학회논문지D
    • /
    • 제19D권2호
    • /
    • pp.151-160
    • /
    • 2012
  • 지금까지 제안된 분산 고차원 색인의 대부분은 균일한 분포를 가지는 데이터 집합에서 좋은 검색 성능을 나타내나, 편향되거나 클러스터를 이루는 데이터의 집합에서는 그 성능이 크게 감소된다. 본 논문은 강하게 클러스터를 이루거나 편향된 분포를 가지는 데이터 집합에 대한 분산 벡터 근사 트리의 k-최근접 검색 성능을 향상시키는 방법을 제안한다. 기본 아이디어는 전체 데이터를 클러스터링하는 상위 트리의 말단 노드가 담당하는 데이터 공간의 크기를 계산하고, 그 공간 상의 특징 벡터를 근사하는 데 사용되는 비트의 수를 달리하여 벡터 근사의 식별 능력을 보장하는 것이다. 즉, 고밀도 클러스터에는 더 많은 수의 비트를 할당하는 것이다. 우리는 합성 데이터와 실세계 데이터를 가지고 분산 hybrid spill-tree와 기존 분산 벡터 근사 트리와의 성능 비교 실험을 수행하였다. 실험 결과는 확장된 분산 벡터 근사 트리의 검색 성능이 균일하지 않은 분포의 데이터 집합에서 크게 향상되었음을 보인다.

클러스터화된 무선 네트워크에서 전송량을 고려한 효율적인 멀티캐스트 키 관리 기법 (Bandwidth Efficient Key Management for Secure Multicast in Clustered Wireless Networks)

  • 신승재;허준범;이한진;윤현수
    • 한국정보과학회논문지:정보통신
    • /
    • 제36권5호
    • /
    • pp.437-455
    • /
    • 2009
  • 무선 통신 기술의 발달로 인해 앞으로는 다양한 종류의 멀티캐스트 기반 서비스가 클러스터화된 무선 네트워크를 통하여 이루어질 것으로 예상된다. 보안성을 제공하는 멀티캐스트 서비스의 경우 암호화에 사용하는 그룹키의 관리가 중요한 문제가 된다. 따라서 다양한 종류의 그룹키 관리 기법들이 계속해서 제안되고 있다. 대표적인 그룹키 관리 기법 중 하나인 트리 기반 그룹키 관리 기법은 키 분배 센터가 전송해야 하는 키 갱신 메시지의 수를 효과적으로 줄인다는 장점을 지니고 있지만, 키 갱신 메시지를 전달하는데 실제로 소모되는 네트워크 대역폭을 정확히 고려하지 않고 있다. 본 논문은 그룹 멤버쉽이 동적으로 변하는 클러스터화된 무선 네트워크 환경에서 트리 기반 그룹키 관리 기법을 사용했을 때 키 갱신을 위한 대역폭 소모량을 효율적으로 절감할 수 있는 방법을 제시하고 있다. 컴퓨터 시뮬레이션을 통한 실험은 제안하는 방법이 기존의 기법들에 비해 매우 우수한 대역폭 절감 능력을 지니고 있음을 보여주고 있다.

돼지 유행성 설사 바이러스 국내분리주의 유전학적 특성 규명 (Genetic Characteristics of Porcine Epidemic Diarrhea Virus Isolated in Korea)

  • 지영철;권혁무;정현규;한정희
    • 대한수의학회지
    • /
    • 제43권2호
    • /
    • pp.219-230
    • /
    • 2003
  • Porcine epidemic diarrhea virus(PED), a member of Coronaviridea, is the etiological agent of enteropathogenic diarrhea in swine. The purpose of this study was to investigate genetic characteristic of PEDV isolated in Korea. Nucleocapsid(N) gene and membrane (M) gene of recent Korean PEDV strains isolated in 2001 were amplified, cloned, sequenced and analyzed. N gene of seven Korean PEDV field isolates bad 94.5% to 99.4% nucleotide and 92.4% to 99.4% amino acid sequence homology each other. Nucleotide and amino acid sequences of Korean field PEDVs were different from published foreign PEDVs, showing 95.1% to 98.0% nucleotide and 93.5% to 97.6% amino acid sequence homology. By phylogenetic tree analysis on based nucleotide sequences, PEDVs were clustered into four groups. By phylogenetic tree analysis based on amino acid sequences. PEDVs were clustered into five groups. M gene of our Korean PEDV field isolates had 99.6% to 100% nucleotide and 98.7% to 100% amino acid sequence homology each other. Nuclotide and amino acid sequences of Korean field PEDVs were different from published foreign PEDVs, showing 98.5% to 98.8% nucleotide and 97.3% to 97.8% amino acid sequence homology. By phylogenetic tree analysis based on nucleotide and amino acid sequences, PEDVs were clustered into two groups which were Korean PEDV isolate group and foreign PEDV isolate group.

Genetic Differentiation among the Mitochondrial ND2 Gene and $tRNA^{Trp}$ Gene Sequences of Genus Rana (Anura) in Korea

  • Lee, Hyuk;Yang, Suh-Yung;Lee, Hei-Yung
    • Animal cells and systems
    • /
    • 제4권1호
    • /
    • pp.31-37
    • /
    • 2000
  • The genetic variations among six species of Rana from Korea (R. nigro-maculata, R. piancyi, R. dybowskii, R. sp, R. rugosa type A, B and R. amurensis) were investigated using 499 bases of mitochondrial DNA sequences for ND2 (NADH dehydrogenase subunit 2) gene and $tRNA^{Trp}$ gene. Partial sequences of ND2 gene (427 bp) and full sequences of $tRNA^{Trp}$ gene (73 bp) were identified. The level of sequence divergences ranged from 0.2 to 5.2% within species and 4.9-28.0% among 6 species of the genus Rana. The $tRNA^{Trp}$ gene of the genus Rana was composed of 77 nucleotides which showed a two dimensional "cloverleaf" structure. The secondary structure of $tRNA^{Trp}$ was not found compensatory changes which could potentially confound phylogenetic inference. In the neighborjoining tree, brown frogs were clustered first with the level of sequence divergence of 13.20% between R. amurensis and R. dybowskii, and 9% between R. dybowskii and R. sp. supported by 99% bootstrap iterations, respectively. R. nigromaculata and R. plancyi were clustered into another group with 5.1% divergence supported by 100% bootstrap iteration. R. rugosa A 8nd B types were grouped by 4.9% divergence and clustered into the last group with other two groups with 100% bootstrap iterations.

  • PDF

Effective Acoustic Model Clustering via Decision Tree with Supervised Decision Tree Learning

  • Park, Jun-Ho;Ko, Han-Seok
    • 음성과학
    • /
    • 제10권1호
    • /
    • pp.71-84
    • /
    • 2003
  • In the acoustic modeling for large vocabulary speech recognition, a sparse data problem caused by a huge number of context-dependent (CD) models usually leads the estimated models to being unreliable. In this paper, we develop a new clustering method based on the C45 decision-tree learning algorithm that effectively encapsulates the CD modeling. The proposed scheme essentially constructs a supervised decision rule and applies over the pre-clustered triphones using the C45 algorithm, which is known to effectively search through the attributes of the training instances and extract the attribute that best separates the given examples. In particular, the data driven method is used as a clustering algorithm while its result is used as the learning target of the C45 algorithm. This scheme has been shown to be effective particularly over the database of low unknown-context ratio in terms of recognition performance. For speaker-independent, task-independent continuous speech recognition task, the proposed method reduced the percent accuracy WER by 3.93% compared to the existing rule-based methods.

  • PDF

GC-트리 : 이미지 데이타베이스를 위한 계층 색인 구조 (GC-Tree: A Hierarchical Index Structure for Image Databases)

  • 차광호
    • 한국정보과학회논문지:데이타베이스
    • /
    • 제31권1호
    • /
    • pp.13-22
    • /
    • 2004
  • 멀티미디어 데이타의 사용이 증가함에 따라 고차원 이미지 데이타에 대한 효율적인 색인과 검색 기법이 크게 요구되고 있다. 그러나 많은 노력에도 불구하고 현재의 다차원 색인 기법들은 고차원 데이타 공간에서 만족할 만한 성능을 보여주지 못하고 있다. 이러한 소위 차원의 저주를 해결하기 위해 최근에 차원을 줄이거나 근사 해를 구하는 둥의 접근법이 시도되고 있지만 이러한 방법들은 근본적으로 정확도의 상실이라는 문제를 갖고 있다. 정확도의 보존을 위해 VA-file, LPC-file둥과 같이 벡터 근사에 기반 한 기법들이 최근에 개발되었다. 그러나 이 기법은 검색 성능이 색인 파일의 크기에 큰 영향을 받으며, 한번에 큰 검색 공간을 줄이는 계층 색인 구조의 장점을 상실한다. 본 논문에서는 이미지 데이터베이스에서 유사성 질의를 위한 새로운 계층 색인 구조인 GC-트리를 제안한다. GC-트리는 밀도 함수에 기초하여 데이타 공간을 적응적으로 분할하고, 색인 구조를 동적으로 생성한다. 이러한 특성을 갖는 GC-트리는 군집화 된 고차원 이미지 데이타 검색에 훌륭한 성능을 나타낸다.

Inferring the Molecular Phylogeny of Chroococcalian Strains (Blue-green algae/Cyanophyta) from the Geumgang River, Based on Partial Sequences of 16S rRNA Gene

  • Lee, Wook-Jae;Bae, Kyung-Sook
    • Journal of Microbiology
    • /
    • 제40권4호
    • /
    • pp.335-339
    • /
    • 2002
  • Partial sequences of 16S rRNA gene of five chroococcalian blue-green algal strains, Aphanothece nidulans KCTC AG10041, Aphanothece naegelii KCTC AG10042, Microcystis aeruginosa KCTC AG10159, Microcystis ichthyoblabe KCTC AG10160, and Microcystis viridis KCTC AG10198, which were isolated from water from the Geumgang River, were determined and were inferred their phylogenetic and taxonomic positions among taxa of order Chroococcales. Most taxa of Chroococcales whose partial 16S rRNA gene sequences were aligned in this study, are clustered with other related taxa. Aphanothece nidulans KCTC AG10041 and Aphanothece naegelii KCTC AG10042 made a cluster with other European species of these genera, which supported 100% of the bootstrap trees with a very high sequence similarity (97.4-99.4%) in this study. Three strains, Microcystis aeruginosa KCTC AG10159, M. ichthyoblabe KCTC AG10160, and M. viridis KCTC AG10198, formed a cluster with other Microcystis spp. supported 100 % of the bootstrap trees with a similarity of 97.0-99.9% except for two strains. However, this phylogentic tree made no resolution among the species of Microcystis spp. The topology of the tree reconfirmed the taxonomic status of three species of Microcystis, identified in this study based on the morphology, as three colonial types of Microcystis aeruginosa com. nov. Otsuka et al. (1999c). The genera of chroococcalian cyanophytes are heterogeneously clustered in these sequence analyses. We suggest that more molecular studies on the genera of Chroococcales with reference strains, widely collected from restricted geographic or environmental ranges, get accurate taxonomic or phylogenetic determinations.

Comparative Genome-Scale Expression Analysis of Growth Phase-dependent Genes in Wild Type and rpoS Mutant of Escherichia coli

  • Oh, Tae-Jeong;Jung, Il-Lae;Woo, Sook-Kyung;Kim, Myung-Soon;Lee, Sun-Woo;Kim, Keun-Ha;Kim, In-Gyu;An, Sung-Whan
    • 한국미생물생명공학회:학술대회논문집
    • /
    • 한국미생물생명공학회 2004년도 Annual Meeting BioExibition International Symposium
    • /
    • pp.258-265
    • /
    • 2004
  • Numerous genes of Escherichia coli have been shown to growth phase-dependent expression throughout growth. The global patterns of growth phase-dependent gene expression of E. coli throughout growth using oligonucleotide microarrays containing a nearly complete set of 4,289 annotated open reading frames. To determine the change of gene expression throughout growth, we compared RNAs taken from timecourses with common reference RNA, which is combined with equal amount of RNA pooled from each time point. The hierarchical clustering of the conditions in accordance with timecourse expression revealed that growth phases were clustered into four classes, consistent with known physiological growth status. We analyzed the differences of expression levels at genome level in both exponential and stationary growth phase cultures. Statistical analysis showed that 213 genes are shown to, growth phase-dependent expression. We also analyzed the expression of 256 known operons and 208 regulatory genes. To assess the global impact of RpoS, we identified 193 genes coregulated with rpoS and their expression levels were examined in the isogenic rpoS mutant. The results revealed that 99 of 193 were novel RpoS-dependent stationary phase-induced genes and the majority of those are functionally unknown. Our data provide that global changes and adjustments of gene expression are coordinately regulated by growth transition in E. coli.

  • PDF

머신 러닝을 활용한 의류제품의 판매량 예측 모델 - 아우터웨어 품목을 중심으로 - (Sales Forecasting Model for Apparel Products Using Machine Learning Technique - A Case Study on Forecasting Outerwear Items -)

  • 채진미;김은희
    • 한국의류산업학회지
    • /
    • 제23권4호
    • /
    • pp.480-490
    • /
    • 2021
  • Sales forecasting is crucial for many retail operations. For apparel retailers, accurate sales forecast for the next season is critical to properly manage inventory and plan their supply chains. The challenge in this increases because apparel products are always new for the next season, have numerous variations, short life cycles, long lead times, and seasonal trends. In this study, a sales forecasting model is proposed for apparel products using machine learning techniques. The sales data pertaining to outerwear items for four years were collected from a Korean sports brand and filtered with outliers. Subsequently, the data were standardized by removing the effects of exogenous variables. The sales patterns of outerwear items were clustered by applying K-means clustering, and outerwear attributes associated with the specific sales-pattern type were determined by using a decision tree classifier. Six types of sales pattern clusters were derived and classified using a hybrid model of clustering and decision tree algorithm, and finally, the relationship between outerwear attributes and sales patterns was revealed. Each sales pattern can be used to predict stock-keeping-unit-level sales based on item attributes.

Genetic Relationships among Different Breeds of Chinese Gamecocks Revealed by mtDNA Variation

  • Qu, L.J.;Li, X.Y.;Yang, N.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제22권8호
    • /
    • pp.1085-1090
    • /
    • 2009
  • There are currently five primary breeds of Chinese gamecock, the Henan, Luxi, Tulufan, Xishuangbanna andZhangzhou. Though there is historical evidence of cockfighting in China dating as far back as 2,800 years, the origin and genetic relationships of these breeds are not well understood. We used sequence variation from the mtDNA cytb gene and control region (1,697 bp) to examine the domestication history and genetic relationship of the Chinese gamecock. From 75 samples (14-16 per breed) we found 34 haplotypes, and 45 variable nucleotides. Phylogenetic reconstruction indicated multiple origins of the gamecock breeds. The breeds in the north and center of China, Tulufan, Luxi and Henan, clustered together in a haplogroup and may have the same ancestor. However the southern breeds, Zhangzhou and Xishuangbanna clustered into two isolated haplogroups, suggesting another two origins of Chinese gamecock. Meanwhile, extensive admixture was also found because samples from different breeds, more or less, were always grouped together in the same clades. Based on these results, we discuss the possibilities of multiple origins of gamecock breeds, from both ancestral gamecocks as well as other domestic chickens and red jungle fowl.