• Title/Summary/Keyword: Similarity search

Search Result 531, Processing Time 0.025 seconds

An Index-Based Approach for Subsequence Matching Under Time Warping in Sequence Databases (시퀀스 데이터베이스에서 타임 워핑을 지원하는 효과적인 인덱스 기반 서브시퀀스 매칭)

  • Park, Sang-Hyeon;Kim, Sang-Uk;Jo, Jun-Seo;Lee, Heon-Gil
    • The KIPS Transactions:PartD
    • /
    • v.9D no.2
    • /
    • pp.173-184
    • /
    • 2002
  • This paper discuss an index-based subsequence matching that supports time warping in large sequence databases. Time warping enables finding sequences with similar patterns even when they are of different lengths. In earlier work, Kim et al. suggested an efficient method for whole matching under time warping. This method constructs a multidimensional index on a set of feature vectors, which are invariant to time warping, from data sequences. For filtering at feature space, it also applies a lower-bound function, which consistently underestimates the time warping distance as well as satisfies the triangular inequality. In this paper, we incorporate the prefix-querying approach based on sliding windows into the earlier approach. For indexing, we extract a feature vector from every subsequence inside a sliding window and construct a multidimensional index using a feature vector as indexing attributes. For query processing, we perform a series of index searches using the feature vectors of qualifying query prefixes. Our approach provides effective and scalable subsequence matching even with a large volume of a database. We also prove that our approach does not incur false dismissal. To verify the superiority of our approach, we perform extensive experiments. The results reveal that our approach achieves significant speedup with real-world S&P 500 stock data and with very large synthetic data.

Invariant Classification and Detection for Cloth Searching (의류 검색용 회전 및 스케일 불변 이미지 분류 및 검색 기술)

  • Hwang, Inseong;Cho, Beobkeun;Jeon, Seungwoo;Choe, Yunsik
    • Journal of Broadcast Engineering
    • /
    • v.19 no.3
    • /
    • pp.396-404
    • /
    • 2014
  • The field of searching clothing, which is very difficult due to the nature of the informal sector, has been in an effort to reduce the recognition error and computational complexity. However, there is no concrete examples of the whole progress of learning and recognizing for cloth, and the related technologies are still showing many limitations. In this paper, the whole process including identifying both the person and cloth in an image and analyzing both its color and texture pattern is specifically shown for classification. Especially, deformable search descriptor, LBPROT_35 is proposed for identifying the pattern of clothing. The proposed method is scale and rotation invariant, so we can obtain even higher detection rate even though the scale and angle of the image changes. In addition, the color classifier with the color space quantization is proposed not to loose color similarity. In simulation, we build database by training a total of 810 images from the clothing images on the internet, and test some of them. As a result, the proposed method shows a good performance as it has 94.4% matching rate while the former Dense-SIFT method has 63.9%.

Influence Factors of Online-Based Interpersonal Relationships by Developmental Level -Centered on Social Networking Service Users - (대인관계 발달 단계에 따른 온라인기반 대인관계에 미치는 영향요인 - 소셜네트워크 서비스(SNS) 사용자를 중심으로 -)

  • Heo, Song-Ji;Kim, Ja-Young;Jang, Hee-Jin;Ko, Hye-Young;Park, Su-E
    • Journal of Korea Game Society
    • /
    • v.12 no.2
    • /
    • pp.75-89
    • /
    • 2012
  • In this paper, the correlation which is at work between affecting factors and interpersonal relationships' dimension depend on developmental level has been studied to search for clues about how to develop the online based interpersonal relationships -the fundamental aims of SNS related services- efficiently. People who'd ever entered into a relation through the 'online-based generated relationship' by SNS were divided into two groups on the development level. They were conducted a survey, and the results were derived using PLS statistics. As the result, 7 kinds of factors as social attraction, physical attraction, reciprocity, content quality, coexistence perception, information provision and similarity had an impact on the initial level of relationships, and 6 kinds of factors as social attraction, physical attraction, reciprocity, content quality, web appearance and coexistence perception had an impact on the developed level of relationships. This study could be utilized for the service design for facilitating interpersonal relationships efficiently by their level of development.

The Existence of a Putative Regulatory Element in 3'-Untranslated Region of Proto-oncogene HOX11's mRNA

  • Li, Yue;Jiang, Zhao-Zhao;Chen, Hai-Xu;Leung, Wai-Keung;Sung, Joseph J.Y.;Ma, Wei-Jun
    • BMB Reports
    • /
    • v.38 no.4
    • /
    • pp.500-506
    • /
    • 2005
  • HOX11 encodes a homeodomain-containing transcription factor which directs the development of the spleen during embryogenesis. While HOX11 expression is normally silenced through an unknown mechanism in all tissues by adulthood, the deregulation of HOX11 expression is associated with leukemia, such as T-cell acute lymphoblastic leukemia. The elucidation of regulatory elements contributing to the molecular mechanism underlying the regulation of HOX11 gene expression is of great importance. Previous reports of HOX11 regulatory elements mainly focused on the 5'-flanking region of HOX11 on the chromosome related to transcriptional control. To expand the search of putative cis-elements involved in HOX11 regulation at the post-transcriptional level, we analyzed HOX11 mRNA 3'-untranslated region (3'UTR) and found an AU-rich region. To characterize this AU-rich region, in vitro analysis of HOX11 mRNA 3'UTR was performed with human RNA-binding protein HuR, which interacts with AU-rich element (ARE) existing in the 3'UTR of many growth factors' and cytokines' mRNAs. Our results showed that the HOX11 mRNA 3'UTR can specifically bind with human HuR protein in vitro. This specific binding could be competed effectively by typical ARE containing RNA. After the deletion of the AU-rich region present in the HOX11 mRNA 3'UTR, the interaction of HOX11 mRNA 3'UTR with HuR protein was abolished. These findings suggest that HOX11 mRNA 3'UTR contains cis-acting element which shares similarity in the action pattern with RE-HuR interactions and may involve in the post-transcriptional regulation of the HOX11 gene.

Phylogenetic analysis of the medicinal mushroom and taxonomical positions of their commercial products (약용버섯의 계통분류 및 국내유통 Inonotus속내 종간 구별을 위한 신속동정법 개발)

  • Jin, Cheng-Yun;Jeong, Min-Jung;Kim, Gi-Young;Park, Jae-Min;Kim, Mun-Ok;Moon, Dong-Oh;Lee, Tae-Ho;Lee, Jae-Dong
    • Journal of Mushroom
    • /
    • v.3 no.2
    • /
    • pp.52-59
    • /
    • 2005
  • The Aphyllophorales is a large order containing about 2,000 known species. Many of these are the bracket and coral fungi. The vast majority of these fungi are saprophytic on the plant debris. Many species are significant in decomposing plant remains, as they are able to digest cellulose or lignin that occurs in plant cell walls. Many of these fungi have been involved in everyday human affairs. A few were used medicinally by the Greeks and Romans as a remedy for many complaints, including colic, fractured limbs and bruises. Other bracket fungi have been used as curry combs for horses, as snuff, as razor strops and as a source of dye for clothing. The texture of the basidiocarp may be similar to that of cork, wood, leather, paper, or cartilage. Unlike the basidiocarps of the Order Agaricales, the basidiocarps of the Aphyllophorales are not fleshly and moist. Division of the members of the Aphyllophorales into genera was originally made on the basis of gross morphology of the basidiocarp and hymenium and Donk(1964) recognizes 22 families in this order. The species and genus whose typical in Aphylloporales were listed in Table. with related information. The ITS region sequence of some genus were found by BLAST search. Sequences retrieved from GenBank were visually aligned by the program CLUSTAL G. As a result, the medicinal mushroom was separated in four groups. In this multiple alignment, the sequence analysis among Fomes group, Inonotus group and Phellinus group showed high genetic similarity except Hericium group and Sparassis group.

  • PDF

Region Based Image Similarity Search using Multi-point Relevance Feedback (다중점 적합성 피드백방법을 이용한 영역기반 이미지 유사성 검색)

  • Kim, Deok-Hwan;Lee, Ju-Hong;Song, Jae-Won
    • The KIPS Transactions:PartD
    • /
    • v.13D no.7 s.110
    • /
    • pp.857-866
    • /
    • 2006
  • Performance of an image retrieval system is usually very low because of the semantic gap between the low level feature and the high level concept in a query image. Semantically relevant images may exhibit very different visual characteristics, and may be scattered in several clusters. In this paper, we propose a content based image rertrieval approach which combines region based image retrieval and a new relevance feedback method using adaptive clustering together. Our main goal is finding semantically related clusters to narrow down the semantic gap. Our method consists of region based clustering processes and cluster-merging process. All segmented regions of relevant images are organized into semantically related hierarchical clusters, and clusters are merged by finding the number of the latent clusters. This method, in the cluster-merging process, applies r: using v principal components instead of classical Hotelling's $T_v^2$ [1] to find the unknown number of clusters and resolve the singularity problem in high dimensions and demonstrate that there is little difference between the performance of $T^2$ and that of $T_v^2$. Experiments have demonstrated that the proposed approach is effective in improving the performance of an image retrieval system.

WordNet-Based Category Utility Approach for Author Name Disambiguation (저자명 모호성 해결을 위한 개념망 기반 카테고리 유틸리티)

  • Kim, Je-Min;Park, Young-Tack
    • The KIPS Transactions:PartB
    • /
    • v.16B no.3
    • /
    • pp.225-232
    • /
    • 2009
  • Author name disambiguation is essential for improving performance of document indexing, retrieval, and web search. Author name disambiguation resolves the conflict when multiple authors share the same name label. This paper introduces a novel approach which exploits ontologies and WordNet-based category utility for author name disambiguation. Our method utilizes author knowledge in the form of populated ontology that uses various types of properties: titles, abstracts and co-authors of papers and authors' affiliation. Author ontology has been constructed in the artificial intelligence and semantic web areas semi-automatically using OWL API and heuristics. Author name disambiguation determines the correct author from various candidate authors in the populated author ontology. Candidate authors are evaluated using proposed WordNet-based category utility to resolve disambiguation. Category utility is a tradeoff between intra-class similarity and inter-class dissimilarity of author instances, where author instances are described in terms of attribute-value pairs. WordNet-based category utility has been proposed to exploit concept information in WordNet for semantic analysis for disambiguation. Experiments using the WordNet-based category utility increase the number of disambiguation by about 10% compared with that of category utility, and increase the overall amount of accuracy by around 98%.

A study on reduction of sensibility dimension for selection of wallpaper (벽지 선택을 위한 감성 차원 축소에 관한 연구)

  • Chun Young-Min;Kim Soon-Young;Kim Sung-Hwan;Chung Sung-Suk
    • Science of Emotion and Sensibility
    • /
    • v.8 no.4
    • /
    • pp.333-344
    • /
    • 2005
  • The sensitivity adjectives on wall paper are collected. With the collected sensitivity adjective, we are going to develop the model which can recommend the wallpaper to customer. A large number of adjectives describing affective responses were collected from such diverse sources as questionnaire survey results, field survey results and internet survey result. To search the representative adjective of collected adjective, we used the diverse statistical analysis method. We attempted to decide the axis name of dimension through the MDS(Multi-Dimensional Scale) analysis method using the similarity matrix an4 to find a three or four reduced factors through the factor analysis method using the varimax rotation method. The result of the analysis showed that the reduced factors could account about $82\%$ when the number of factor is three(popular, elegance, and passable) ant about $93\%$ when the number of factor is four (elegance, passable, beautiful, and affectionate) On the basis of this result, we expect it can be used to develop the model recommending the wallpaper.

  • PDF

Construction of a Full-length cDNA Library from Korean Stewartia (Stewartia koreana Nakai) and Characterization of EST Dataset (노각나무(Stewartia koreana Nakai)의 cDNA library 제작 및 EST 분석)

  • Im, Su-Bin;Kim, Joon-Ki;Choi, Young-In;Choi, Sun-Hee;Kwon, Hye-Jin;Song, Ho-Kyung;Lim, Yong-Pyo
    • Horticultural Science & Technology
    • /
    • v.29 no.2
    • /
    • pp.116-122
    • /
    • 2011
  • In this study, we report the generation and analysis of 1,392 expressed sequence tags (ESTs) from Korean Stewartia (Stewartia koreana Nakai). A cDNA library was generated from the young leaf tissue and a total of 1,392 cDNA were partially sequenced. EST and unigene sequence quality were determined by computational filtering, manual review, and BLAST analyses. Finally, 1,301 ESTs were acquired after the removal of the vector sequence and filtering over a minimum length 100 nucleotides. A total of 893 unigene, consisting of 150 contigs and 743 singletons, was identified after assembling. Also, we identified 95 new microsatellite-containing sequences from the unigenes and classified the structure according to their repeat unit. According to homology search with BLASTX against the NCBI database, 65% of ESTs were homologous with known function and 11.6% of ESTs were matched with putative or unknown function. The remaining 23.2% of ESTs showed no significant similarity to any protein sequences found in the public database. Annotation based searches against multiple databases including wine grape and populus sequences helped to identify putative functions of ESTs and unigenes. Gene ontology (GO) classification showed that the most abundant GO terms were transport, nucleotide binding, plastid, in terms biological process, molecular function and cellular component, respectively. The sequence data will be used to characterize potential roles of new genes in Stewartia and provided for the useful tools as a genetic resource.

Bacterial Community Structure Shift Driven by Salinity: Analysis of DGGE Band Patterns from Freshwater to Seawater of Hyeongsan River, Korea (염도의 변화에 따른 미생물 군집의 변화: 경북 형산강 하류 미생물 군집 변화의 DGGE pattern 분석)

  • Beck, Bo Ram;Holzapfel, Wilhelm;Hwang, Cher Won;Do, Hyung Ki
    • Journal of Life Science
    • /
    • v.23 no.3
    • /
    • pp.406-414
    • /
    • 2013
  • The influence of a gradual increase in salinity on the diversity of aquatic bacterial in rivers was demonstrated. The denaturing gradient gel electrophoresis (DGGE) was used to analyze the bacterial community shift downstream in the Hyeongsan River until it joins the open ocean. Four water samples were taken from the river showing the salinity gradients of 0.02%, 1.48%, 2.63%, and 3.62%. The samples were collected from four arbitrary stations in 2.91 km intervals on average, and a DGGE analysis was performed. Based on the results of this analysis, phylogenetic similarity identification, tree analysis, and a comparison of each station were performed. The results strongly suggested that the response of the bacterial community response was concomitant to gradual changes in salinity, which implies that salt concentration is a major factor in shifting the microbiota in aquatic habitats. The results also imply a huge diversity in a relatively small area upstream from the river mouth, compared to that in open oceans or coastal regions. Therefore, areas downstream towards a river mouth or delta are could be good starting points in the search for new bacterial species and strains ("biotypes").