• Title/Summary/Keyword: Similarity sampling

Search Result 122, Processing Time 0.033 seconds

A Scalable Clustering Method for Categorical Sequences (범주형 시퀀스들에 대한 확장성 있는 클러스터링 방법)

  • Oh, Seung-Joon;Kim, Jae-Yearn
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.14 no.2
    • /
    • pp.136-141
    • /
    • 2004
  • There has been enormous growth in the amount of commercial and scientific data, such as retail transactions, protein sequences, and web-logs. Such datasets consist of sequence data that have an inherent sequential nature. However, few clustering algorithms consider sequentiality. In this paper, we study how to cluster sequence datasets. We propose a new similarity measure to compute the similarity between two sequences. We also present an efficient method for determining the similarity measure and develop a clustering algorithm. Due to the high computational complexity of hierarchical clustering algorithms for clustering large datasets, a new clustering method is required. Therefore, we propose a new scalable clustering method using sampling and a k-nearest-neighbor method. Using a real dataset and a synthetic dataset, we show that the quality of clusters generated by our proposed approach is better than that of clusters produced by traditional algorithms.

Speckle Noise Reduction and Image Quality Improvement in U-net-based Phase Holograms in BL-ASM (BL-ASM에서 U-net 기반 위상 홀로그램의 스펙클 노이즈 감소와 이미지 품질 향상)

  • Oh-Seung Nam;Ki-Chul Kwon;Jong-Rae Jeong;Kwon-Yeon Lee;Nam Kim
    • Korean Journal of Optics and Photonics
    • /
    • v.34 no.5
    • /
    • pp.192-201
    • /
    • 2023
  • The band-limited angular spectrum method (BL-ASM) causes aliasing errors due to spatial frequency control problems. In this paper, a sampling interval adjustment technique for phase holograms and a technique for reducing speckle noise and improving image quality using a deep-learningbased U-net model are proposed. With the proposed technique, speckle noise is reduced by first calculating the sampling factor and controlling the spatial frequency by adjusting the sampling interval so that aliasing errors can be removed in a wide range of propagation. The next step is to improve the quality of the reconstructed image by learning the phase hologram to which the deep learning model is applied. In the S/W simulation of various sample images, it was confirmed that the peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM) were improved by 5% and 0.14% on average, compared with the existing BL-ASM.

Enhancement of the Box-Counting Algorithm for Fractal Dimension Estimation (프랙탈 차원 추정을 위한 박스 계수법의 개선)

  • So, Hye-Rim;So, Gun-Baek;Jin, Gang-Gyoo
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.22 no.9
    • /
    • pp.710-715
    • /
    • 2016
  • Due to its simplicity and high reliability, the box-counting(BC) method is one of the most frequently used techniques to estimate the fractal dimensions of a binary image with a self-similarity property. The fractal calculation requires data sampling that determines the size of boxes to be sampled from the given image and directly affects the accuracy of the fractal dimension estimation. There are three non-overlapping regular grid methods: geometric-step method, arithmetic-step method and divisor-step method. These methods have some drawbacks when the image size M becomes large. This paper presents a BC algorithm for enhancing the accuracy of the fractal dimension estimation based on a new sampling method. Instead of using the geometric-step method, the new sampling method, called the coverage ratio-step method, selects the number of steps according to the coverage ratio. A set of experiments using well-known fractal images showed that the proposed method outperforms the existing BC method and the triangular BC method.

Comparison of Regional Differences of PCBs Concentration Using Pine Needles and Soil (지역별 소나무잎과 토양에 침착된 PCBs 농도 비교)

  • Chun, Man-Young;Kim, Tae-Wook
    • Environmental Analysis Health and Toxicology
    • /
    • v.24 no.3
    • /
    • pp.251-259
    • /
    • 2009
  • This study was conducted to measure the concentration of PCBs in pine needles and soil in urban (Seoul, many artificial sources of PCBs), semi-rural (Anseong, small town located below Seoul in wind direction) and rural areas (Jincheon, rarely artificial sources of PCBs) in which the artificial production amount of PCBs are different. The total PCBs concentrations in pine needles, which did not show big difference in three sampling sites, were 107.5 pg/g (urban), 94.8 pg/g (semi-rural) and 78.8 pg/g (rural) respectively. The low chlorinated PCBs were major component in pine needles and the PCBs congener concentration profile of each sampling area were similar each other, and the octanol-air partitioning coefficient, Koa, highly correlated with the PCBs concentrations in pine needles. The total PCBs concentrations in soil which did show big difference in three sampling sites, were 830.0 pg/g (urban), 314.1 pg/g (semi-rural) and 136.5 pg/g (rural) respectively. The high chlorinated PCBs were major component in soil and the PCBs congener concentration profile of each sampling area were similar each other. There was no similarity between the PCBs concentration of pine needles and those of soil at each site, because of the different mechanism of deposition and volatilization processes of PCBs. The total PCBs concentrations of 2009 became 12.9 times lower than those of 2001. The reduce rate of PCB 28 was the greatest.

Genetic Variability of Sorghum Charcoal Rot Pathogen (Macrophomina phaseolina) Assessed by Random DNA Markers

  • Bashasab, Rajkumar, Fakrudin;Kuruvinashetti, Mahaling S
    • The Plant Pathology Journal
    • /
    • v.23 no.2
    • /
    • pp.45-50
    • /
    • 2007
  • Genetic diversity among selected isolates of Macrophomina phaseolina, a causal agent of charcoal rot (stalk rot) disease in sorghum was studied using PCR-RAPD markers. A set of ten isolates, from ten different rabi sorghum genotypes representing two traditional sorghum growing situations viz., Dharwad- a transitional high rainfall region and Bijapur- a semi-arid low rainfall region in South India. From a set of 40 random primers tested, amplicon profiles of 15 were reproducible. A total of 149 amplicon levels, with an average of 9.9 bands per primer, were available for analysis, of which 148 were polymorphic (99.3%). It was possible to discriminate all the isolates with any of the 15 primers employed. UPGMA clustering of data indicated that the isolates shared varied levels of genetic similarity within a range of 0.14 to 0.72 similarity coefficient index and it was suggestive that grouping of isolates was not related to sampling location in anyway. A high level of genetic heterogeneity of 0.28 was recorded among the isolates.

Similarity-Based Subsequence Search in Image Sequence Databases (이미지 시퀀스 데이터베이스에서의 유사성 기반 서브시퀀스 검색)

  • Kim, In-Bum;Park, Sang-Hyun
    • The KIPS Transactions:PartD
    • /
    • v.10D no.3
    • /
    • pp.501-512
    • /
    • 2003
  • This paper proposes an indexing technique for fast retrieval of similar image subsequences using the multi-dimensional time warping distance. The time warping distance is a more suitable similarity measure than Lp distance in many applications where sequences may be of different lengths and/or different sampling rates. Our indexing scheme employs a disk-based suffix tree as an index structure and uses a lower-bound distance function to filter out dissimilar subsequences without false dismissals. It applies the normaliration for an easier control of relative weighting of feature dimensions and the discretization to compress the index tree. Experiments on medical and synthetic image sequences verify that the proposed method significantly outperforms the naive method and scales well in a large volume of image sequence databases.

The Effects of the Attributes of Korean Celebrity Advertising Models on Chinese Consumer's Intention to Purchase Korean Fashion Brands (한국 연예인 광고모델 속성이 중국 소비자 한국 패션브랜드 구매도에 미치는 영향)

  • Kwon, Yoo-Jin;Hong, Byung-Sook;Seo, Si-Won;Cho, Mi-Ae
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.33 no.3
    • /
    • pp.477-488
    • /
    • 2009
  • As the Korean cultural contents, such as drama, films, music, gained popularity in China, Korean fashion brands used Korean celebrities as their models to as a sales promotion strategy for Chinese consumers. With the point of view that the advertising model as a human capital as well, the purpose of this study is to investigate the factors of attributes of Korean celebrity advertising model, and to analyze effects on fashion brand recognition, preference, trust and purchase intention. With convenience sampling, the research surveyed Shanghai consumers in their 20's to early 30's who had purchased Korean fashion items. The 291 responses were analyzed by frequency analysis, reliability test, factor analysis, multiple regression analysis, The results are as follows. Frist, Korean celebrity advertising model attribute factors were divided into similarity, familiarity, popularity, attractiveness and trust. Second, the brand recognition was affected by similarity, familiarity and popularity factors, and the brand preference was affected by similarity, familiarity, popularity and attractiveness factors. Third, the trust of Korean fashion brands was affected by similarity, familiarity, attractiveness, trust, brand recognition and brand preference. Lastly, the intention to purchase Korean Fashion brand was affected by similarity, familiarity, attractiveness, brand recognition, brand preference and brand trust.

The benefit of one cannot replace the other: seagrass and mangrove ecosystems at Santa Fe, Bantayan Island

  • Mendoza, Ayana Rose R.;Patalinghug, Jenny Marie R.;Divinagracia, Joshua Ybanez
    • Journal of Ecology and Environment
    • /
    • v.43 no.2
    • /
    • pp.183-190
    • /
    • 2019
  • Background: In the Philippines, the practice of planting mangroves over seagrass has been a practice done to promote coastline protection from damages done by storms. Despite the added protection to the coastline, the addition of an artificial ecosystem gradually inflicts damage to the ecosystem already established. In this study, seagrass communities that had no history of mangrove planting were compared with those that had mangrove planting. The percent substrate cover of seagrass in the sampling areas was determined, and the macroinvertebrates present in the sampling areas were also observed. The study was conducted based on reports of mangrove planting activity that disrupted seagrass functions on Santa Fe, Bantayan Island, Cebu. Transect-quadrat method sampling was done to assess the chosen sites. Results: Six species of seagrass was found on the site without mangrove planting which was barangay Ocoy (Cymodocea sp., Thalassia sp., Halodule sp., Enhalus sp., Halophila sp., and Syringodium sp.) and had a higher percent cover, while only four were found on the site with mangrove planting (barangay Marikaban). It was also found that barangay Marikaban had a lesser Shannon-Wiener and Simpson's index compared to barangay Ocoy. Jaccard's index of similarity between the two sites was low. Conclusion: With the results of the assessment, we recommend proper monitoring of future mangrove planting activities and that these activities should not disrupt another ecosystem as all ecosystems are important.

Random Pixel Sampling-based Backlight Dimming for Liquid Crystal Display (LCD 디스플레이를 위한 무작위 화소 추출 기반 백라이트 디밍)

  • Kang, Suk-Ju;Kim, Young Hwan
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.51 no.11
    • /
    • pp.174-180
    • /
    • 2014
  • In this paper, we propose the random pixel sampling technique to solve the high computational complexity in the perceptual SSIM-based backlight dimming. Specifically, the proposed algorithm selects pixels in a total frame considering the pre-defined number, and generates the block by combining these pixels. Then, it estimates parameters, which are required in the SSIM calculation, in the combined block, and hence, it can reduce the computation time significantly. In the experimental results, the proposed algorithm reduced the average power consumption and computation time by up to 38.1776 % and 99.5828 %, respectively while preserving the average SSIM., compared with the conventional algorithm.

The Brand Image Retrieval System Based on Color and Shape (컬러와 형태에 기반을 둔 상표 영상 검색 시스템)

  • Shin, Seong-Yoon;Pyo, Seong-Bae
    • Journal of the Korea Society of Computer and Information
    • /
    • v.11 no.3
    • /
    • pp.167-172
    • /
    • 2006
  • An image retrieval system retrieves and offers same of similar image based on various features of image. This paper present a brand image retrieval system based on color and shape of image. We use the image for a color information by dividing into the area and extracting the area color distribution histogram. We use for the shape information by preprocessing of the boundary extraction, the centroid extraction, angular sampling etc. and calculating of the sum of the distance from the centroid to the boundary, the standard deviation, and the rate of long axis to short axis. We accomplish the retrieval through a similarity measurement by using the color and shape information which is extracted in this way.

  • PDF