• Title/Summary/Keyword: 정렬법

Search Result 340, Processing Time 0.025 seconds

A method for comparing documents using fingerprinting and sequence alignment. (지문법과 서열정렬법을 결합한 다단계 정렬 방법의 문서 유사도 비교)

  • Seo, Jongkyu;Ock, Chang-Seok;Cho, Hwan-Gue
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2012.11a
    • /
    • pp.576-579
    • /
    • 2012
  • 문서유사도를 비교하는 방법은 지문법과 서열 정렬법이 널리 알려져 있다. 지문법은 계산속도가 빠른 대신 정확도가 떨어지며, 서열정렬법은 계산속도가 느린 대신 정확도가 높다. 다단계 정렬은 두 방법의 비중을 조절하여 문서 유사도를 비교할 수 있는 방법으로, 각 방법의 장점을 얻으면서 단점을 보완하도록 고안되었다[1]. 이 논문에서는 다단계 정렬방법에 대해 설명하고, 다단계정렬 방법에서 발생 가능한 단편화 문제를 제거하여 정확도를 향상시키는 방법에 대해 소개한다.

Proposal of Fast Counting Sort (빠른 계수 정렬법의 제안)

  • Lee, Sang-Un
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.15 no.5
    • /
    • pp.61-68
    • /
    • 2015
  • Among comparison sorts, no algorithm excels a current set lower bound of O(nlogn) in operation. Quicksort, the fastest of its kind, has a complexity of O(nlogn) at its best and on average and $O(n^2)$ at worst. This paper thus presents two methods: first is an O(n+k) simple counting sort which operates much more speedily than an O(n+k), (k=maximum value) counting sort, and second is an O(ln) radix counting sort which counts the frequency of numbers in the digit l of a data and saves it in a corresponding virtual bucket in an array, only to virtually divide the array into radix digit numbers. For the 6 experimental data, the proposed algorithm makes O(nlogn) or $O(n^2)$ of Quicksort simple into O(n+k) or O(ln). After all, the proposed sorting algorithm has proved to be much faster than the counting sort and Quicksort.

Alignment of a ring laser cavity by using the cavity transmission spectrum control method (투과광 스펙트럼 측정법을 이용한 링레이저 공진기의 광학적 정렬)

  • 전형욱;최용진;이기홍;신상훈;이혁수;손정영
    • Korean Journal of Optics and Photonics
    • /
    • v.8 no.6
    • /
    • pp.456-460
    • /
    • 1997
  • Several methods of aligning ring type cavities have been investigated for long time. The measurement of cavity transmission spectrum control is somewhat effective due to aligning cavity with measuring cavity loss at the same time. In this research, four mirrors are aligned and attached by the measurement of pulse width of the transmitted light. The intracavity loss is optimized to about 0.98%, giving the calculated total reflection coefficient of about 99.02%.

  • PDF

An Image Processing Method for Aligning the Positions of Semiconductor Package using Principal Component Analysis (주성분분석법을 이용한 반도체패키지의 위치정렬 영상처리기법)

  • Kim, Hak-Man
    • Proceedings of the KAIS Fall Conference
    • /
    • 2009.12a
    • /
    • pp.850-853
    • /
    • 2009
  • 반도체 조립공정에서 사용되는 Pick and Placement장비는 반도체패키지를 컴퓨터 비젼을 이용하여 위치 정렬하고 Placement Tray에 적재하는 장비로서 고속,고정밀도가 요구된다. 다변량 통계적 분석방법인 주성분 분석법은 주어진 데이터에서 특징이 되는 일정한 패턴을 찾는 방법으로 영상의 차원감소를 위해 최근 많이 사용되어지고 있다. 본 논문에서는 반도체패키지의 기하학적 형태를 이용하여 위치정렬을 하도록 한 후 성능을 검증하도록 하였다. 패키지 원영상에서 밝기값의 차이에 따른 윤곽선을 인식한 후, 각 위치값들을 주성분 분석법을 이용해 직선을 추출한 방법으로 위치정렬한 결과 신뢰할만한 위치정렬 성능을 보였다.

  • PDF

Multi-Level Sequence Alignment : An Adaptive Control Method Between Speed and Accuracy for Document Comparison (계산속도 및 정확도의 적응적 제어가 가능한 다단계 문서 비교 시스템)

  • Seo, Jong-Kyu;Tak, Haesung;Cho, Hwan-Gue
    • Journal of KIISE
    • /
    • v.41 no.9
    • /
    • pp.728-743
    • /
    • 2014
  • Finger printing and sequence alignment are well-known approaches for document similarity comparison. A fingerprinting method is simple and fast, but it can not find particular similar regions. A string alignment method is used for identifying regions of similarity by arranging the sequences of a string. It has an advantage of finding particular similar regions, but it also has a disadvantage of taking more computing time. The Multi-Level Alignment (MLA) is a new method designed for taking the advantages of both methods. The MLA divides input documents into uniform length blocks, and then extracts fingerprints from each block and calculates similarity of block pairs by comparing the fingerprints. A similarity table is created in this process. Finally, sequence alignment is used for specifying longest similar regions in the similarity table. The MLA allows users to change block's size to control proportion of the fingerprint algorithm and the sequence alignment. As a document is divided into several blocks, similar regions are also fragmented into two or more blocks. To solve this fragmentation problem, we proposed a united block method. Experimentally, we show that computing document's similarity with the united block is more accurate than the original MLA method, with minor time loss.

Probability Sampling to Select Polling Places in Exit Poll (출구조사를 위한 투표소 확률추출 방법)

  • Kim, Young-Won;Uhm, Yoon-Hee
    • Survey Research
    • /
    • v.6 no.2
    • /
    • pp.1-32
    • /
    • 2005
  • The accuracy of exit poll mainly depends on the sampling method of voting places. For exit poll, we propose a probability sampling method of selecting voting places as an alternative to the bellwether polling place sampling. Through an empirical study based on the 2004 general election data, the efficiency of the suggested systematic sampling from ordered voting places was evaluated in terms of mean prediction error and it turns out that the proposed sampling method outperformed the bellwether polling places sampling. We also calculated the variance of estimator from the proposed sampling, and considered the sample size problem to guarantee the target precision using the design effect of the proposed sample design.

  • PDF

Nonparametric Method Using an Alignment Method in a Randomized Block Design with Replications (반복이 있는 랜덤화 블록 계획법에서 정렬 방법을 이용한 비모수 검정법)

  • Lee, Min-Hee;Kim, Dong-Jae
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.1
    • /
    • pp.77-84
    • /
    • 2012
  • Mack and Skillings (1980) proposed a typical nonparametric method in a randomized block design with replications. However, this method may lose information because of the use of average observations instead of individual observations. In this paper, we proposed a nonparametric method that employed an aligned method suggested by Hodges and Lehmann (1962) under a randomized block design with replications. In addition, the comparative results of a Monte Carlo power study are presented.

An effective method for comparing similarity of document with Multi-Level alignment (다단계정렬을 활용한 효율적인 문서 유사도 비교법)

  • Seo, Jong-Kyu;Hwang, Hae-Lyen;Cho, Hwan-Gue
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2012.04a
    • /
    • pp.402-405
    • /
    • 2012
  • 문서와 문서간의 유사도들 측정하는 방법 은 크게 지문법 (fingerprint)을 이용한 방법과 서열 정렬(sequence alignment)알고리즘을 이용한 방법이 있다. 두 방법은 각각 속도와 정확도라는 장점을 가지고 있다. 다단계정렬(MLA, Multi-Level alignment))는 이러한 두 방법을 조합하여 탐색 속도와 정확도 사이의 비중을 사용자가 결정할 수 있도록 하기 위한 방법이다.[1] 다단계 정렬은 두 문서를 단위 블록(basis block)로 나누고 블록 간의 벡터를 비교하여 유사도를 측정하게 되는데, 본 연구에서는 초성 추출 및 어간 추출을 통해 단위 블록의 벡터를 빠른 시 간에 생성하고 비교하는 방법과 다단계 탐색을 통해 정확도를 유지하면서 빠르게 유사도를 측정하는 방식에 대해 설명한다. 실험결과 제안 방법을 통해 다단계 정렬 방법을 이용한 대용량 문서 비교의 속도가 2 배 이상 빨라짐을 보인다.

Nonparametric procedures using aligned method and linear placement statistics in randomized block design (랜덤화 블록 계획법에서 정렬방법과 선형위치통계량을 이용한 비모수 검정법)

  • Han, Jinjoo;Kim, Dongjae
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.7
    • /
    • pp.1411-1419
    • /
    • 2016
  • Nonparametric procedures in randomized block design was proposed by Friedman (1937) as a general alternative. This method is used to find out the difference in treatment effect. It can cause a loss of inter block information using the ranking in each block. This paper proposed nonparametric procedures using an aligned method proposed by Hodges and Lehmann (1962) to reduce block information based on joint placement suggest by Jo and Kim (2013) in a randomized block design. We also compared the power of the test of the proposed procedures and established method through a Monte Carlo simulation.

Nonparametric procedures using aligned method and joint placement in randomized block design (랜덤화 블록 계획법에서 정렬방법과 결합 위치를 이용한 비모수 검정법)

  • Jo, Sungdong;Kim, Dongjae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.1
    • /
    • pp.95-103
    • /
    • 2013
  • Nonparametric procedure in randomized block design (RBD) was proposed by Friedman (1937) for general alternatives. Also Page (1963) suggested the test for ordered alternatives in RBD. In this paper, we proposed the new nonparametric method in randomized block design using aligned method suggested by Hodges and Lehmann (1962) and the joint placement described in Chung and Kim (2007). Also, Monte Carlo simulation study was adapted to compare the power of the proposed procedure with those of previous procedure.