• 제목/요약/키워드: Automatic clustering

검색결과 242건 처리시간 0.022초

자연어를 이용한 자동정보검색시스템 구축에 관한 연구 (A Study of Designing the Automatic Information Retrieval System based on Natural Language)

  • 서휘
    • 한국문헌정보학회지
    • /
    • 제35권4호
    • /
    • pp.141-160
    • /
    • 2001
  • 본 연구에서는 자연어를 이용하여 자동으로 정보검색을 수행하는 시스템을 구축하였다. 구현 시스템은 Delphi 4.0(PASCAL)으로 프로그래밍 하였으며, 자동색인, 클러스터링 기법, 자연어 계층관계의 구축과 표현, 자동정보탐색이 가능하도록 구성했다. 이 시스템을 이용하여 질의어의 표현, 생성, 확장, 탐색식의 구성, 피드백 탐색 등 정보탐색의 전과정을 자동으로 수행할 수 있었다.

  • PDF

자동정보검색을 위한 한글 시소러스 브라우저 구축에 관한 연구 (A Study of Designing the Han-Guel Thesaurus Browser for Automatic Information Retrieval)

  • 서휘
    • 한국도서관정보학회지
    • /
    • 제31권2호
    • /
    • pp.279-302
    • /
    • 2000
  • 본 연구는 질의어의 표현, 새성, 확장, 탐색식의 구성, 피드백 탐색 등 정보 탐색의 전과정을 지동으로 수행할 수 있는 한글 시소러스 브라우저 기반 자동정보검색 시스템을 구현하기 위해 시도되었다. 구현 시스템은 Delphi 4.0(PASCAL)으로 프로그래밍 되었으며, 자동색인, 클러스터링 기법, 시소러스의 구축과 표현, 자동정보겸색이 가능하도록 구성되었다. 구현된 시스템의 평가결과는 새로운 알고리즘에 의해 구축된 시소러스 브라우저가 정보검색에 있어서 시소러스의 구축의 용이성, 이용의 편리성, 검색 속도, 검색의 적합성 수준에서 우수힘을 입증하고 있다.

  • PDF

Nucleus Recognition of Uterine Cervical Pap-Smears using FCM Clustering Algorithm

  • Kim, Kwang-Baek
    • Journal of information and communication convergence engineering
    • /
    • 제6권1호
    • /
    • pp.94-99
    • /
    • 2008
  • Segmentation for the region of nucleus in the image of uterine cervical cytodiagnosis is known as the most difficult and important part in the automatic cervical cancer recognition system. In this paper, the region of nucleus is extracted from an image of uterine cervical cytodiagnosis using the HSI model. The characteristics of the nucleus are extracted from the analysis of morphemetric features, densitometric features, colormetric features, and textural features based on the detected region of nucleus area. The classification criterion of a nucleus is defined according to the standard categories of the Bethesda system. The fuzzy C-means clustering algorithm is employed to the extracted nucleus and the results show that the proposed method is efficient in nucleus recognition and uterine cervical Pap-Smears extraction.

Automatic Extraction of Blood Flow Area in Brachial Artery for Suspicious Hypertension Patients from Color Doppler Sonography with Fuzzy C-Means Clustering

  • Kim, Kwang Baek;Song, Doo Heon;Yun, Sang-Seok
    • Journal of information and communication convergence engineering
    • /
    • 제16권4호
    • /
    • pp.258-263
    • /
    • 2018
  • Color Doppler sonography is a useful tool for examining blood flow and related indices. However, it should be done by well-trained operator, that is, operator subjectivity exists. In this paper, we propose an automatic blood flow area extraction method from brachial artery that would be an essential building block of computer aided color Doppler analyzer. Specifically, our concern is to examine hypertension suspicious (prehypertension) patients who might develop their symptoms to established hypertension in the future. The proposed method uses fuzzy C-means clustering as quantization engine with careful seeding of the number of clusters from histogram analysis. The experiment verifies that the proposed method is feasible in that the successful extraction rates are 96% (successful in 48 out of 50 test cases) and demonstrated better performance than K-means based method in specificity and sensitivity analysis but the proposed method should be further refined as the retrospective analysis pointed out.

클러스터링을 이용한 시소러스 브라우저의 설계에 대한 이론적 연구 (A Theoretical Study of Designing Thesaurus Browser by Clustering Algorithm)

  • Seo, Hwi
    • 한국도서관정보학회지
    • /
    • 제30권3호
    • /
    • pp.427-456
    • /
    • 1999
  • This paper deals with the problems of information retrieval through full-test database which arise from both the deficiency of searching strategies or methods by information searcher and the difficulties of query representation, generation, extension, etc. In oder to solve these problems, we should use automatic retrieval instead of manual retrieval in the past. One of the ways to make the gap narrow between the terms by the writers and query by the searchers is that the query should be searched with the terms which the writers use. Thus, the preconditions which should be taken one accorded way to solve the problems are that all areas of information retrieval such as should taken one accorded way to solve the problems are that all areas of information retrieval such as contents analysis, information structure, query formation, query evaluation, etc. should be solved as a coherence way. We need to deal all the ares of automatic information retrieval for the efficiency of retrieval thought this paper is trying to solve the design of thesaurus browser. Thus, this paper shows the theoretical analyses about the form of information retrieval, automatic indexing, clustering technique, establishing and expressing thesaurus, and information retrieval technique. As the result of analyzing them, this paper shows us theoretical model, that is to say, the thesaurus browser by clustering algorithm. The result in the paper will be a theoretical basis on new retrieval algorithm.

  • PDF

Repeated Clustering to Improve the Discrimination of Typical Daily Load Profile

  • Kim, Young-Il;Ko, Jong-Min;Song, Jae-Ju;Choi, Hoon
    • Journal of Electrical Engineering and Technology
    • /
    • 제7권3호
    • /
    • pp.281-287
    • /
    • 2012
  • The customer load profile clustering method is used to make the TDLP (Typical Daily Load Profile) to estimate the quarter hourly load profile of non-AMR (Automatic Meter Reading) customers. This study examines how the repeated clustering method improves the ability to discriminate among the TDLPs of each cluster. The k-means algorithm is a well-known clustering technology in data mining. Repeated clustering groups the cluster into sub-clusters with the k-means algorithm and chooses the sub-cluster that has the maximum average error and repeats clustering until the final cluster count is satisfied.

시소러스 브라우저 자동구현을 위한 Metadata를 이용한 색인어 처리방안에 대한 연구 (A Theoretical Study on Indexing Methods using the Metadata for the Automatic Construction of a Thesaurus Browser)

  • 서휘
    • 한국도서관정보학회지
    • /
    • 제35권4호
    • /
    • pp.451-467
    • /
    • 2004
  • 본 연구에서는 시소러스 브라우저를 자동으로 구성하기 위한 방법에 대한 이론적인 연구와 함께 시소러스 브라우저 구성과정의 핵심인 자동색인과 용어 간 계층을 자동으로 형성하는 클러스터링 알고리즘에 대한 선행 연구결과를 제시하였다. 그리고 웹 문헌에서 전통적인 종이 형태 문헌의 서지사항에 해당하는 메타데이터를 분석하고 이를 처리하는 방안을 조사함에 의해 웹 문헌에서 색인어를 자동으로 추출할 수 있는 방안에 대하여 연구하였다. 또한 대부분의 웹 문헌에 메타데이터가 수록되어 있지 않음에 착안하여 기존의 웹 문헌에 메타데이터 자동 편집기를 이용하여 메타데이터를 수록하는 방안에 대한 연구결과를 제시하였다.

  • PDF

Unconstrained Object Segmentation Using GrabCut Based on Automatic Generation of Initial Boundary

  • Na, In-Seop;Oh, Kang-Han;Kim, Soo-Hyung
    • International Journal of Contents
    • /
    • 제9권1호
    • /
    • pp.6-10
    • /
    • 2013
  • Foreground estimation in object segmentation has been an important issue for last few decades. In this paper we propose a GrabCut based automatic foreground estimation method using block clustering. GrabCut is one of popular algorithms for image segmentation in 2D image. However GrabCut is semi-automatic algorithm. So it requires the user input a rough boundary for foreground and background. Typically, the user draws a rectangle around the object of interest manually. The goal of proposed method is to generate an initial rectangle automatically. In order to create initial rectangle, we use Gabor filter and Saliency map and then we use 4 features (amount of area, variance, amount of class with boundary area, amount of class with saliency map) to categorize foreground and background. From the experimental results, our proposed algorithm can achieve satisfactory accuracy in object segmentation without any prior information by the user.

자동검침 고객의 부하패턴을 이용한 일일 대표 부하패턴 생성 (Typical Daily Load Profile Generation using Load Profile of Automatic Meter Reading Customer)

  • 김영일;신진호;이봉재;양일권
    • 전기학회논문지
    • /
    • 제57권9호
    • /
    • pp.1516-1521
    • /
    • 2008
  • Recently, distribution load analysis using AMR (Automatic Meter Reading) data is researched in electric utilities. Load analysis method based on AMR system generates the typical load profile using load data of AMR customers, estimates the load profile of non-AMR customers, and analyzes the peak load and load profile of the distribution circuits and sectors per every 15 minutes/hour/day/week/month. Typical load profile is generated by the algorithm calculating the average amount of power consumption of each groups having similar load patterns. Traditional customer clustering mechanism uses only contract type code as a key. This mechanism has low accuracy because many customers having same contract code have different load patterns. In this research, We propose a customer clustring mechanism using k-means algorithm with contract type code and AMR data.

확장된 Fuzzy Clustering 알고리즘을 이용한 자동 목표물 검출 (Automatic Target Detection Using the Extended Fuzzy Clustering)

  • 김수환;강경진;이태원
    • 전자공학회논문지B
    • /
    • 제28B권10호
    • /
    • pp.842-913
    • /
    • 1991
  • The automatic target detection which automatically identifies the location of the target with its input image is one of the significant subjects of image processing field. Then, there are some problems that should be solved to detect the target automatically from the input image. First of all, the ambiguity of the boundary between targets or between a target and background should be solved and the target should be searched adaptively. In other words, the target should be identified by the relative brightness to the background, not by the absolute brightness. In this paper, to solve these problems, a new algorithm which can identify the target automatically is proposed. This algorithm uses the set of fuzzy for solving the ambiguity between the boundaries, and using the weight according to the brightness of data in the input image, the target is identified adaptively by the relative brightness to the background. Applying this algorithm to real images, it is experimentally proved that it is can be effectively applied to the automatic target detection.

  • PDF