• Title/Summary/Keyword: Data Clustering

Rapid discrimination system of Chinese cabbage (Brassica rapa) at metabolic level using Fourier transform infrared spectroscopy (FT-IR) based on multivariate analysis (배추 대사체 추출물의 FT-IR 스펙트럼 및 다변량 통계분석을 통한 계통 신속 식별 체계)

  • Ahn, Myung Suk;Lim, Chan Ju;Song, Seung Yeob;Min, Sung Ran;Lee, In Ho;Nou, Ill-Sup;Kim, Suk Weon
    • Journal of Plant Biotechnology / v.43 no.3 / pp.383-390 / 2016
  • To determine whether FT-IR spectral analysis combined with multivariate analysis can discriminate Chinese cabbage breeding lines at the metabolic level, whole-cell extracts of nine breeding lines (three paternal, three maternal, and three $F_1$ lines) were subjected to Fourier transform infrared spectroscopy (FT-IR). The FT-IR spectral data were analyzed by principal component analysis (PCA), partial least squares discriminant analysis (PLS-DA), and hierarchical clustering analysis (HCA). For two of the three cross combinations, the hierarchical dendrograms based on PLS-DA separated the paternal, maternal, and $F_1$ progeny samples perfectly into three branches in a breeding-line-dependent manner. For the remaining cross combination, however, the samples were not fully discriminated into three branches. Thus, hierarchical dendrograms based on PLS-DA of FT-IR spectral data from Chinese cabbage breeding lines could be used to represent the most probable chemotaxonomical relationship among maternal, paternal, and $F_1$ plants. Furthermore, this metabolic discrimination system could be applied to the rapid selection and classification of useful Chinese cabbage cultivars.

Wide-area Surveillance Applicable Core Techniques on Ship Detection and Tracking Based on HF Radar Platform (광역감시망 적용을 위한 HF 레이더 기반 선박 검출 및 추적 요소 기술)

  • Cho, Chul Jin;Park, Sangwook;Lee, Younglo;Lee, Sangho;Ko, Hanseok
    • Korean Journal of Remote Sensing / v.34 no.2_2 / pp.313-326 / 2018
  • This paper introduces core techniques for ship detection and tracking on a compact HF radar platform, which are necessary to establish a wide-area surveillance network. Currently, most HF radar sites are optimized primarily for observing sea surface radial velocities and bearings. As a result, many ship detection systems are vulnerable to error sources such as environmental noise and clutter when they are applied to these operational surface current observation systems. In addition, due to Korea's geographical features, only compact HF radars, which generate a non-uniform antenna response and provide no target information, are applicable. The ship detection and tracking techniques discussed in this paper take these practical conditions into account and were evaluated on real data collected from the Yellow Sea, Korea. The proposed method is composed of two parts. In the first part, ship detection, a constant false alarm rate (CFAR) based detector is applied and enhanced by a PCA subspace decomposition method that reduces noise. A clustering method is then applied to merge multiple detections that originate from a single target because of the Doppler effect during long coherent processing intervals (CPIs). Finally, a data association framework eliminates false detections by considering ship maneuvering over time. According to the evaluation results, the proposed method produces satisfactory results within certain ranges.
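
The detection stage described in this abstract can be illustrated with a minimal sketch. It assumes a one-dimensional cell-averaging CFAR over a list of power values, followed by a simple merge of neighbouring detections as a stand-in for the clustering step. The function names, window sizes, and threshold scale are hypothetical; the paper's actual detector, its PCA-based noise reduction, and its clustering method are not reproduced here.

```python
def ca_cfar(power, num_train=4, num_guard=1, scale=4.0):
    """Cell-averaging CFAR: flag cells whose power exceeds scale * local noise.

    num_train, num_guard and scale are illustrative values, not the paper's.
    """
    detections = []
    for i in range(len(power)):
        # Training cells on each side, skipping the guard cells around cell i.
        left = power[max(0, i - num_guard - num_train):max(0, i - num_guard)]
        right = power[i + num_guard + 1:i + num_guard + 1 + num_train]
        train = list(left) + list(right)
        if not train:
            continue
        noise = sum(train) / len(train)  # local noise estimate
        if power[i] > scale * noise:
            detections.append(i)
    return detections


def merge_adjacent(indices, max_gap=1):
    """Group sorted detection indices that lie within max_gap cells of each other."""
    clusters = []
    for idx in indices:
        if clusters and idx - clusters[-1][-1] <= max_gap:
            clusters[-1].append(idx)
        else:
            clusters.append([idx])
    return clusters


# Flat noise floor with one target spread over two neighbouring cells.
spectrum = [1.0] * 20
spectrum[9], spectrum[10] = 30.0, 40.0
hits = ca_cfar(spectrum)
targets = merge_adjacent(hits)
```

Here the two neighbouring detections collapse into a single cluster, which is the effect the abstract attributes to its clustering step. The guard cell keeps each target cell out of the other's noise estimate.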

Establishment of rapid discrimination system of leguminous plants at metabolic level using FT-IR spectroscopy with multivariate analysis (FT-IR 스펙트럼 기반 다변량통계분석기법에 의한 두과작물의 대사체 수준 식별체계 확립)

  • Song, Seung-Yeob;Ha, Tae-Joung;Jang, Ki-Chang;Kim, In-Jung;Kim, Suk-Weon
    • Journal of Plant Biotechnology / v.39 no.3 / pp.121-126 / 2012
  • To determine whether FT-IR spectroscopy of whole-cell extracts combined with multivariate analysis can be used to discriminate major leguminous plants at the metabolic level, seed extracts of six leguminous plants were subjected to Fourier transform infrared spectroscopy (FT-IR). FT-IR spectral data from the seed extracts were analyzed by principal component analysis (PCA), partial least squares discriminant analysis (PLS-DA), and hierarchical clustering analysis (HCA). PCA could not fully discriminate the six leguminous plants, whereas PLS-DA discriminated them successfully. The hierarchical dendrogram based on PLS-DA separated the six leguminous plants into four branches. The first branch consisted of all three Vigna species: Vigna radiata var. radiata, Vigna angularis var. angularis, and Vigna unguiculata subsp. unguiculata. Pisum sativum var. sativum, Glycine max L., and Phaseolus vulgaris var. vulgaris were each clustered into a separate branch. The overall results showed that the metabolic discrimination system was in accordance with known phylogenetic taxonomy. Thus, we suggest that the hierarchical dendrogram based on PLS-DA of FT-IR spectral data from seed extracts represents the most probable chemotaxonomical relationship among the six leguminous plants.

An Object Detection and Tracking System using Fuzzy C-means and CONDENSATION (Fuzzy C-means와 CONDENSATION을 이용한 객체 검출 및 추적 시스템)

  • Kim, Jong-Ho;Kim, Sang-Kyoon;Hang, Goo-Seun;Ahn, Sang-Ho;Kang, Byoung-Doo
    • Journal of Korea Society of Industrial Information Systems / v.16 no.4 / pp.87-98 / 2011
  • Detecting a moving object in video and tracking it are basic and necessary preprocessing steps in many video systems, such as object recognition, context awareness, and intelligent visual surveillance. In this paper, we propose a method that detects a moving object quickly and accurately when the background and lighting change in real time. Furthermore, our system robustly detects the target object even when it is occluded by other objects. For effective detection, an Eigen-space and fuzzy C-means (FCM) are combined, and the CONDENSATION algorithm is used to track the detected object robustly. First, training data collected from background images are linearly transformed using principal component analysis (PCA). Second, an Eigen-background is built from selected principal components that discriminate well between object and background. Next, an object is detected with FCM, which uses the result of convolving the Eigen-vectors from the previous steps with the input image. Finally, the object is tracked by using the coordinates of the detected object as the input to the CONDENSATION algorithm. Images that include various moving objects at the same time are collected and used as training data, so that the system can adapt to changes of lighting and background with a fixed camera. The test results show that the proposed method detects an object robustly under changes of lighting and background and under partial movement of the object.

The comparison of coauthor networks of two statistical journals of the Korean Statistical Society using social network analysis (소셜 네트워크분석을 활용한 통계학회 논문집과 응용통계연구 공저자 네트워크 비교)

  • Chun, Heuiju
    • Journal of the Korean Data and Information Science Society / v.26 no.2 / pp.335-346 / 2015
  • The purpose of this study is to use social network analysis to compare both the network influence of individual coauthors and the types and properties of the coauthor networks of two journals published by the Korean Statistical Society: Communications for Statistical Applications and Methods and the Korean Journal of Applied Statistics. In the comparison of network structure, density, inclusiveness, reciprocity, and clustering coefficient, which characterize the type of a coauthor network, show nearly identical values, while the Korean Journal of Applied Statistics has larger values of average degree, average distance, and diameter because it has more nodes than Communications for Statistical Applications and Methods. Overall, the two journals have very similar types of coauthor network. In the comparison of network centrality, the closeness centrality and betweenness centrality of the Korean Journal of Applied Statistics are larger than those of Communications for Statistical Applications and Methods at the 0.05 significance level. The coauthor network of the Korean Journal of Applied Statistics therefore has faster information delivery and stronger betweenness than that of Communications for Statistical Applications and Methods.
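
Three of the structural measures named in this abstract (density, average degree, and clustering coefficient) can be computed with a short sketch. It assumes an undirected coauthor graph built from per-paper author lists; the author labels and function names are hypothetical. The study's own data, software, and directed measures such as reciprocity are not reproduced.

```python
from itertools import combinations


def build_graph(papers):
    """Build an undirected adjacency map: authors of the same paper are linked."""
    adj = {}
    for authors in papers:
        for a in authors:
            adj.setdefault(a, set())
        for a, b in combinations(set(authors), 2):
            adj[a].add(b)
            adj[b].add(a)
    return adj


def density(adj):
    """Observed edges divided by the number of possible edges."""
    n = len(adj)
    if n < 2:
        return 0.0
    edges = sum(len(nbrs) for nbrs in adj.values()) / 2
    return edges / (n * (n - 1) / 2)


def average_degree(adj):
    """Mean number of coauthors per author."""
    return sum(len(nbrs) for nbrs in adj.values()) / len(adj)


def local_clustering(adj, node):
    """Fraction of a node's neighbour pairs that are themselves linked."""
    nbrs = adj[node]
    k = len(nbrs)
    if k < 2:
        return 0.0
    links = sum(1 for a, b in combinations(nbrs, 2) if b in adj[a])
    return links / (k * (k - 1) / 2)


def average_clustering(adj):
    """Mean of the local clustering coefficients over all nodes."""
    return sum(local_clustering(adj, v) for v in adj) / len(adj)


# Two hypothetical papers: one with three authors, one with two.
graph = build_graph([["A", "B", "C"], ["C", "D"]])
```

In this toy graph, density is 4/6, average degree is 2, and the average clustering coefficient is 7/12. Author D has a single coauthor and so contributes zero.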

Hierarchical Browsing Interface for Geo-Referenced Photo Database (위치 정보를 갖는 사진집합의 계층적 탐색 인터페이스)

  • Lee, Seung-Hoon;Lee, Kang-Hoon
    • Journal of the Korea Computer Graphics Society / v.16 no.4 / pp.25-33 / 2010
  • With the popularization of digital photography, people are now capturing and storing far more photos than ever before. However, the enormous number of photos often discourages users from searching for the photos they want. In this paper, we present a novel method for fast and intuitive browsing through large collections of geo-referenced photographs. Given a set of photos, we construct a hierarchical structure of clusters such that each cluster includes a set of spatially adjacent photos and its sub-clusters divide the photo set disjointly. For each cluster, we pre-compute its convex hull and the corresponding polygon area. At run-time, this pre-computed data allows us to efficiently visualize only the fraction of clusters that are inside the current view and have easily recognizable sizes with respect to the current zoom level. Each cluster is displayed as a single polygon representing its convex hull, rather than as every photo location included in the cluster. Users can move quickly from cluster to cluster by simply selecting any cluster of interest. Our system automatically pans and zooms the view until the currently selected cluster fits into the view at a moderate size. Our user study demonstrates that these new visualization and interaction techniques can significantly improve the ability to navigate large collections of geo-referenced photos.
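
The per-cluster pre-computation and run-time filtering described in this abstract can be sketched as follows. The sketch assumes planar (x, y) coordinates, Andrew's monotone chain for the convex hull, and the shoelace formula for the area. The visibility rule (bounding-box overlap plus a minimum area fraction of the view) and all names are hypothetical choices, not the authors' implementation.

```python
def cross(o, a, b):
    """Z-component of the cross product (a - o) x (b - o)."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_hull(points):
    """Andrew's monotone chain; returns hull vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def polygon_area(poly):
    """Shoelace formula for the area of a simple polygon."""
    total = 0.0
    for i in range(len(poly)):
        x1, y1 = poly[i]
        x2, y2 = poly[(i + 1) % len(poly)]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2


def precompute(clusters):
    """Map each cluster name to its (hull, area), computed once up front."""
    result = {}
    for name, pts in clusters.items():
        hull = convex_hull(pts)
        result[name] = (hull, polygon_area(hull))
    return result


def visible(summary, view, min_fraction=0.005):
    """Names of clusters overlapping the view and large enough to recognize."""
    xmin, ymin, xmax, ymax = view
    view_area = (xmax - xmin) * (ymax - ymin)
    shown = []
    for name, (hull, area) in summary.items():
        if not hull:
            continue
        xs = [p[0] for p in hull]
        ys = [p[1] for p in hull]
        overlaps = (min(xs) <= xmax and max(xs) >= xmin
                    and min(ys) <= ymax and max(ys) >= ymin)
        if overlaps and area >= min_fraction * view_area:
            shown.append(name)
    return shown


clusters = {
    "downtown": [(0, 0), (2, 0), (2, 2), (0, 2), (1, 1)],
    "pier": [(10, 10), (10.1, 10), (10, 10.1)],
}
summary = precompute(clusters)
```

At a wide view such as `(0, 0, 20, 20)`, only the larger cluster passes the size test. Zooming in to `(9, 9, 11, 11)` makes the small cluster large enough relative to the view to be drawn. Real longitude/latitude data would need a map projection before areas are compared.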

A Study on International Engineering Market Focusing on Engineering/Consulting Delivery System (엔지니어링 해외진출 활성화를 위한 유망국가 분석 - 시장 현황 및 입낙찰 절차를 중심으로 -)

  • Kim, Sang-Bum;Kwak, Hyun-Jun
    • Korean Journal of Construction Engineering and Management / v.14 no.2 / pp.171-183 / 2013
  • The domestic construction and engineering market has long been in recession due to the global economic crisis. Domestic construction industries are consequently looking to overseas construction markets, where relatively more construction projects are in steady demand. To provide meaningful information that helps Korean engineering companies keep pace with changes in the construction industry, various construction-related data and statistics are analyzed. In addition, previous research from related organizations and construction engineering companies is closely reviewed. Preliminary data and research were investigated to identify remedies for their overseas expansion. Moreover, foreign markets are classified into Asia, Africa, and others (Europe, North America/the Pacific, and Latin America) to provide a list of primary target countries and regional market information focused on their bidding systems. This study presents a comparative analysis of bidding procedures in Korea and the selected countries to suggest improvements to the domestic bidding system. Finally, this study suggests policy recommendations for meeting the bid qualification requirements needed to enter the global market, based on validated clustering of bidding data.

Development of the Approximate Cost Estimating Model Using Statistical Inference for PSC Box Girder Bridge Constructed by the Incremental Launching Method (통계적 기법을 활용한 ILM압출공법 교량 상부공사 개략공사비 산정모델 개발 연구)

  • Kim, Sang-Bum;Cho, Ji-Hoon
    • KSCE Journal of Civil and Environmental Engineering Research / v.33 no.2 / pp.781-790 / 2013
  • This research focuses on the development of conceptual cost estimation models for I.L.M. box girder bridges. The current conceptual cost estimation for public construction projects depends on governmental average unit price references, which many experts regard as inaccurate and unreliable. Therefore, there has been strong demand for better conceptual cost estimating methods. This research proposes three different conceptual cost estimating methods for a P.S.C. girder bridge built with the I.L.M. method. Model (I) seeks the proper breakdown of standard works that account for more than 95 percent of the total cost and calculates the quantity of materials for the standard works from the standard section and volume of the I.L.M. box girder bridge. Model (II) utilizes a correlation analysis (coefficient of 0.6 or more) between the breakdown of standard works and the input data that would be available in the preliminary design phase. Model (III) obtains the conceptual estimate through multiple regression analysis between the breakdown of standard works and all of the input data related to them. To validate the clustering of coverage in the preliminary design phase, the variation of I.L.M. cost coverage from the multiple regression analysis [Model (III)] was investigated; it ranged between -3.76% and +11.79%, compared with the variation between -5% and +15% that AACE (Association for the Advancement of Cost Engineering) reports for the design phase. The models proposed in this research are expected to improve to a great extent if reliable cost data for P.S.C. girder bridges can be continually collected with reasonable accuracy.

A Dynamic Recommendation System Using User Log Analysis and Document Similarity in Clusters (사용자 로그 분석과 클러스터 내의 문서 유사도를 이용한 동적 추천 시스템)

  • 김진수;김태용;최준혁;임기욱;이정현
    • Journal of KIISE: Software and Applications / v.31 no.5 / pp.586-594 / 2004
  • Because web documents are created and disappear rapidly, users require a recommendation system that lets them browse web documents conveniently and accurately. One largely untapped source of knowledge about large data collections is contained in the cumulative experiences of individuals finding useful information in the collection. Recommendation systems attempt to extract such useful information by capturing and mining one or more measures of the usefulness of the data. Existing Information Filtering systems have the shortcoming that they require a user profile. Collaborative Filtering systems have the shortcoming that users must first rate each web document, and in high-quantity, low-quality environments users may cover only a tiny percentage of the available documents. Dynamic recommendation systems based on user browsing patterns also present users with unrelated web documents. This paper classifies web documents using the similarity between web documents of the same document type and extracts a database of users' sequential browsing patterns from the session information in the web server log file. When a user accesses a web document, the proposed dynamic recommendation system recommends the top-N set of associated web documents that are highly similar to the current document, and also recommends a set with sequential specificity, using the extracted information and the users' session information.
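
The similarity-based part of the recommendation step can be illustrated with a minimal sketch. The abstract does not name the similarity measure, so cosine similarity over raw term counts is an assumption made here. The document names, texts, and function names are hypothetical, and the sequential-pattern mining from server logs is not covered.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two term-count vectors (Counter objects)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_n_similar(current, docs, n=2):
    """Rank the other documents in the same cluster by similarity to `current`."""
    vectors = {name: Counter(text.lower().split()) for name, text in docs.items()}
    scored = [
        (cosine(vectors[current], vec), name)
        for name, vec in vectors.items()
        if name != current
    ]
    # Highest similarity first; ties are broken alphabetically for determinism.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [name for _, name in scored[:n]]


# A hypothetical cluster of four short documents.
cluster_docs = {
    "a.html": "data clustering methods",
    "b.html": "clustering methods for data",
    "c.html": "cooking pasta recipes",
    "d.html": "data mining",
}
recommended = top_n_similar("a.html", cluster_docs, n=2)
```

For the page `a.html`, the ranking places `b.html` first and `d.html` second, while the unrelated `c.html` scores zero. A production system would more likely use weighted terms such as TF-IDF than raw counts.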

Construction of a Microsatellite Marker Database of Commercial Pepper Cultivars (유통 중인 고추 품종에 대한 Microsatellite 마커 Data Base 구축)

  • Kwon, Yong-Sham;Hong, Jee-Hwa;Choi, Keun-Jin
    • Horticultural Science & Technology / v.31 no.5 / pp.580-589 / 2013
  • This study was carried out to evaluate the suitability of microsatellite markers for varietal identification and for assessing genetic relationships among 170 commercial pepper cultivars. The relationship between marker genotypes and 11 pepper cultivars with different morphological traits was also analyzed. Of the 302 pairs of microsatellite primers screened against 11 pepper cultivars, 24 pairs were highly polymorphic in terms of the number of alleles. These markers were used to construct a DNA profile database for 170 commercial pepper cultivars. A total of 164 polymorphic amplified fragments were obtained from the 24 microsatellite primer pairs. The average polymorphism information content was 0.673, ranging from 0.324 to 0.824. The 164 microsatellite alleles were used to calculate Jaccard's distance coefficients for clustering by the unweighted pair group method. Clustering of the varieties based on the microsatellite analysis yielded 3 major groups corresponding to morphological traits. The phenogram discriminated all varieties by marker genotype. These microsatellite markers will be useful as a tool for protecting plant breeders' intellectual property rights through variety identification in distinctness, uniformity, and stability tests.
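
The quantities named in this abstract can be illustrated with a small sketch. It assumes the commonly used polymorphism information content formula, PIC = 1 - Σp_i² - ΣΣ 2p_i²p_j², a Jaccard distance on sets of allele labels, and average-linkage (UPGMA) merging. The abstract does not state which PIC formula or software the authors used, and the cultivar names and allele labels below are hypothetical.

```python
from itertools import combinations


def pic(freqs):
    """Polymorphism information content from one marker's allele frequencies."""
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(2 * p * p * q * q for p, q in combinations(freqs, 2))
    return 1 - homozygosity - cross_term


def jaccard_distance(a, b):
    """1 - |A intersect B| / |A union B| for two allele sets."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        return 0.0
    return 1 - len(a & b) / len(union)


def upgma(profiles):
    """Average-linkage clustering; returns merges as (cluster_x, cluster_y, distance)."""
    clusters = [(name,) for name in profiles]
    merges = []
    while len(clusters) > 1:
        best = None
        for x, y in combinations(clusters, 2):
            # Unweighted mean of all pairwise distances between the two clusters.
            d = sum(
                jaccard_distance(profiles[i], profiles[j]) for i in x for j in y
            ) / (len(x) * len(y))
            if best is None or d < best[0]:
                best = (d, x, y)
        d, x, y = best
        clusters = [c for c in clusters if c not in (x, y)] + [x + y]
        merges.append((x, y, d))
    return merges


# Hypothetical allele profiles for three cultivars.
profiles = {
    "cv1": {"m1_a", "m1_b", "m2_a"},
    "cv2": {"m1_a", "m1_b", "m2_b"},
    "cv3": {"m3_a", "m3_b", "m3_c"},
}
merges = upgma(profiles)
```

With two equally frequent alleles, `pic([0.5, 0.5])` gives 0.375. In the toy profiles, `cv1` and `cv2` merge first at distance 0.5, and `cv3` joins last at distance 1.0. The resulting merge order is what a phenogram would depict.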