• Title/Summary/Keyword: spectral clustering

Search Result 90, Processing Time 0.028 seconds

Enhancing Classification Performance by Separating Spectral Signature of Training Data Set (교사 자료의 분광 특징 분리에 의한 감독 분류 성능 향상)

  • 김광은
    • Korean Journal of Remote Sensing
    • /
    • v.18 no.6
    • /
    • pp.369-376
    • /
    • 2002
  • This paper presents a method to enhance the performance of supervised classification by separating the spectral signature of the training data sets for each class. Using clustering technique, a training data set is divided into several subsets which show a pattern of the normal distribution with small value of spectral variances. Then a supervised classification is applied with the divided training data set as training data for the temporary subclasses of the original class. The proposed method is applied to a Landsat TM image of Busan area for the applicability test. The result shows that the proposed method produces better classified results than the conventional statistical classification methods. It is expected that the proposed method will reduce the effort and expense for selecting the training data set for each class in an area which has spectrally homogeneous signature.

A Study of the Classification and Application of Digital Broadcast Program Type based on Machine Learning (머신러닝 기반의 디지털 방송 프로그램 유형 분류 및 활용 방안 연구)

  • Yoon, Sang-Hyeak;Lee, So-Hyun;Kim, Hee-Woong
    • Knowledge Management Research
    • /
    • v.20 no.3
    • /
    • pp.119-137
    • /
    • 2019
  • With the recent spread of digital content, more people have been watching the digital content of TV programs on their PCs or mobile devices, rather than on TVs. With the change in such media use pattern, genres(types) of broadcast programs change in the flow of the times and viewers' trends. The programs that were broadcast on TVs have been released in digital content, and thereby people watching such content change their perception. For this reason, it is necessary to newly and differently classify genres(types) of broadcast programs on the basis of digital content, from the conventional classification of program genres(types) in broadcasting companies or relevant industries. Therefore, this study suggests a plan for newly classifying broadcast programs through using machine learning with the log data of people watching the programs in online media and for applying the new classification. This study is academically meaningful in the point that it analyzes and classifies program types on the basis of digital content. In addition, it is meaningful in the point that it makes use of the program classification algorithm developed in relevant industries, and especially suggests the strategy and plan for applying it.

Hierarchical Clustering Approach of Multisensor Data Fusion: Application of SAR and SPOT-7 Data on Korean Peninsula

  • Lee, Sang-Hoon;Hong, Hyun-Gi
    • Proceedings of the KSRS Conference
    • /
    • 2002.10a
    • /
    • pp.65-65
    • /
    • 2002
  • In remote sensing, images are acquired over the same area by sensors of different spectral ranges (from the visible to the microwave) and/or with different number, position, and width of spectral bands. These images are generally partially redundant, as they represent the same scene, and partially complementary. For many applications of image classification, the information provided by a single sensor is often incomplete or imprecise resulting in misclassification. Fusion with redundant data can draw more consistent inferences for the interpretation of the scene, and can then improve classification accuracy. The common approach to the classification of multisensor data as a data fusion scheme at pixel level is to concatenate the data into one vector as if they were measurements from a single sensor. The multiband data acquired by a single multispectral sensor or by two or more different sensors are not completely independent, and a certain degree of informative overlap may exist between the observation spaces of the different bands. This dependence may make the data less informative and should be properly modeled in the analysis so that its effect can be eliminated. For modeling and eliminating the effect of such dependence, this study employs a strategy using self and conditional information variation measures. The self information variation reflects the self certainty of the individual bands, while the conditional information variation reflects the degree of dependence of the different bands. One data set might be very less reliable than others in the analysis and even exacerbate the classification results. The unreliable data set should be excluded in the analysis. To account for this, the self information variation is utilized to measure the degrees of reliability. The team of positively dependent bands can gather more information jointly than the team of independent ones. But, when bands are negatively dependent, the combined analysis of these bands may give worse information. Using the conditional information variation measure, the multiband data are split into two or more subsets according the dependence between the bands. Each subsets are classified separately, and a data fusion scheme at decision level is applied to integrate the individual classification results. In this study. a two-level algorithm using hierarchical clustering procedure is used for unsupervised image classification. Hierarchical clustering algorithm is based on similarity measures between all pairs of candidates being considered for merging. In the first level, the image is partitioned as any number of regions which are sets of spatially contiguous pixels so that no union of adjacent regions is statistically uniform. The regions resulted from the low level are clustered into a parsimonious number of groups according to their statistical characteristics. The algorithm has been applied to satellite multispectral data and airbone SAR data.

  • PDF

Rapid discrimination of commercial strawberry cultivars using Fourier transform infrared spectroscopy data combined by multivariate analysis

  • Kim, Suk Weon;Min, Sung Ran;Kim, Jonghyun;Park, Sang Kyu;Kim, Tae Il;Liu, Jang R.
    • Plant Biotechnology Reports
    • /
    • v.3 no.1
    • /
    • pp.87-93
    • /
    • 2009
  • To determine whether pattern recognition based on metabolite fingerprinting for whole cell extracts can be used to discriminate cultivars metabolically, leaves and fruits of five commercial strawberry cultivars were subjected to Fourier transform infrared (FT-IR) spectroscopy. FT-IR spectral data from leaves were analyzed by principal component analysis (PCA) and Fisher's linear discriminant function analysis. The dendrogram based on hierarchical clustering analysis of these spectral data separated the five commercial cultivars into two major groups with originality. The first group consisted of Korean cultivars including 'Maehyang', 'Seolhyang', and 'Gumhyang', whereas in the second group, 'Ryukbo' clustered with 'Janghee', both Japanese cultivars. The results from analysis of fruits were the same as of leaves. We therefore conclude that the hierarchical dendrogram based on PCA of FT-IR data from leaves represents the most probable chemotaxonomical relationship between cultivars, enabling discrimination of cultivars in a rapid and simple manner.

MR Brain Image Segmentation Using Clustering Technique

  • Yoon, Ock-Kyung;Kim, Dong-Whee;Kim, Hyun-Soon;Park, Kil-Houm
    • Proceedings of the IEEK Conference
    • /
    • 2000.07a
    • /
    • pp.450-453
    • /
    • 2000
  • In this paper, an automated segmentation algorithm is proposed for MR brain images using T1-weighted, T2-weighted, and PD images complementarily. The proposed segmentation algorithm is composed of 3 steps. In the first step, cerebrum images are extracted by putting a cerebrum mask upon the three input images. In the second step, outstanding clusters that represent inner tissues of the cerebrum are chosen among 3-dimensional (3D) clusters. 3D clusters are determined by intersecting densely distributed parts of 2D histogram in the 3D space formed with three optimal scale images. Optimal scale image best describes the shape of densely distributed parts of pixels in 2D histogram. In the final step, cerebrum images are segmented using FCM algorithm with it’s initial centroid value as the outstanding cluster’s centroid value. The proposed segmentation algorithm complements the defect of FCM algorithm, being influenced upon initial centroid, by calculating cluster’s centroid accurately And also can get better segmentation results from the proposed segmentation algorithm with multi spectral analysis than the results of single spectral analysis.

  • PDF

Traffic Speed Prediction Based on Graph Neural Networks for Intelligent Transportation System (지능형 교통 시스템을 위한 Graph Neural Networks 기반 교통 속도 예측)

  • Kim, Sunghoon;Park, Jonghyuk;Choi, Yerim
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.20 no.1
    • /
    • pp.70-85
    • /
    • 2021
  • Deep learning methodology, which has been actively studied in recent years, has improved the performance of artificial intelligence. Accordingly, systems utilizing deep learning have been proposed in various industries. In traffic systems, spatio-temporal graph modeling using GNN was found to be effective in predicting traffic speed. Still, it has a disadvantage that the model is trained inefficiently due to the memory bottleneck. Therefore, in this study, the road network is clustered through the graph clustering algorithm to reduce memory bottlenecks and simultaneously achieve superior performance. In order to verify the proposed method, the similarity of road speed distribution was measured using Jensen-Shannon divergence based on the analysis result of Incheon UTIC data. Then, the road network was clustered by spectrum clustering based on the measured similarity. As a result of the experiments, it was found that when the road network was divided into seven networks, the memory bottleneck was alleviated while recording the best performance compared to the baselines with MAE of 5.52km/h.

Discrimination of cultivation ages and cultivars of ginseng leaves using Fourier transform infrared spectroscopy combined with multivariate analysis

  • Kwon, Yong-Kook;Ahn, Myung Suk;Park, Jong Suk;Liu, Jang Ryol;In, Dong Su;Min, Byung Whan;Kim, Suk Weon
    • Journal of Ginseng Research
    • /
    • v.38 no.1
    • /
    • pp.52-58
    • /
    • 2014
  • To determine whether Fourier transform (FT)-IR spectral analysis combined with multivariate analysis of whole-cell extracts from ginseng leaves can be applied as a high-throughput discrimination system of cultivation ages and cultivars, a total of total 480 leaf samples belonging to 12 categories corresponding to four different cultivars (Yunpung, Kumpung, Chunpung, and an open-pollinated variety) and three different cultivation ages (1 yr, 2 yr, and 3 yr) were subjected to FT-IR. The spectral data were analyzed by principal component analysis and partial least squares-discriminant analysis. A dendrogram based on hierarchical clustering analysis of the FT-IR spectral data on ginseng leaves showed that leaf samples were initially segregated into three groups in a cultivation age-dependent manner. Then, within the same cultivation age group, leaf samples were clustered into four subgroups in a cultivar-dependent manner. The overall prediction accuracy for discrimination of cultivars and cultivation ages was 94.8% in a cross-validation test. These results clearly show that the FT-IR spectra combined with multivariate analysis from ginseng leaves can be applied as an alternative tool for discriminating of ginseng cultivars and cultivation ages. Therefore, we suggest that this result could be used as a rapid and reliable F1 hybrid seed-screening tool for accelerating the conventional breeding of ginseng.

Metabolic Discrimination of Papaya (Carica papaya L.) Leaves Depending on Growth Temperature Using Multivariate Analysis of FT-IR Spectroscopy Data (FT-IR 스펙트럼 다변량통계분석을 이용한 파파야(Carica papaya L.)의 생육온도 변화에 따른 대사체 수준 식별)

  • Jung, Young Bin;Kim, Chun Hwan;Lim, Chan Kyu;Kim, Sung Chel;Song, Kwan Jeong;Song, Seung Yeob
    • Journal of the Korean Society of International Agriculture
    • /
    • v.31 no.4
    • /
    • pp.378-383
    • /
    • 2019
  • To determine whether FT-IR spectral analysis based on multivariate analysis for whole cell extracts can be used to discriminate papaya at metabolic level. FT-IR spectral data from leaves were analyzed by principal component analysis (PCA), partial least square discriminant analysis (PLS-DA) and hierarchical clustering analysis (HCA). FT-IR spectra confirmed typical spectral differences between the frequency regions of 1,700-1,500, 1,500-1,300 and 1,100-950 cm-1, respectively. These spectral regions were reflecting the quantitative and qualitative variations of amide I, II from amino acids and proteins (1,700-1,500 cm-1), phosphodiester groups from nucleic acid and phospholipid (1,500-1,300 cm-1) and carbohydrate compounds (1,100-950 cm-1). The result of PCA analysis showed that papaya leaves could be separated into clusters depending on different growth temperature. In this case, showed discrimination confirmed according to metabolite content of growth condition from papaya. And PLS-DA analysis also showed more clear discrimination pattern than PCA result. Furthermore, these metabolic discrimination systems could be applied for rapid selection and classification of useful papaya cultivars.

IMAGING THE RADIO HALO IN THE ABELL 2256 CLUSTER OF GALAXIES

  • KIM K.-T.
    • Journal of The Korean Astronomical Society
    • /
    • v.32 no.2
    • /
    • pp.75-82
    • /
    • 1999
  • Diffuse radio emission in Abell 2256 was detected above 3 $\sigma$ with DRAO observations at 1420 MHz. The halo size is $\~13' {\times}10' (\~1h^{-1}_{50}\;Mpc$) in full extent and is elongated along a position angle of about $112^{\circ}$. The total flux density contained in the halo is 30$\pm$10 mJy at 1420 MHz and its spectral index is -2.04$\pm$0.04, showing no evidence for steepening up to 1420 MHz. Using the size estimate, yields a more reliable equipartition magnetic field strength which is $0.34(1 + k)^{2/7}{\mu}G$. In addition, five new radio sources are identified.

  • PDF

Object Detection from High Resolution Satellite Image by Using Genetic Algorithms

  • Kim Kwang-Eun
    • Proceedings of the KSRS Conference
    • /
    • 2005.10a
    • /
    • pp.120-122
    • /
    • 2005
  • With the commercial availability of very high resolution satellite imagery, the concealment of national confidential targets such as military facilities became one of the most bothering task to the image distributors. This task has been carried out by handwork masking of the target objects. Therefore, the quality of the concealment was fully depends on the ability and skill of a worker. In this study, a spectral clustering based technique for the seamless concealment of confidential targets in high resolution imagery was developed. The applicability test shows that the proposed technique can be used as a practical procedure for those who need to hide some information in image before public distribution

  • PDF