• Title/Summary/Keyword: k-평균군집방법

Search Result 192, Processing Time 0.031 seconds

A Proposed Simple Method for Multisite Point Rainfall Generation (일강우자료의 다지점 모의 발생을 위한 간단한 방법 제안)

  • Yu, Cheol-Sang;Lee, Dong-Ryul
    • Journal of Korea Water Resources Association
    • /
    • v.33 no.1
    • /
    • pp.99-110
    • /
    • 2000
  • In this study we proposed a simple method for generating multi-site daily rainfall based on the 1-order Markov chain and considering the spatial correlation. The occurrence of rainfall is simulated by a simple 1st-order Markov chain and its intensity to be chosen randomly from the observed data. The spatial correlation between sites could be conserved as the rainfall intensity at each site is to be chosen consistently with the target site in time through generation. It is found that the generated daily rainfall data reproduce genera] characteristics of the observed data such as average, standard deviation, average number of wet and dry days, but the clustering level in time is somewhat loosened. Thus, the lag-I correlation coefficient of the generated data gave smaller value than the observed, also the average lengths of wet run and dry run and the wet-to-wet and dry-to-dry probabilities were a bit less than the observed. This drawback seems to be overcome somewhat by choosing a proper site representing overall basin characteristics or by use of more detailed states of rainfall occurrence.

  • PDF

A Similar Price Zone Determination of Public Land Price Using a Hybrid Clustering Technique (평균연결법과 K-means 혼합클러스터링 기법을 이용한 공시지가 유사가격권역의 설정)

  • Yi Seong-Kyu;Park Soo-Hong;Hong Sung-Eon
    • Journal of the Korean Geographical Society
    • /
    • v.41 no.1 s.112
    • /
    • pp.121-135
    • /
    • 2006
  • Even though the similar land price zone is very important element in the public land appraisal procedure, the concept is implicitly described and applied into the actual land appraisal system. This situation makes it worse when applying for the automatic selection of a comparative standard land parcel. In addition, the division of similar land price zones requires the objective and reasonable process for improving ALPAS(Automatic land Price Appraisal System), which becomes an issue today. To solve the similar land price zone determination problem that is caused by the lack of objective numerical standard, this study proposed a similar land price zone determination method using a hybrid clustering technique. Results showed that this hybrid clustering method that applied into the test area could easily detect similar land price zones with considerable accuracy levels, which are verified with some test statistics and real comparative standard land parcels done by manually.

Groundwater-use Estimation Method Based on Field Monitoring Data in South Korea (실측 자료에 기반한 우리나라 지하수의 용도별 이용량 추정 방법)

  • Kim, Ji-Wook;Jun, Hyung-Pil;Lee, Chan-Jin;Kim, Nam-Ju;Kim, Gyoo-Bum
    • The Journal of Engineering Geology
    • /
    • v.23 no.4
    • /
    • pp.467-476
    • /
    • 2013
  • With increasing interest in environmental issues and the quality of surface water becoming inadequate for water supply, the Korean government has launched a groundwater development policy to satisfy the demand for clean water. To drive this policy effectively, it is essential to guarantee the accuracy of sustainable groundwater yield and groundwater use amount. In this study, groundwater use was monitored over several years at various locations in Korea (32 cities/counties in 5 provinces) to obtain accurate groundwater use data. Statistical analysis of the results was performed as a method for estimating rational groundwater use. For the case of groundwater use for living purposes, we classified the cities/counties into three regional types (urban, rural, and urban-rural complex) and divided the groundwater facilities into five types (domestic use, apartment housing, small-scale water supply, schools, and businesses) according to use. For the case of agricultural use, we defined three regional types based on rainfall intensity (average rainfall, below-average rainfall, and above-average rainfall) and the facilities into six types (rice farming, dry-field farming, floriculture, livestock-cows, livestock-pigs, and livestock-chickens). Finally, we developed groundwater-use estimation equations for each region and use type, using cluster analysis and regression model analysis of the monitoring data. The results will enhance the reliability of national groundwater statistics.

Centroid Neural Network with Bhattacharyya Kernel (Bhattacharyya 커널을 적용한 Centroid Neural Network)

  • Lee, Song-Jae;Park, Dong-Chul
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.32 no.9C
    • /
    • pp.861-866
    • /
    • 2007
  • A clustering algorithm for Gaussian Probability Distribution Function (GPDF) data called Centroid Neural Network with a Bhattacharyya Kernel (BK-CNN) is proposed in this paper. The proposed BK-CNN is based on the unsupervised competitive Centroid Neural Network (CNN) and employs a kernel method for data projection. The kernel method adopted in the proposed BK-CNN is used to project data from the low dimensional input feature space into higher dimensional feature space so as the nonlinear problems associated with input space can be solved linearly in the feature space. In order to cluster the GPDF data, the Bhattacharyya kernel is used to measure the distance between two probability distributions for data projection. With the incorporation of the kernel method, the proposed BK-CNN is capable of dealing with nonlinear separation boundaries and can successfully allocate more code vector in the region that GPDF data are densely distributed. When applied to GPDF data in an image classification probleml, the experiment results show that the proposed BK-CNN algorithm gives 1.7%-4.3% improvements in average classification accuracy over other conventional algorithm such as k-means, Self-Organizing Map (SOM) and CNN algorithms with a Bhattacharyya distance, classed as Bk-Means, B-SOM, B-CNN algorithms.

Class Imbalance Resolution Method and Classification Algorithm Suggesting Based on Dataset Type Segmentation (데이터셋 유형 분류를 통한 클래스 불균형 해소 방법 및 분류 알고리즘 추천)

  • Kim, Jeonghun;Kwahk, Kee-Young
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.3
    • /
    • pp.23-43
    • /
    • 2022
  • In order to apply AI (Artificial Intelligence) in various industries, interest in algorithm selection is increasing. Algorithm selection is largely determined by the experience of a data scientist. However, in the case of an inexperienced data scientist, an algorithm is selected through meta-learning based on dataset characteristics. However, since the selection process is a black box, it was not possible to know on what basis the existing algorithm recommendation was derived. Accordingly, this study uses k-means cluster analysis to classify types according to data set characteristics, and to explore suitable classification algorithms and methods for resolving class imbalance. As a result of this study, four types were derived, and an appropriate class imbalance resolution method and classification algorithm were recommended according to the data set type.

Analysing the Relationship Between Tree-Ring Growth of Quercus acutissima and Climatic Variables by Dendroclimatological Method (연륜기후학적 방법에 의한 상수리나무의 연륜생장과 기후인자와의 관계분석)

  • Moon, Na Hyun;Sung, Joo Han;Lim, Jong Hwan;Park, Ko Eun;Shin, Man Yong
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.17 no.2
    • /
    • pp.93-101
    • /
    • 2015
  • This study was conducted to analyze the relationship between tree-ring growth of Quercus acutissima and climatic variables by dendroclimatological method. Annual tree-ring growth data of Quercus acutissima collected by the $5^{th}$ National Forest Inventory (NFI5) were organized to analyze the spatial distribution of the species growth pattern. To explain the relationship between tree-ring growth of Quercus acutissima and climatic variables, monthly temperature and precipitation data from 1950 to 2010 were compared with tree-ring growth data for each county. When tree-ring growth data were analyzed through cluster analysis based on similarity of climatic conditions, four clusters were identified. In addition, index chronology of Quercus acutissima for each cluster was produced through cross-dating and standardization procedures. The adequacy of index chronologies was tested using basic statistics such as mean sensitivity, auto correlation, signal to noise ratio, and expressed population signal of annual tree-ring growth. Response function analysis was conducted to reveal the relationship between tree-ring growth and climatic variables for each cluster. The results of this study are expected to provide valuable information necessary for estimating local growth characteristics of Quercus acutissima and for predicting changes in tree growth patterns caused by climate change.

A Study on Method of Classification by Walking Resting and Running Based on Motion of Wrist (손목 움직임 기반 휴식, 걷기, 달리기 분류에 관한 연구)

  • Ha, Jeong-Ho;Kim, Jun-Ho;Choe, Sun-Taag;Cho, We-Duke
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2016.10a
    • /
    • pp.172-175
    • /
    • 2016
  • 본 논문은 손목에 부착된 단일 3축 가속도 센서를 이용하여 사용자 움직임 기반의 휴식, 걷기, 달리기(느린속도, 빠른속도)를 분류하는 방법에 관한 연구이다. 초당 32회 표본 값의 가속도 정보에서 특징 신호인 평균, 표준편차를 산출하고 사용자의 행동상태를 4가지 상태로 분류한다. 분류 기준이 모호한 상태전이 신호에 대해 6가지 상태로 분류하여 구해진 총 10개의 행동상태 정보를 2차원 평면에 사영하고 최종적으로 K-means 군집화 기법을 적용하여 사용자의 행동상태를 4가지 상태로 분류한다.

A Study on the Optimization of State Tying Acoustic Models using Mixture Gaussian Clustering (혼합 가우시안 군집화를 이용한 상태공유 음향모델 최적화)

  • Ann, Tae-Ock
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.42 no.6
    • /
    • pp.167-176
    • /
    • 2005
  • This paper describes how the state tying model based on the decision tree which is one of Acoustic models used for speech recognition optimizes the model by reducing the number of mixture Gaussians of the output probability distribution. The state tying modeling uses a finite set of questions which is possible to include the phonological knowledge and the likelihood based decision criteria. And the recognition rate can be improved by increasing the number of mixture Gaussians of the output probability distribution. In this paper, we'll reduce the number of mixture Gaussians at the highest point of recognition rate by clustering the Gaussians. Bhattacharyya and Euclidean method will be used for the distance measure needed when clustering. And after calculating the mean and variance between the pair of lowest distance, the new Gaussians are created. The parameters for the new Gaussians are derived from the parameters of the Gaussians from which it is born. Experiments have been performed using the STOCKNAME (1,680) databases. And the test results show that the proposed method using Bhattacharyya distance measure maintains their recognition rate at $97.2\%$ and reduces the ratio of the number of mixture Gaussians by $1.0\%$. And the method using Euclidean distance measure shows that it maintains the recognition rate at $96.9\%$ and reduces the ratio of the number of mixture Gaussians by $1.0\%$. Then the methods can optimize the state tying model.

Study on Resources Annexation in Tongyeong Marine Ranching I. Effects of Zooplankton Attraction by Night-lights (통영 바다목장 자원조성을 위한 연구 I. 야간점등에 의한 동물플랑크톤 유도효과)

  • Yoon, Ho-Seop;Choi, Sang-Duk
    • Korean Journal of Environmental Biology
    • /
    • v.24 no.2 s.62
    • /
    • pp.126-137
    • /
    • 2006
  • Effect of night-lights on zooplankton attraction has been studied in Tongyeong marine ranch area during the period from 12 July to 30 August, 2004. Each sampling has been carried out to collect zooplankton from both control area in natural waters experiment area of night-lights waters at night. A total of 43 taxa of zooplankton occurred during the study. Copepods showed the prosperity in species number with 15 species. Acartia erythraed and Copepodite occurred abundantly in night-lights waters. Zooplankton abundance appeared to increase in night-lights mainly due to the gathering of copepods and larvae through the study period. Average $3\sim166$ times of zooplankton abundance was recorded in night-lights when compared with that in control area of natural waters due to the gathering of copepods and larvae. Cluster analysis, based on monthly abundance data of the 13 most frequent species, showed that the species were seperated into two different groups: the photo-positive group and the photo-negative group.

Seasonal Variations of Epilithic Biofilm Biomass and Community Structure at Byeonsan Peninsula, Korea (한국 변산반도 암반생물막의 생물량과 군집구조의 계절 변화)

  • Kim, Bo Yeon;Park, Seo Kyoung;Lee, Jung Rok;Choi, Han Gil
    • Korean Journal of Environment and Ecology
    • /
    • v.30 no.6
    • /
    • pp.1009-1021
    • /
    • 2016
  • The community structure and abundance of epilithic biofilm were bimonthly examined to know spatial and temporal patterns of biofilm biomass and taxonimical composition at the two study sites, Gosapo and Gyeokpo with different degrees of wave exposure levels from November 2010 to September 2011. Biomass was estimated by using chlorophyll a contents (Chl a), normalized difference vegetation index (NDVI), and vegetation index (VI). Cyanobacteria such as Aphanotece spp. predominated in the proportion of 57.53% at Gosapo and of 61.12% at Gyeokpo and they are abundant in mid shore and in summer at both study sites. The diatoms Navicula spp., Achnanthes spp. and Licmophora spp. were common species and they showed an increasing trend from high to low shore. NDVI, VI, and chl a contents were the greatest at mid shore for Gosapo (0.44, 3.05, $24.56{\mu}g/cm^2$) and at low shore for Gyeokpo (0.41, 2.73, $17.98{\mu}g/cm^2$). NDVI, VI, and chl a content were all maximal in January and minimal in March at the both sites. Average NDVI, VI, and chlorophyll a contents of biofilms were greater at Gosapo (0.43, 2.89, $22.84{\mu}g/cm^2$) than Gyeokpo (0.38, 2.48, $15.48{\mu}g/cm^2$).Of three shore levels(high, mid, and low) Chl a contents were positively correlated with NDVI and VI at the two study sites indicating that non-destructive NDVI and VI values can be used in stead of destructive Chl a extraction method. In conclusion, epilithic biofilm was more abundant seasonally in winter, vertically in mid and low intertidal zone, and horizontally at wave exposed shore than in summer, at high and sheltered shore in Korea.