• Title/Summary/Keyword: cluster data


Improving Forecasts of Dam Inflow Using Rescaling Errors From ANN and Regression Model (ANN과 회귀모형의 오차 수정을 통한 댐 유입량 예측 향상)

  • Jang, Sun-Woo;Yoo, Ji-Young;Kim, Tae-Woong
    • Proceedings of the Korea Water Resources Association Conference / 2010.05a / pp.1164-1168 / 2010
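  • As water resources play an increasingly important role in everyday life, research on efficient dam operation and stable water supply has continued steadily. Since the 1990s, many studies have forecast inflow with artificial neural networks (ANN), which are good at simulating nonlinear behavior. However, although regression-type models, including ANN models, are easy to apply to forecasting monthly rainfall and inflow, their forecast accuracy is limited. This study proposes a method that improves forecasting performance by reducing the prediction errors of an ANN model and a regression model through post-processing. The study area is the Daecheong Dam basin in the Geum River system; the models were built from monthly rainfall observed at 11 rain gauge stations in the basin and dam inflow for September 1982 through December 2005. Input variables were selected from the rainfall and inflow data using autocorrelation and cross-correlation functions, the data were normalized as a pre-processing step, forecasting models were built with the ANN and regression models, and the errors were rescaled using cluster analysis to improve forecasting performance. Evaluated by RMSE and the correlation coefficient, the models with this error post-processing improved forecasting performance by about 40%.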
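The abstract evaluates forecasts by RMSE and the correlation coefficient. A minimal sketch of those two metrics follows, assuming plain Python lists of observed and predicted inflows; the function names and the `relative_improvement` helper are illustrative and not taken from the paper, and the error-rescaling step itself is not reproduced here.

```python
import math

def rmse(observed, predicted):
    """Root-mean-square error between two equal-length sequences."""
    if len(observed) != len(predicted):
        raise ValueError("sequences must have the same length")
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)

def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def relative_improvement(rmse_before, rmse_after):
    """Fractional RMSE reduction (hypothetical helper), e.g. 0.4 means 40% lower."""
    return (rmse_before - rmse_after) / rmse_before

# Hypothetical monthly inflows, for illustration only.
observed = [10.0, 20.0, 30.0, 40.0]
raw_forecast = [14.0, 15.0, 36.0, 33.0]
corrected_forecast = [12.0, 18.0, 33.0, 37.0]

before = rmse(observed, raw_forecast)
after = rmse(observed, corrected_forecast)
print(before, after, relative_improvement(before, after))
print(pearson_r(observed, corrected_forecast))
```

Comparing RMSE before and after post-processing is one way to express an improvement figure such as the "about 40%" quoted above; whether the paper computes it exactly this way is an assumption.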


Regionalization using cluster probability model and copula based drought frequency analysis (클러스터 확률 모형에 의한 지역화와 코풀라에 의한 가뭄빈도분석)

  • Azam, Muhammad;Choi, Hyun Su;Kim, Hyeong San;Hwang, Ju Ha;Maeng, Seungjin
    • Proceedings of the Korea Water Resources Association Conference / 2017.05a / pp.46-46 / 2017
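  • The reliability of quantile estimates in regional drought frequency analysis depends on the long-term historical records and the analysis procedure used to delineate hydrologically homogeneous regions. However, extreme droughts occur very rarely, and record lengths are often too short for reliable regional frequency analysis. In addition, Korea's complex topographic and climatic characteristics call for statistical methods to delineate homogeneous regions. The regional frequency analysis applied in this study identifies homogeneous regions by analyzing hydrometeorological characteristics, which vary across regions, and addresses the delineation problem by jointly applying the main drought variables (duration and severity) to estimate quantiles for each homogeneous region. A model-based clustering method built on the Gaussian mixture model (GMM) was applied to delineate the optimal homogeneous regions, and the results were checked with the likelihood ratio test and other validity indices. Gaussian mixture model dimension reduction (GMMDR) was applied to represent the parameters estimated by the GMM in a reduced-dimension space. These variables were then estimated and compared for drought frequency analysis using the goodness of fit of various distributions and copulas. As a result, Korea was divided into four homogeneous regions. The Pearson type III (PE3) distribution with Gaussian and Frank copulas was found to be suitable for estimating the joint distribution of drought duration and severity in Korea.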
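To illustrate how a copula joins drought duration and severity, here is a minimal sketch of the standard Frank copula CDF. It assumes `u` and `v` are already marginal non-exceedance probabilities (for example from fitted PE3 marginals, which are not implemented here); the `theta` value and the function names are hypothetical, not estimates from the study.

```python
import math

def frank_copula_cdf(u, v, theta):
    """Frank copula C(u, v) for dependence parameter theta != 0."""
    if theta == 0:
        raise ValueError("theta must be non-zero (theta -> 0 is independence)")
    if not (0.0 <= u <= 1.0 and 0.0 <= v <= 1.0):
        raise ValueError("u and v must lie in [0, 1]")
    # C(u, v) = -(1/theta) * ln(1 + (e^{-theta u} - 1)(e^{-theta v} - 1) / (e^{-theta} - 1))
    numerator = math.expm1(-theta * u) * math.expm1(-theta * v)
    denominator = math.expm1(-theta)
    return -math.log1p(numerator / denominator) / theta

def joint_exceedance(u, v, theta):
    """P(U > u and V > v), i.e. both duration and severity exceed their thresholds."""
    return 1.0 - u - v + frank_copula_cdf(u, v, theta)

# Hypothetical values: 90th-percentile duration and severity, theta = 5.
print(frank_copula_cdf(0.9, 0.9, 5.0))
print(joint_exceedance(0.9, 0.9, 5.0))
```

With positive `theta` the joint exceedance probability is larger than under independence, which is why ignoring the dependence between duration and severity can understate the frequency of droughts that are both long and severe.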


Analysis on the Difference of Tourist Satisfaction and Intention of Revisiting and Recommendation to the Clusters of the FIT's Life Style - Focused on Chinese Tourists into Visiting Korea (개별여행객의 라이프스타일 유형에 따른 관광만족도, 재방문의도 및 추천의도의 차이 연구 - 방한 중국인 관광객을 중심으로)

  • Ruiyao Li;Tae Hee Lee
    • Korean Journal of Culture and Social Issue / v.21 no.3 / pp.535-550 / 2015
  • This study examines the lifestyles of Chinese free independent travelers (FITs) and how lifestyle differences relate to differences in tourist satisfaction, revisit intention, and recommendation intention. An analysis of the lifestyles of Chinese FITs who visited Korea yielded 27 questionnaire items and 5 factors. In total, 6 hypotheses were accepted through reliability analysis, correlation analysis of the variables, lifestyle cluster analysis, and analysis of differences between lifestyle clusters. The study is expected to contribute useful information for developing strategies to attract more Chinese FITs and for developing better tourism products.


Estimation of Short-Duration Rainfall Quantile Intensity-Duration-Frequency curve using down-scaling in North Korea (하향 스케일링을 이용한 북한 지역의 단기 IDF곡선 추정)

  • Jung, Younghun;Joo, Kyungwon;Kim, Sunghun;Heo, Jun-Haeng
    • Proceedings of the Korea Water Resources Association Conference / 2020.06a / pp.249-249 / 2020
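  • Designing hydraulic structures requires estimating design hydrologic quantities for a range of durations. In Korea as well, abnormal weather driven by climate change has produced localized heavy rainfall with high intensity over 1 to 2 hours, causing urban flooding and direct or indirect damage. In North Korea in particular, many areas have no rain gauge stations, which makes it difficult to estimate the design hydrologic quantities needed for hydraulic structure design. This study aims to estimate probability rainfall for durations of 24 hours or less in North Korea using down-scaling. To this end, regional frequency analysis and cluster analysis were carried out for the ungauged basin upstream of Hwacheon Dam to secure hydrological homogeneity, and the Han River basin was divided into four homogeneous regions. Because the scaling property assumes an identical distribution type, probability rainfall for an arbitrary duration can be estimated from data at a reference duration for which hydrological homogeneity has been secured. Therefore, to estimate probability rainfall for short durations in North Korea, down-scaling can be applied using the regional scaling exponent and scale factor within the same region, and probability rainfall can be estimated for short or long durations.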
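The scaling idea in the abstract can be sketched as follows, assuming the common simple-scaling form in which the rainfall depth quantile at duration d equals the quantile at a reference duration D multiplied by (d/D) raised to a scaling exponent. The exponent, the reference quantile, and the function names are hypothetical illustrations; the abstract does not give the paper's actual formulation or values.

```python
def downscale_depth(depth_ref_mm, duration_ref_h, duration_h, scaling_exponent):
    """Depth quantile at duration_h from the quantile at duration_ref_h (simple scaling)."""
    if duration_ref_h <= 0 or duration_h <= 0:
        raise ValueError("durations must be positive")
    duration_ratio = duration_h / duration_ref_h
    return depth_ref_mm * duration_ratio ** scaling_exponent

def mean_intensity(depth_mm, duration_h):
    """Average intensity (mm/h) over the duration, as plotted on an IDF curve."""
    return depth_mm / duration_h

# Hypothetical inputs: a 24-hour, 100-year depth of 200 mm and an exponent of 0.4.
reference_depth = 200.0
exponent = 0.4
for hours in (1, 3, 6, 12, 24):
    depth = downscale_depth(reference_depth, 24.0, hours, exponent)
    print(hours, round(depth, 1), round(mean_intensity(depth, hours), 1))
```

With an exponent between 0 and 1, shorter durations give smaller depths but higher mean intensities, which is the usual shape of an IDF curve.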


Interpretation of Soil Catena for Agricultural Soils derived from Sedimentary Rocks (퇴적암 유래 농경지 토양에 대한 카테나 해석)

  • SONN, Yeon-Kyu;LEE, Dong-Sung;KIM, Keun-Tae;HYUN, Byung-Keun;JUN, Hye-Weon;JEON, Sang-Ho
    • Journal of the Korean Association of Geographic Information Studies / v.20 no.4 / pp.1-14 / 2017
  • In Korea, the soil series derived from sedimentary rocks comprise seven coarse loamy series (Dain, Danbug, Dongam, Imdong, Jeomgog, Maryeong, and Yonggog); seventeen fine loamy series (Angye, Anmi, Banho, Bigog, Deoggog, Dogye, Dojeon, Gamgog, Gugog, Jincheon, Maji, Mungyeong, Oggye, Samam, Yanggog, Yeongwol, and Yulgog); six fine silty series (Goryeong, Bonggog, Juggog, Gyeongsan, Yuga, and Yugog); and four clayey series (Mitan, Pyeongan, Pyeongjeon, and Uji). All thirty-four soil series differ in drainage and topography, whereas soil texture depends on the parent rock. Buffer functions in a GIS (Geographic Information System) were used to identify the soil series adjacent to each soil series. The length of the adjacent soil series was adjusted because one side of the buffer area was one meter long. Cluster analysis was conducted using the CCC (Cubic Clustering Criterion) method, in which the number of clusters is calculated from the ratio of the individual soil series. Soil surveys in Korea began in 1964 with "The reconnaissance soil survey", and the 1:5,000 detailed soil survey was completed in 1999 under a five-year plan. Today, all soil survey information has been computerized. GIS techniques have been used to establish a digital soil map, but no studies have used GIS to interpret pedogenesis. In this study, the areas of the adjacent soil series were obtained using GIS; the area of the adjacent soil series can be calculated from the area information. The similarity of soils originating from sedimentary rocks was estimated using the length. As a result, the grain size distribution differed by type of sedimentary rock and by location. The clusters were distinguished as limestone, sandstone, and shale. In addition, the soil derived from shale was divided into red shale and gray shale. This means that the catena can be interpreted quantitatively and that the established method can be used to interpret the relationships between soil series.

A Study on the Characteristics of Enterprise R&D Capabilities Using Data Mining (데이터마이닝을 활용한 기업 R&D역량 특성에 관한 탐색 연구)

  • Kim, Sang-Gook;Lim, Jung-Sun;Park, Wan
    • Journal of Intelligence and Information Systems / v.27 no.1 / pp.1-21 / 2021
  • As the global business environment changes, uncertainty in technology development and market needs grows, and competition among companies intensifies, interest in and demand for the R&D activities of individual companies are increasing. To cope with these changes, R&D companies are strengthening R&D investment as one means of enhancing the qualitative competitiveness of R&D, while also paying more attention to facility investment. As a result, facility and R&D investment inevitably become a burden that R&D companies must bear under future uncertainty. A management strategy of increasing R&D investment to enhance R&D capability is indeed highly uncertain in terms of corporate performance. This study uses data mining techniques to explore the structural factors that influence companies' R&D capabilities from three viewpoints (technology management capability, R&D capability, and corporate classification attributes) and analyzes how these individual factors vary with the level of R&D capability. It also presents cluster analysis and experimental results based on evidence data for all domestic R&D companies. For the three viewpoints, 7, 2, and 4 detailed evaluation indexes, respectively, were constructed to measure the level in each area quantitatively. For technology management capability and R&D capability, the sub-item evaluation indexes currently used by domestic technology evaluation agencies were referenced, and the final detailed evaluation indexes were newly constructed in consideration of whether data could be obtained quantitatively. For corporate classification attributes, the most basic corporate profile information was considered. In particular, to assess the homogeneity of R&D competency levels, a comprehensive score was assigned to each company using the detailed evaluation indexes for technology management capability and R&D capability, and the competency level was classified into five grades and compared with the cluster analysis results. To interpret the comparison between the clusters and the competency grades, the clusters were examined for high or low tendencies in R&D competency level, and the characteristics of each cluster were then analyzed by detailed evaluation index. In this way, two clusters with high R&D competency and one with low R&D competency were identified, while the remaining two clusters were similar to each other, with similarly high incidence. Accordingly, individual characteristics by detailed evaluation index were analyzed for the two clusters with high competency and the one cluster with low competency. The results imply that the faster the replacement cycle of professional managers who can respond effectively to changes in technology and market demand, the more likely it is to contribute to enhancing R&D capabilities. For a private company, it is necessary to increase the intensity of R&D input by strengthening R&D personnel's sense of belonging through conversion to a corporation, and to clarify responsibility and authority through team-based organization. Because the numbers of technology commercialization achievements and technology certifications appear both in cases that contribute to capability improvement and in cases that do not, there is a limit to treating them as important factors for enhancing R&D capability from a management perspective. Lastly, experience in filing utility models was identified as a factor with an important influence on R&D capability, which confirms the need to motivate utility model filings in order to enhance R&D capability. These results are expected to provide important implications for corporate management strategies to enhance the R&D capabilities of individual companies.

Understanding Chemical Characteristics of Seepage Water and Groundwater in a Coastal LPG Storage Cavern using Factor and Cluster Analyses (인자 및 군집분석을 통한 해안 LPG공동 유출수 및 지하수 수질특성의 이해)

  • Jo, Yun-Ju;Lee, Jin-Yong
    • Economic and Environmental Geology / v.42 no.6 / pp.599-608 / 2009
  • This study used factor and cluster analyses to examine the chemical characteristics of, and correlations among, seepage water, subsurface water, and inland groundwater in and around a coastal underground LPG cavern. The study area is on the western coast of Incheon metropolitan city, about 8 km off the coast. The LPG cavern, which stores propane and butane, was built beneath an artificially reclaimed island. Mean bathymetry is 8.5 m and the maximum sea level change is 10 m. Water was sampled in May and August 2006 at 22 sampling points. Correlation analysis showed strong correlations between $Fe^{2+}$ and $Mn^{2+}$ (r=0.83~0.99) and between Na and Cl (r=0.70~0.97), indicating reductive dissolution of iron- and manganese-bearing minerals and seawater ingression, respectively. In the factor analysis, Factors 1 (May) and I (August) showed high loadings for parameters representing seawater ingression into the cavern and the effect of submarine groundwater discharge, respectively, while Factors 2 and IV showed high loadings for parameters representing oxidation conditions (DO and ORP). Factors 4 and II had large positive loadings for $Fe^{2+}$ and $Mn^{2+}$. The increase in $Fe^{2+}$ and $Mn^{2+}$ was related to the decomposition of organic matter and their subsequent dissolution under reducing conditions. Cluster analysis yielded 6 groups for May and 5 groups for August, which mainly comprised inland groundwater, cavern seepage water, seawater, and subsurface water in the LPG storage cavern. Subsurface water (Group 2 and Group III) around the underground storage cavern showed high EC and major ion contents, reflecting the seawater effect. Cavern seepage water (Group 5 and Group II) showed reducing conditions (low DO and negative ORP) and higher levels of $Fe^{2+}$ and $Mn^{2+}$.

Characteristics of Fish Assemblage by Reservoir Size in Yeongsan·Seomjin River Watershed in Korea (영산강·섬진강 수계 호소의 규모별 어류군집 특성)

  • Park, Sang-Hyeon;Kim, Jeong-Hui;Baek, Seung-Ho;Choi, Ho-Seung;Kim, Dae-Won;Ko, Eui-Jeong;Kim, Hyun-Woo
    • Korean Journal of Ecology and Environment / v.53 no.3 / pp.229-240 / 2020
  • This study presents the results of a fish assemblage survey of 16 reservoirs in the Yeongsan·Seomjin River watershed, together with an analysis of fish assemblage characteristics in relation to reservoir size. The survey method, including the number of sampling sites, followed the "Biomonitoring survey and assessment manual" of the Ministry of Environment/National Institute of Environmental Research (MOE/NIER), and the reservoirs were categorized into three size groups (small, medium, and large), also based on the MOE/NIER criteria. A total of 44 species in 13 families were collected from 2018 (7 reservoirs) to 2019 (9 reservoirs), and the dominant and subdominant species were Hemiculter eigenmanni (relative abundance, RA, 32.9%) and Lepomis macrochirus (RA, 31.4%), respectively. In the analysis by reservoir size, the mean (±standard deviation) numbers of species in the small, medium, and large reservoirs were 11±2.9, 14.3±2.1, and 22.7±0.6, respectively, showing a positive correlation with reservoir size. Six fish assemblage characteristics (number of species, number of individuals, richness index, herbivorous fish ratio, carnivorous fish ratio, and exotic fish ratio) differed significantly among the reservoir size groups (P<0.05). In the cluster analysis, the 16 reservoirs were clustered into 5 groups at 60% similarity, and the reservoirs appear to cluster by distance from one another, watershed, and geological history rather than by size. These results provide baseline information for understanding fish assemblages in Korean reservoirs and are important for establishing reservoir management policy in the Yeongsan·Seomjin River watershed.

Construction and Application of Network Design System for Optimal Water Quality Monitoring in Reservoir (저수지 최적수질측정망 구축시스템 개발 및 적용)

  • Lee, Yo-Sang;Kwon, Se-Hyug;Lee, Sang-Uk;Ban, Yang-Jin
    • Journal of Korea Water Resources Association / v.44 no.4 / pp.295-304 / 2011
  • Effective water quality management requires reliable water quality information. A comprehensive, practical monitoring network must account for many variables, for example representative sampling locations, suitable sampling frequencies, water quality variable selection, and budgetary and logistical constraints; of these, sampling location is considered the most important issue. Until now, monitoring networks for water quality management have been set up by qualitative judgment, which raises a problem of representativeness. In this paper, we propose a network design system for optimal water quality monitoring based on statistical techniques. The system is built on SAS version 9.2 and is configured with a simple input system and user-friendly outputs. It accepts data in Excel format for ease of use, with the data for each sampling location kept on a separate sheet. The system produces time plots, a dendrogram, and scatter plots. Time plots of the water quality variables are drawn to identify the variables that distinguish sampling locations significantly. Similarities between sampling locations are calculated using Euclidean distances of the principal component variables, dimension coordinates are calculated by multidimensional scaling, and a dendrogram from the cluster analysis is displayed so that users can choose an appropriate number of clusters. Scatter plots of the principal component variables show the clustering information together with the sampling locations and the representative location.
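The distance-then-dendrogram step described above can be sketched in a few lines. The system in the abstract is built on SAS; the Python below is a stand-in for illustration only. It assumes average linkage (the abstract does not state the linkage method), and the station names and principal component scores are hypothetical.

```python
import math
from itertools import combinations

def euclidean(a, b):
    """Euclidean distance between two points given as equal-length tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def agglomerate(points, n_clusters):
    """Average-linkage agglomerative clustering.

    points: dict mapping a station name to its principal component scores.
    Returns (clusters, merges); merges lists (cluster_a, cluster_b, height),
    the information a dendrogram is drawn from.
    """
    if n_clusters < 1:
        raise ValueError("n_clusters must be at least 1")
    clusters = [[name] for name in points]

    def linkage(c1, c2):
        dists = [euclidean(points[i], points[j]) for i in c1 for j in c2]
        return sum(dists) / len(dists)

    merges = []
    while len(clusters) > n_clusters:
        # Find the pair of clusters with the smallest average distance.
        i, j = min(combinations(range(len(clusters)), 2),
                   key=lambda pair: linkage(clusters[pair[0]], clusters[pair[1]]))
        height = linkage(clusters[i], clusters[j])
        merges.append((clusters[i], clusters[j], height))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return clusters, merges

# Hypothetical scores on the first two principal components for five stations.
scores = {"S1": (0.0, 0.1), "S2": (0.2, 0.0), "S3": (3.0, 3.1),
          "S4": (3.2, 2.9), "S5": (6.0, 0.0)}
clusters, merges = agglomerate(scores, 3)
print(clusters)
for a, b, height in merges:
    print(a, b, round(height, 3))
```

Stations that end up in the same cluster carry similar information, so one representative location per cluster is a natural candidate for the monitoring network.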

Analysis of Area Type Classification of Seoul Using Geodemographics Methods (Geodemographics의 연구기법을 활용한 서울시 지역유형 분석 연구)

  • Woo, Hyun-Jee;Kim, Young-Hoon
    • Journal of the Korean association of regional geographers / v.15 no.4 / pp.510-523 / 2009
  • Geodemographics (GD) can be defined as an analytical approach that uses socio-economic and behavioral data about people to investigate geographical patterns. GD rests on the assumption that people who live in the same neighborhood have similar demographic and behavioral characteristics, so that neighborhoods can be categorized by geographical classification. To assess the applicability of GD's geographical classification, this paper applies geodemographic concepts to areas of Seoul using Korean census data sets that contain key demographic characteristics of each area. It then attempts to explain each area classification profile using clustering techniques, namely Ward's method and k-means. For this purpose, the paper uses the 2005 Census dataset released by the Korea National Statistics Office, with the neighborhood unit set at the Dong level, the smallest administrative boundary unit in Korea. After variables are selected and standardized, the areas are categorized by the clustering techniques into 13 distinctive cluster profiles. These cluster profiles are used to write a short description and to expand on the cluster names. Finally, the classification results offer a reasonable basis for judging target area types, which benefits people who make spatial decisions when solving spatial problems.
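The "standardize, then cluster" workflow in the abstract can be sketched as below. This shows only z-score standardization and a basic k-means; Ward's method is not shown, and the farthest-first initialization, the two-variable toy data, and k = 2 are assumptions chosen for illustration rather than details from the paper, which reports 13 clusters on census variables.

```python
import math

def standardize(rows):
    """Z-score each column using the population standard deviation."""
    columns = list(zip(*rows))
    means = [sum(col) / len(col) for col in columns]
    stds = [math.sqrt(sum((x - m) ** 2 for x in col) / len(col))
            for col, m in zip(columns, means)]
    return [[(x - m) / s if s > 0 else 0.0 for x, m, s in zip(row, means, stds)]
            for row in rows]

def sq_dist(a, b):
    """Squared Euclidean distance."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(rows, k, max_iter=100):
    """Basic k-means with deterministic farthest-first initialization."""
    centers = [list(rows[0])]
    while len(centers) < k:
        farthest = max(rows, key=lambda r: min(sq_dist(r, c) for c in centers))
        centers.append(list(farthest))
    labels = None
    for _ in range(max_iter):
        # Assign each row to its nearest center.
        new_labels = [min(range(k), key=lambda c: sq_dist(row, centers[c]))
                      for row in rows]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Move each center to the mean of its members.
        for c in range(k):
            members = [row for row, lab in zip(rows, labels) if lab == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centers

# Hypothetical neighborhood data: two made-up variables per Dong.
areas = [[10, 1.0], [11, 1.1], [12, 0.9], [50, 5.0], [52, 5.2], [51, 4.9]]
labels, centers = kmeans(standardize(areas), k=2)
print(labels)
```

Standardizing first keeps a variable with a large numeric range from dominating the distance calculation, which is why the abstract lists it before clustering.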
