• Title/Summary/Keyword: K-Mean 알고리즘

Search Result 656, Processing Time 0.019 seconds

The Evaluation of Meteorological Inputs retrieved from MODIS for Estimation of Gross Primary Productivity in the US Corn Belt Region (MODIS 위성 영상 기반의 일차생산성 알고리즘 입력 기상 자료의 신뢰도 평가: 미국 Corn Belt 지역을 중심으로)

  • Lee, Ji-Hye;Kang, Sin-Kyu;Jang, Keun-Chang;Ko, Jong-Han;Hong, Suk-Young
    • Korean Journal of Remote Sensing
    • /
    • v.27 no.4
    • /
    • pp.481-494
    • /
    • 2011
  • Investigation of the $CO_2$ exchange between biosphere and atmosphere at regional, continental, and global scales can be directed to combining remote sensing with carbon cycle process to estimate vegetation productivity. NASA Earth Observing System (EOS) currently produces a regular global estimate of gross primary productivity (GPP) and annual net primary productivity (NPP) of the entire terrestrial earth surface at 1 km spatial resolution. While the MODIS GPP algorithm uses meteorological data provided by the NASA Data Assimilation Office (DAO), the sub-pixel heterogeneity or complex terrain are generally reflected due to coarse spatial resolutions of the DAO data (a resolution of $1{\circ}\;{\times}\;1.25{\circ}$). In this study, we estimated inputs retrieved from MODIS products of the AQUA and TERRA satellites with 5 km spatial resolution for the purpose of finer GPP and/or NPP determinations. The derivatives included temperature, VPD, and solar radiation. Seven AmeriFlux data located in the Corn Belt region were obtained to use for evaluation of the input data from MODIS. MODIS-derived air temperature values showed a good agreement with ground-based observations. The mean error (ME) and coefficient of correlation (R) ranged from $-0.9^{\circ}C$ to $+5.2^{\circ}C$ and from 0.83 to 0.98, respectively. VPD somewhat coarsely agreed with tower observations (ME = -183.8 Pa ~ +382.1 Pa; R = 0.51 ~ 0.92). While MODIS-derived shortwave radiation showed a good correlation with observations, it was slightly overestimated (ME = -0.4 MJ $day^{-1}$ ~ +7.9 MJ $day^{-1}$; R = 0.67 ~ 0.97). Our results indicate that the use of inputs derived MODIS atmosphere and land products can provide a useful tool for estimating crop GPP.

Estimation of forest Site Productivity by Regional Environment and Forest Soil Factors (권역별 입지$\cdot$토양 환경 요인에 의한 임지생산력 추정)

  • Won Hyong-kyu;Jeong Jin-Hyun;Koo Kyo-Sang;Song Myung Hee;Shin Man Yong
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.7 no.2
    • /
    • pp.132-140
    • /
    • 2005
  • This study was conducted to develop regional site index equations for main tree species in Gangwon, Gyunggi-Chungcheong, Gyungsang, and Jeolla area of Korea, using environmental and soil factors obtained from a digital forest site map. Using the large data set obtained from the digital forest map, a total of 28 environmental and soil factors were regressed on site index by tree species for developing the best site index equations for each of the regions. The selected main tree species were Larix 1eptolepis, Pinus koraiensis, Pinus densiflora, Pinus thunbergii, and Quercus acutissima. Finally, four to five environmental and soil factors by species were chosen as independent variables in defining the best regional site index equations with the highest coefficients of determination $(R^2)$. For those site index equations, three evaluation statistics such as mean difference, standard deviation of difference and standard error of difference were applied to the data sets independently collected from fields within the region. According to the evaluation statistics, it was found that the regional site index equations by species developed in this study conformed well to the independent data set, having relatively low bias and variation. It was concluded that the regional site index equations by species had sufficient capability for the estimation of site productivity.

Seasonal and Inter-annual Variability of Water Use Efficiency of an Abies holophylla Plantation in Korea National Arboretum (국립수목원의 전나무(Abies holophylla) 조림지의 물 이용 효율의 계절 및 경년 변동)

  • Thakuri, Bindu Malla;Kang, Minseok;Zhang, Yonghui;Chun, Junghwa;Kim, Joon
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.18 no.4
    • /
    • pp.366-377
    • /
    • 2016
  • Water use efficiency (WUE) is considered as an important ecological indicator which may provide information on the process-structure relationships associated with energy-matter-information flows in ecosystem. The WUE at ecosystem-level can be defined as the ratio of gross primary productivity (GPP) to evapotranspiration (ET). In this study, KoFlux's long-term (2007-2015) eddy covariance measurements of $CO_2$ and water vapor fluxes were used to examine the WUE of needle fir plantation in Korea National Arboretum. Our objective is to ascertain the seasonality and inter-annual variability in WUE of this needle fir plantation so that the results may be assimilated into the development of a holistic ecological indicator for resilience assessment. Our results show that the WUE of needle fir plantation is characterized by a concave seasonal pattern with a minimum ($1.8-3.3g\;C{\cdot}(kg\;H_2O)^{-1}$) in August and a maximum ($5.1-11.4g\;C{\cdot}(kg\;H_2O)^{-1}$) in February. During the growing season (April to October), WUE was on average $3.5{\pm}0.3g\;C\;(kg\;H_2O)^{-1}$. During the dormant seasons (November to March), WUE showed more variations with a mean of $7.4{\pm}1.0g\;C{\cdot}(kg\;H_2O)^{-1}$. These values are in the upper ranges of WUE reported in the literature for coniferous forests in temperate zone. Although the growing season was defined as the period from April to October, the actual length of the growing season (GSL) varied each year and its variation explained 62% of the inter-annual variability of the growing season WUE. This is the first study to quantify long-term changes in ecosystem-level WUE in Korea and the results can be used to test models, remote-sensing algorithms and resilience of forest ecosystem.

Development of Market Growth Pattern Map Based on Growth Model and Self-organizing Map Algorithm: Focusing on ICT products (자기조직화 지도를 활용한 성장모형 기반의 시장 성장패턴 지도 구축: ICT제품을 중심으로)

  • Park, Do-Hyung;Chung, Jaekwon;Chung, Yeo Jin;Lee, Dongwon
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.4
    • /
    • pp.1-23
    • /
    • 2014
  • Market forecasting aims to estimate the sales volume of a product or service that is sold to consumers for a specific selling period. From the perspective of the enterprise, accurate market forecasting assists in determining the timing of new product introduction, product design, and establishing production plans and marketing strategies that enable a more efficient decision-making process. Moreover, accurate market forecasting enables governments to efficiently establish a national budget organization. This study aims to generate a market growth curve for ICT (information and communication technology) goods using past time series data; categorize products showing similar growth patterns; understand markets in the industry; and forecast the future outlook of such products. This study suggests the useful and meaningful process (or methodology) to identify the market growth pattern with quantitative growth model and data mining algorithm. The study employs the following methodology. At the first stage, past time series data are collected based on the target products or services of categorized industry. The data, such as the volume of sales and domestic consumption for a specific product or service, are collected from the relevant government ministry, the National Statistical Office, and other relevant government organizations. For collected data that may not be analyzed due to the lack of past data and the alteration of code names, data pre-processing work should be performed. At the second stage of this process, an optimal model for market forecasting should be selected. This model can be varied on the basis of the characteristics of each categorized industry. As this study is focused on the ICT industry, which has more frequent new technology appearances resulting in changes of the market structure, Logistic model, Gompertz model, and Bass model are selected. A hybrid model that combines different models can also be considered. The hybrid model considered for use in this study analyzes the size of the market potential through the Logistic and Gompertz models, and then the figures are used for the Bass model. The third stage of this process is to evaluate which model most accurately explains the data. In order to do this, the parameter should be estimated on the basis of the collected past time series data to generate the models' predictive value and calculate the root-mean squared error (RMSE). The model that shows the lowest average RMSE value for every product type is considered as the best model. At the fourth stage of this process, based on the estimated parameter value generated by the best model, a market growth pattern map is constructed with self-organizing map algorithm. A self-organizing map is learning with market pattern parameters for all products or services as input data, and the products or services are organized into an $N{\times}N$ map. The number of clusters increase from 2 to M, depending on the characteristics of the nodes on the map. The clusters are divided into zones, and the clusters with the ability to provide the most meaningful explanation are selected. Based on the final selection of clusters, the boundaries between the nodes are selected and, ultimately, the market growth pattern map is completed. The last step is to determine the final characteristics of the clusters as well as the market growth curve. The average of the market growth pattern parameters in the clusters is taken to be a representative figure. Using this figure, a growth curve is drawn for each cluster, and their characteristics are analyzed. Also, taking into consideration the product types in each cluster, their characteristics can be qualitatively generated. We expect that the process and system that this paper suggests can be used as a tool for forecasting demand in the ICT and other industries.

Estimation of Reliability of Real-time Control Parameters for Animal Wastewater Treatment Process and Establishment of an Index for Supplemental Carbon Source Addition (가축분뇨처리공정의 자동제어 인자 신뢰성 평가 및 적정 외부탄소원 공급량 지표 확립)

  • Pak, JaeIn;Ra, Jae In-
    • Journal of Animal Science and Technology
    • /
    • v.50 no.4
    • /
    • pp.561-572
    • /
    • 2008
  • Responses of real-time control parameters, such as ORP, DO and pH, to the conditions of biological animal wastewater treatment process were examined to evaluate the stability of real-time control using each parameter. Also an optimum index for supplemental carbon source addition based on NOx-N level was determined under a consideration of denitrification rate by endogenous respiration of microorganism and residual organic matter in liquor. Experiment was performed with lab-scale sequencing batch reactor(SBR) and working volume of the process was 45L. The distinctive nitrogen break point(NBP) on ORP-and DO-time profiles, which mean the termination of nitrification, started disappearing with the maintenance of low NH4-N loading rate. Also the NBP on ORP-and DO-time profiles was no longer observed when high NOx-N was loaded into the reactor, and the sensitivity of ORP became dull with the increase of NOx-N level. However, the distinctive NBP was constantly occurred on pH(mV)-time profile, maintaining unique profile patterns. This stable occurrence of NBP on pH(mV)-time profile was lasted even at very high NOx-N:NH4-N ratio(over 80:1) in reactor, and the specific point could be easily detected by tracking moving slope change(MSC) of the curve. Revelation of NBP on pH(mV)-time profile and recognition of the realtime control point using MSC were stable at a condition of over 300mg/L NOx-N level in reactor. The occurrence of distinctive NBP was persistent on pH(mV)-time profile even at a level of 10,000mg/L STOC(soluble total organic carbon) and the recognition of NBP was feasible by tracing MSC, but that point on ORP and DO-time profiles began to disappear with the increase of STOC level in reactor. The denitrfication rate by endogenous respiration and residual organic matter was about 0.4mg/L.hr., and it was found that 0.83 would be accepted as an index for supplemental carbon source addition when 0.1 of safety factor was applied.

A Study on Risk Parity Asset Allocation Model with XGBoos (XGBoost를 활용한 리스크패리티 자산배분 모형에 관한 연구)

  • Kim, Younghoon;Choi, HeungSik;Kim, SunWoong
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.1
    • /
    • pp.135-149
    • /
    • 2020
  • Artificial intelligences are changing world. Financial market is also not an exception. Robo-Advisor is actively being developed, making up the weakness of traditional asset allocation methods and replacing the parts that are difficult for the traditional methods. It makes automated investment decisions with artificial intelligence algorithms and is used with various asset allocation models such as mean-variance model, Black-Litterman model and risk parity model. Risk parity model is a typical risk-based asset allocation model which is focused on the volatility of assets. It avoids investment risk structurally. So it has stability in the management of large size fund and it has been widely used in financial field. XGBoost model is a parallel tree-boosting method. It is an optimized gradient boosting model designed to be highly efficient and flexible. It not only makes billions of examples in limited memory environments but is also very fast to learn compared to traditional boosting methods. It is frequently used in various fields of data analysis and has a lot of advantages. So in this study, we propose a new asset allocation model that combines risk parity model and XGBoost machine learning model. This model uses XGBoost to predict the risk of assets and applies the predictive risk to the process of covariance estimation. There are estimated errors between the estimation period and the actual investment period because the optimized asset allocation model estimates the proportion of investments based on historical data. these estimated errors adversely affect the optimized portfolio performance. This study aims to improve the stability and portfolio performance of the model by predicting the volatility of the next investment period and reducing estimated errors of optimized asset allocation model. As a result, it narrows the gap between theory and practice and proposes a more advanced asset allocation model. In this study, we used the Korean stock market price data for a total of 17 years from 2003 to 2019 for the empirical test of the suggested model. The data sets are specifically composed of energy, finance, IT, industrial, material, telecommunication, utility, consumer, health care and staple sectors. We accumulated the value of prediction using moving-window method by 1,000 in-sample and 20 out-of-sample, so we produced a total of 154 rebalancing back-testing results. We analyzed portfolio performance in terms of cumulative rate of return and got a lot of sample data because of long period results. Comparing with traditional risk parity model, this experiment recorded improvements in both cumulative yield and reduction of estimated errors. The total cumulative return is 45.748%, about 5% higher than that of risk parity model and also the estimated errors are reduced in 9 out of 10 industry sectors. The reduction of estimated errors increases stability of the model and makes it easy to apply in practical investment. The results of the experiment showed improvement of portfolio performance by reducing the estimated errors of the optimized asset allocation model. Many financial models and asset allocation models are limited in practical investment because of the most fundamental question of whether the past characteristics of assets will continue into the future in the changing financial market. However, this study not only takes advantage of traditional asset allocation models, but also supplements the limitations of traditional methods and increases stability by predicting the risks of assets with the latest algorithm. There are various studies on parametric estimation methods to reduce the estimated errors in the portfolio optimization. We also suggested a new method to reduce estimated errors in optimized asset allocation model using machine learning. So this study is meaningful in that it proposes an advanced artificial intelligence asset allocation model for the fast-developing financial markets.