• Title/Summary/Keyword: Estimation Models

A Study on the Determinants of Land Price in a New Town (신도시 택지개발사업지역에서 토지가격 결정요인에 관한 연구)

  • Jeong, Tae Yun
    • Korea Real Estate Review / v.28 no.1 / pp.79-90 / 2018
  • The purpose of this study was to identify the pricing factors of residential land in new towns by estimating a pricing model of residential land. For this purpose, hedonic equations for each quantile of the conditional distribution of land prices were estimated using quantile regression and sale price data from Jangyu New Town in Gimhae. Quantile regression models the relation between a set of explanatory variables and each quantile of land price. As a result, differences in the effects of land characteristics by price quantile were confirmed. The number of years elapsed since completion of land construction (land age) enters the model as a quadratic term because its impact may give rise to a non-linear price pattern. Age appears to decrease the price until a certain number of years after construction and to increase it afterward. In the quantile regression estimates, land age has a statistically significant impact on land price at conventional levels, and the turning point occurs earlier for the low quantiles than for the higher quantiles. The positive effect of using land for commercial and residential purposes was found to be the largest. Land bordered by more than two roads is preferred, because the amount of sunshine improves in that case. Square lots appear to be preferred to irregularly shaped lots, because square land is favorable for development. The variable for land used for commercial and residential purposes has a greater impact on low-priced residential land, because such land tends to be used mostly for rental housing and has different characteristics from residential houses. Residential land prices have different characteristics depending on the price level, and this should be considered when evaluating collateral value and drafting real estate policy.
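
The quadratic age effect and the quantile-specific turning point described in this abstract can be illustrated with a minimal sketch. It assumes a hedonic specification in which land age enters as `b_age * age + b_age_sq * age**2`; the function names and the coefficient values are hypothetical and are not taken from the paper. The check (pinball) loss shown is the standard objective that quantile regression minimizes.

```python
def pinball_loss(residual: float, tau: float) -> float:
    """Check (pinball) loss that quantile regression minimizes at quantile tau."""
    return residual * (tau - (1.0 if residual < 0 else 0.0))


def turning_point(b_age: float, b_age_sq: float) -> float:
    """Age at which the quadratic effect b_age*age + b_age_sq*age**2 has its extremum."""
    if b_age_sq == 0:
        raise ValueError("a turning point requires a non-zero quadratic coefficient")
    return -b_age / (2.0 * b_age_sq)


# Hypothetical coefficients per quantile (illustrative only, not from the paper).
# A negative linear term with a positive quadratic term gives "price falls, then rises".
example_coefs = {0.25: (-0.040, 0.0025), 0.75: (-0.060, 0.0025)}
turning_points = {q: turning_point(b1, b2) for q, (b1, b2) in example_coefs.items()}

for q, age in sorted(turning_points.items()):
    print(f"quantile {q}: price effect of age turns around {age:.1f} years")
```

With these made-up coefficients the lower quantile turns earlier than the higher one, which mirrors the pattern the abstract reports without reproducing its estimates.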

Estimation of Surface fCO2 in the Southwest East Sea using Machine Learning Techniques (기계학습법을 이용한 동해 남서부해역의 표층 이산화탄소분압(fCO2) 추정)

  • HAHM, DOSHIK;PARK, SOYEONA;CHOI, SANG-HWA;KANG, DONG-JIN;RHO, TAEKEUN;LEE, TONGSUP
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY / v.24 no.3 / pp.375-388 / 2019
  • Accurate evaluation of the sea-to-air $CO_2$ flux and its variability is crucial for understanding the global carbon cycle and predicting the atmospheric $CO_2$ concentration. $fCO_2$ observations are sparse in space and time in the East Sea. In this study, we derived a high-resolution time series of surface $fCO_2$ values in the southwest East Sea by feeding sea surface temperature (SST), salinity (SSS), chlorophyll-a (CHL), and mixed layer depth (MLD) values, from either satellite observations or numerical model outputs, to three machine learning models. The root mean square error of the best performing model, a Random Forest (RF) model, was $7.1\ \mu\mathrm{atm}$. The important parameters in predicting $fCO_2$ in the RF model were SST and SSS along with time information; CHL and MLD were much less important than the other parameters. The net $CO_2$ flux in the southwest East Sea, calculated from the $fCO_2$ predicted by the RF model, was $-0.76 \pm 1.15\ \mathrm{mol\,m^{-2}\,yr^{-1}}$, close to the lower bound of previous estimates, which range from $-0.66$ to $-2.47\ \mathrm{mol\,m^{-2}\,yr^{-1}}$. The time series of $fCO_2$ predicted by the RF model showed significant variation even over an interval as short as a week. For accurate evaluation of the $CO_2$ flux in the Ulleung Basin, it is necessary to conduct high-resolution in situ observations in spring, when $fCO_2$ changes rapidly.

Comparative Analysis of Nitrogen Concentration of Rainfall in South Korea for Nonpoint Source Pollution Model Application (비점오염모델 적용을 위한 우리나라 행정구역별 강수 중 질소농도 비교분석)

  • Choi, Dong Ho;Kim, Min-Kyeong;Hur, Seung-Oh;Hong, Sung-Chang;Choi, Soon-Kun
    • Korean Journal of Environmental Agriculture / v.37 no.3 / pp.189-196 / 2018
  • BACKGROUND: Water quality management of rivers requires quantifying pollutant loads and implementing measures through monitoring studies, which demands labour and cost. Therefore, many researchers perform nonpoint source pollution analysis using computer models. However, calibration of model parameters needs observed data. Nitrogen concentration in rainfall is one of the factors to be considered when estimating pollutant loads with a nonpoint source pollution model, but the default value provided by the model is used when there are no observed data. Therefore, this study aims to provide representative nitrogen concentrations of rainfall for each administrative district to support rational modeling and reliable results. METHODS AND RESULTS: In this study, rainfall monitoring data from June 2015 to December 2017 were used to determine the nitrogen concentration in rainfall for each administrative district. The $NO_3^-$ and $NH_4^+$ concentrations ranged over 0.41~6.05 mg/L and 0.39~2.27 mg/L, respectively, and the T-N concentration ranged over 0.80~7.71 mg/L. Furthermore, the national average T-N concentration in this study was $2.84 \pm 1.42$ mg/L, which was similar to the national average T-N of 3.03 mg/L presented by the Ministry of Environment in 2015. Therefore, the nitrogen concentrations suggested in this study can be considered reasonable values. CONCLUSION: The nitrogen concentrations estimated in this study showed regional differences. Therefore, when estimating pollutant loads with a nonpoint source pollution model, reasonable parameter estimation of nitrogen concentration in rainfall is possible by reflecting regional characteristics.

Feasibility of Tax Increase in Korean Welfare State via Estimation of Optimal Tax burden Ratio (적정조세부담률 추정을 통한 한국 복지국가 증세가능성에 관한 연구)

  • Kim, SeongWook
    • 한국사회정책 / v.20 no.3 / pp.77-115 / 2013
  • The purpose of this study is to present empirical evidence for the discussion of financing social welfare by estimating the optimal tax burden in the main OECD member countries using the Hausman-Taylor method, which accounts for the endogeneity of explanatory variables. The author also constructed an international tax comparison (ITC) index, reflecting theoretical hypotheses on the revenue-expenditure nexus within the model, to compare real tax burdens across countries and to examine the feasibility of a tax increase in Korea. The analysis showed that higher tax burdens were associated with higher welfare expenditure, indicating a link between high burden and high welfare in terms of scale. The results also indicated that the countries studied have recently entered a state of low tax burden. Korea maintained a low burden until the late 1990s, but its tax burden rose sharply after the financial crisis involving the IMF. However, owing to external economic conditions and tax reduction policy, it re-entered the low-burden state after 2009. On the other hand, the degree to which social welfare expenditure reduces the tax burden has gradually increased since the crisis. In this context, the optimal tax burden ratio of Korea as of 2010 may be 25.8%~26.5% of GDP when welfare expenditure variables are included. Korea was found to be a 'high tax burden-low ITC' country in which a tax increase of 0.7~1.4%p may be feasible and in which tax system reform for a tax increase may be more likely to succeed than in other countries. However, increasing social security contributions and consumption tax was found to be inappropriate from a fiscal management perspective compared with increasing other tax items, given their relatively higher ITC.
A tax increase is not necessarily required even though there may be room for one; the optimal tax burden ratio should be understood as the level that may be achieved on average relative to other nations, not as the "proper" level. Thus, discussion of a tax increase should be accompanied by a comprehensive understanding of differences in models of economic development across nations and of the institutional and historical attributes embedded in each specific tax mix.

Performance of Investment Strategy using Investor-specific Transaction Information and Machine Learning (투자자별 거래정보와 머신러닝을 활용한 투자전략의 성과)

  • Kim, Kyung Mock;Kim, Sun Woong;Choi, Heung Sik
    • Journal of Intelligence and Information Systems / v.27 no.1 / pp.65-82 / 2021
  • Stock market investors are generally split into foreign investors, institutional investors, and individual investors. Compared to individual investors, professional investor groups such as foreign investors have an advantage in information and financial power and, as a result, foreign investors are known to show good investment performance among market participants. The purpose of this study is to propose an investment strategy that combines investor-specific transaction information and machine learning, and to analyze the portfolio investment performance of the proposed model using actual stock price and investor-specific transaction data. The Korea Exchange provides securities firms with daily information on the purchase and sale volume of each investor group. We developed a data collection program in the C# programming language using an API provided by Daishin Securities Cybosplus, and collected daily opening price, closing price, and investor-specific net purchase data for 151 of the 200 KOSPI stocks from January 2, 2007 to July 31, 2017. The self-organizing map model is an artificial neural network that performs clustering by unsupervised learning and was introduced by Teuvo Kohonen in 1984. It implements competition among artificial neurons within a layer, and it is a non-recursive artificial neural network in which all connections run from bottom to top. It can be expanded to multiple layers, although a single layer is commonly used. Linear functions are used as the activation functions of the artificial neurons, and the learning rules include the Instar rule as well as general competitive learning. The backpropagation model is an artificial neural network that performs classification by supervised learning. We grouped and transformed the investor-specific transaction volume data with the self-organizing map model and used the result to train the backpropagation model.
Based on the estimates for the verification data obtained after training, the portfolios were rebalanced monthly. For performance analysis, a passive portfolio was designated, and the KOSPI 200 and KOSPI index returns were obtained as proxies for market returns. Performance analysis was conducted using the equally-weighted portfolio return, compound interest rate, annual return, Maximum Drawdown (MDD), standard deviation, and Sharpe Ratio. The buy-and-hold return of the top 10 market capitalization stocks was designated as the benchmark, since buy and hold is the best strategy under the efficient market hypothesis. The prediction rate of the backpropagation model on the learning data was high at 96.61%, and the prediction rate on the verification data was also relatively high at 57.1%. The performance of the self-organizing map grouping can be judged from the results of the backpropagation model, because if the grouping results of the self-organizing map model had been poor, the learning results of the backpropagation model would also have been poor. On this basis, the machine learning performance is judged to be better than in previous studies. Our portfolio returned twice as much as the benchmark and performed better than the market returns on the KOSPI and KOSPI 200 indexes. The portfolio risk indicators, MDD and standard deviation, were also better than those of the benchmark. The Sharpe Ratio was higher than those of the benchmark and the stock market indexes. Through this, we presented a direction for a portfolio composition program using machine learning and investor-specific transaction information and showed that it can be used to develop programs for real stock investment. The reported return is the result of composing the portfolio monthly and rebalancing assets to equal proportions.
Better outcomes are expected if, when forming the monthly portfolio, stocks that remain recommended are kept rather than sold and re-bought. The strategy therefore appears applicable to real transactions.
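
The performance measures named in this abstract (equally-weighted portfolio return, Maximum Drawdown, standard deviation, and Sharpe Ratio) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' evaluation code: the function names are hypothetical, returns are assumed to be simple monthly returns, the risk-free rate is assumed to be given per period, and the Sharpe ratio is annualized with the square root of 12.

```python
import math


def equal_weight_return(stock_returns):
    """Return of a portfolio holding each stock in the same proportion."""
    return sum(stock_returns) / len(stock_returns)


def max_drawdown(values):
    """Largest peak-to-trough decline of a portfolio value series, as a fraction."""
    peak = values[0]
    worst = 0.0
    for v in values:
        peak = max(peak, v)
        worst = max(worst, (peak - v) / peak)
    return worst


def sharpe_ratio(returns, risk_free=0.0, periods_per_year=12):
    """Annualized Sharpe ratio from per-period returns (sample standard deviation)."""
    excess = [r - risk_free for r in returns]
    mean = sum(excess) / len(excess)
    var = sum((x - mean) ** 2 for x in excess) / (len(excess) - 1)
    return mean / math.sqrt(var) * math.sqrt(periods_per_year)


# Illustrative numbers only; they are not data from the study.
monthly_values = [100.0, 120.0, 90.0, 110.0]
monthly_returns = [0.01, 0.03]
print(max_drawdown(monthly_values))
print(sharpe_ratio(monthly_returns))
```

A lower MDD and standard deviation with a higher Sharpe ratio than the benchmark is the comparison the abstract describes.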

A fundamental study on the automation of tunnel blasting design using a machine learning model (머신러닝을 이용한 터널발파설계 자동화를 위한 기초연구)

  • Kim, Yangkyun;Lee, Je-Kyum;Lee, Sean Seungwon
    • Journal of Korean Tunnelling and Underground Space Association / v.24 no.5 / pp.431-449 / 2022
  • As many tunnels have been constructed, considerable experience and technique have accumulated in tunnel design and construction. Hence, in quite a few cases of routine tunnel design work, it is sufficient to modify or supplement previous similar designs unless the tunnel has a unique structure or unusual geological conditions. In particular, for tunnel blast design it is reasonable to refer to previous similar cases, because the blast design at the design stage is preliminary: additional blast design is generally performed through test blasts before tunnel excavation begins. Meanwhile, with the arrival of the Industry 4.0 era, artificial intelligence (AI), whose availability is surging across all industry sectors, is broadly applied to tunneling and blasting. For drill-and-blast tunnels, AI is mainly applied to the estimation of blast vibration, rock mass classification, and so on; however, there are few cases where it is applied to blast pattern design. Thus, this study attempts to automate tunnel blast design by means of machine learning, a branch of artificial intelligence. For this, blast design data were collected from 25 tunnel design reports for training and 2 additional reports for testing. From these, 4 design parameters (rock mass class, road type, and the cross-sectional areas of the upper section and the bench section) were taken as input data, and 16 design elements (blast cut type, specific charge, the number of drill holes, spacing and burden for each blast hole group, etc.) as output. Based on these design data, three machine learning models (XGBoost, ANN, and SVM) were tested, and XGBoost was chosen as the best model; its results show a trend generally similar to an actual design when assumed design parameters are input.
The results of this study are not yet sufficient to perform a whole blast design; however, additional studies are planned to make practical use possible after collecting more blast design data and refining the machine learning process.

Estimation of Structural Deterioration of Sewer using Markov Chain Model (마르코프 연쇄 모델을 이용한 하수관로의 구조적 노후도 추정)

  • Kang, Byong Jun;Yoo, Soon Yu;Zhang, Chuanli;Park, Kyoo Hong
    • KSCE Journal of Civil and Environmental Engineering Research / v.43 no.4 / pp.421-431 / 2023
  • Sewer deterioration models can give decision makers important information on the future condition of the asset when they implement sewer pipe network management programs. In this study, a Markov chain model was used to estimate sewer deterioration trends based on historical structural condition assessment data obtained by CCTV inspection. The data used in this study were limited to Hume pipes with diameters of 450 mm and 600 mm in three sub-catchment areas in city A, collected by CCTV inspection projects performed in 1998-1999 and 2010-2011. As a result, it was found that sewers in sub-catchment area EM have deteriorated faster than those in the other two sub-catchments. In sub-catchment area EM, main defects were predicted to occur in 29% of 450 mm sewers and 38% of 600 mm sewers 35 years after installation, and serious failure in 62% of 450 mm sewers and 74% of 600 mm sewers 100 years after installation. In sub-catchment area SN, main defects were predicted to occur in 26% of 450 mm sewers and 35% of 600 mm sewers 35 years after installation, while in sub-catchment area HK main defects were predicted to occur in 27% of 450 mm sewers and 37% of 600 mm sewers 35 years after installation. Larger sewer pipes of 600 mm were found to deteriorate about 12 years faster than smaller sewer pipes of 450 mm. Taking a main-defect occurrence rate of 40% as the criterion for life expectancy, the life expectancy of 450 mm sewer pipes was estimated as 60 years in sub-catchment area SN, 42 years in sub-catchment area EM, and 59 years in sub-catchment area HK. For 600 mm sewer pipes, on the other hand, it was estimated as 43 years, 34 years, and 39 years in sub-catchment areas SN, EM, and HK, respectively.
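
The way a Markov chain deterioration model turns transition probabilities into a life expectancy can be sketched as below. This is a minimal illustration, not the model fitted in the paper: the three condition states (sound, main defects, serious failure), the annual transition matrix `P`, and the function names are all hypothetical. Only the idea of propagating a condition distribution year by year and reading off the year at which 40% of pipes have main defects or worse follows the abstract.

```python
def step(dist, P):
    """Advance the condition-state distribution by one year: dist_next = dist * P."""
    n = len(dist)
    return [sum(dist[i] * P[i][j] for i in range(n)) for j in range(n)]


def years_until_defect_share(P, threshold, start=(1.0, 0.0, 0.0), max_years=500):
    """First year in which the share of pipes outside state 0 reaches the threshold."""
    dist = list(start)
    for year in range(1, max_years + 1):
        dist = step(dist, P)
        if 1.0 - dist[0] >= threshold:
            return year
    return None  # threshold not reached within max_years


# Hypothetical annual transition matrix (rows sum to 1, no repair, so upper triangular):
# state 0 = sound, state 1 = main defects, state 2 = serious failure.
P = [
    [0.99, 0.01, 0.00],
    [0.00, 0.98, 0.02],
    [0.00, 0.00, 1.00],
]

print(years_until_defect_share(P, threshold=0.40))
```

In practice the transition probabilities would be calibrated from two or more inspection rounds per pipe group, which is why the estimated life expectancy differs by sub-catchment and diameter.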

Estimation for Ground Air Temperature Using GEO-KOMPSAT-2A and Deep Neural Network (심층신경망과 천리안위성 2A호를 활용한 지상기온 추정에 관한 연구)

  • Taeyoon Eom;Kwangnyun Kim;Yonghan Jo;Keunyong Song;Yunjeong Lee;Yun Gon Lee
    • Korean Journal of Remote Sensing / v.39 no.2 / pp.207-221 / 2023
  • This study proposes deep neural network models for estimating air temperature from Level 1B (L1B) datasets of GEO-KOMPSAT-2A (GK-2A). The temperature at 1.5 m above the ground affects not only daily life but also weather warnings such as cold and heat waves. Many studies estimate air temperature from the land surface temperature (LST) retrieved from satellites because air temperature has a strong relationship with LST. However, the LST algorithm, a Level 2 output of GK-2A, works only for clear-sky pixels. To overcome cloud effects, we apply a deep neural network (DNN) model that estimates air temperature from L1B data, which are radiometrically and geometrically calibrated from raw satellite data, and compare it with a linear regression model between LST and air temperature. The root mean square error (RMSE) of the estimated air temperature is used to evaluate the models. The 95 in-situ air temperature datasets contained 2,496,634 records, of which 42.1% could be paired with LST and 98.4% with L1B. Data from 2020 and 2021 are used for training and data from 2022 for validation. The DNN model is designed with an input layer taking 16 channels and four fully connected hidden layers to estimate air temperature. Using the 16 bands of L1B, the DNN, with an RMSE of 2.22℃, outperformed the baseline model, with an RMSE of 3.55℃, under clear-sky conditions, and the total RMSE including overcast samples was 3.33℃. This suggests that the DNN is able to overcome cloud effects. However, it showed different characteristics in seasonal and hourly analyses, and solar information needs to be added as input to build a general DNN model, because the summer and winter seasons showed low coefficients of determination with high standard deviations.
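
The two evaluation statistics this abstract relies on, RMSE and the coefficient of determination, can be written out as a short sketch. The function names and the sample values are hypothetical and only illustrate the definitions; they are not the study's data or code.

```python
import math


def rmse(predicted, observed):
    """Root mean square error between paired predictions and observations."""
    if len(predicted) != len(observed) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(p - o) ** 2 for p, o in zip(predicted, observed)]
    return math.sqrt(sum(squared) / len(squared))


def r_squared(predicted, observed):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for p, o in zip(predicted, observed))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


# Illustrative air temperatures in degrees Celsius (made-up values).
observed = [1.0, 2.0, 5.0]
predicted = [1.0, 2.0, 3.0]
print(rmse(predicted, observed), r_squared(predicted, observed))
```

Computing both statistics separately for clear-sky and overcast samples, or per season and hour, gives the kind of breakdown the abstract refers to.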

Retrieval of Hourly Aerosol Optical Depth Using Top-of-Atmosphere Reflectance from GOCI-II and Machine Learning over South Korea (GOCI-II 대기상한 반사도와 기계학습을 이용한 남한 지역 시간별 에어로졸 광학 두께 산출)

  • Seyoung Yang;Hyunyoung Choi;Jungho Im
    • Korean Journal of Remote Sensing / v.39 no.5_3 / pp.933-948 / 2023
  • Atmospheric aerosols not only have adverse effects on human health but also exert direct and indirect impacts on the climate system. Consequently, it is imperative to comprehend the characteristics and spatiotemporal distribution of aerosols. Numerous research endeavors have been undertaken to monitor aerosols, predominantly through the retrieval of aerosol optical depth (AOD) via satellite-based observations. Nonetheless, this approach primarily relies on a look-up table-based inversion algorithm, characterized by computationally intensive operations and associated uncertainties. In this study, a novel high-resolution AOD direct retrieval algorithm, leveraging machine learning, was developed using top-of-atmosphere reflectance data derived from the Geostationary Ocean Color Imager-II (GOCI-II), in conjunction with their differences from the past 30-day minimum reflectance, and meteorological variables from numerical models. The Light Gradient Boosting Machine (LGBM) technique was harnessed, and the resultant estimates underwent rigorous validation encompassing random, temporal, and spatial N-fold cross-validation (CV) using ground-based observation data from Aerosol Robotic Network (AERONET) AOD. The three CV results consistently demonstrated robust performance, yielding $R^2$=0.70-0.80, RMSE=0.08-0.09, and within the expected error (EE) of 75.2-85.1%. The Shapley Additive exPlanations (SHAP) analysis confirmed the substantial influence of reflectance-related variables on AOD estimation. A comprehensive examination of the spatiotemporal distribution of AOD in Seoul and Ulsan revealed that the developed LGBM model yielded results that are in close concordance with AERONET AOD over time, thereby confirming its suitability for AOD retrieval at high spatiotemporal resolution (i.e., hourly, 250 m). 
Furthermore, upon comparing data coverage, it was ascertained that the LGBM model enhanced data retrieval frequency by approximately 8.8% in comparison to the GOCI-II L2 AOD products, ameliorating issues associated with excessive masking over very illuminated surfaces that are often encountered in physics-based AOD retrieval processes.

Estimation of the Surface Currents using Mean Dynamic Topography and Satellite Altimeter Data in the East Sea (평균역학고도장과 인공위성고도계 자료를 이용한 동해 표층해류 추산)

  • Lee, Sang-Hyun;Byun, Do-Seong;Choi, Byoung-Ju;Lee, Eun-Il
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY / v.14 no.4 / pp.195-204 / 2009
  • In order to estimate sea surface current fields in the East Sea, we examined the characteristics of mean dynamic topography (MDT) fields (or mean surface current fields, MSC) generated by three different methods. This preliminary investigation evaluates the accuracy of surface currents estimated from satellite-derived sea level anomaly (SLA) data and three MDT fields in the East Sea. AVISO (Archiving, Validation and Interpretation of Satellite Oceanographic data) provides an MDT field derived from satellite observations and numerical models with $0.25^{\circ}$ horizontal resolution. The steric height field relative to 500 dbar, computed from temperature and salinity profiles in the East Sea, supplies another MDT field. Trajectory data of surface drifters (ARGOS) in the East Sea over 14 years provide another MSC field. The absolute dynamic topography (ADT) field is calculated by adding SLA to each MDT. Applying the geostrophic equation to the three different ADT fields yields three surface geostrophic current fields. The surface currents estimated by the three methods were compared with in-situ current measurements from a ship-mounted ADCP (Acoustic Doppler Current Profiler) in the southwestern East Sea in 2005. For offshore areas more than 50 km from land, the correlation coefficients (R) between the estimated and measured currents range from 0.58 to 0.73, with a root mean square deviation (RMSD) of 17.1 to $21.7\ \mathrm{cm\,s^{-1}}$. For the coastal ocean within 50 km of land, however, R ranges from 0.06 to 0.46 and RMSD ranges from 15.5 to $28.0\ \mathrm{cm\,s^{-1}}$. The results of this study reveal that a new approach to producing MDT and SLA is required to improve the accuracy of surface current estimates for the shallow coastal zones of the East Sea.
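
The chain from MDT and SLA to surface current described in this abstract can be sketched with the standard geostrophic balance, $u = -(g/f)\,\partial\eta/\partial y$ and $v = (g/f)\,\partial\eta/\partial x$, where $\eta$ is the ADT and $f = 2\Omega\sin\varphi$ is the Coriolis parameter. The function names, the constant values, and the sample slope are assumptions for illustration; the paper's gridding and differencing scheme is not given in the abstract.

```python
import math

G = 9.81          # gravitational acceleration, m s^-2 (assumed constant)
OMEGA = 7.2921e-5  # Earth's rotation rate, rad s^-1


def absolute_dynamic_topography(mdt, sla):
    """ADT is the mean dynamic topography plus the sea level anomaly (both in m)."""
    return mdt + sla


def coriolis(lat_deg):
    """Coriolis parameter f = 2 * Omega * sin(latitude)."""
    return 2.0 * OMEGA * math.sin(math.radians(lat_deg))


def geostrophic_velocity(deta_dx, deta_dy, lat_deg):
    """Surface geostrophic velocity (u eastward, v northward) from ADT slopes (m/m)."""
    f = coriolis(lat_deg)
    u = -G / f * deta_dy
    v = G / f * deta_dx
    return u, v


# Illustrative case: ADT rising 0.1 m over 100 km toward the north at 37 degrees N.
u, v = geostrophic_velocity(deta_dx=0.0, deta_dy=0.1 / 100_000.0, lat_deg=37.0)
print(f"u = {u:.3f} m/s, v = {v:.3f} m/s")
```

Because the balance divides by $f$ and assumes a frictionless, steady flow, it is less reliable in shallow coastal water, which is consistent with the lower correlations the abstract reports within 50 km of land.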