• Title/Summary/Keyword: Estimation Models

Search Result 2,813, Processing Time 0.03 seconds

Comparative Analysis of Nitrogen Concentration of Rainfall in South Korea for Nonpoint Source Pollution Model Application (비점오염모델 적용을 위한 우리나라 행정구역별 강수 중 질소농도 비교분석)

  • Choi, Dong Ho;Kim, Min-Kyeong;Hur, Seung-Oh;Hong, Sung-Chang;Choi, Soon-Kun
    • Korean Journal of Environmental Agriculture
    • /
    • v.37 no.3
    • /
    • pp.189-196
    • /
    • 2018
  • BACKGROUND: Water quality management of river requires quantification of pollutant loads and implementation of measures through monitoring study, but it requires labour and costs. Therefore, many researchers are performing nonpoint source pollution analysis using computer models. However, calibration of model parameters needs observed data. Nitrogen concentration in rainfall is one of the factors to be considered when estimating the pollutant loads through application of the nonpoint source pollution model, but the default value provided by the model is used when there are no observed data. Therefore, this study aims to provide the representative nitrogen concentration of the rainfall for the administrative district ensuring rational modeling and reliable results. METHODS AND RESULTS: In this study, rainfall monitoring data from June 2015 to December 2017 were used to determine the nitrogen concentration in rainfall for each administrative district. Range of the $NO_3{^-}$ and $NH_4{^+}$ concentrations were 0.41~6.05 mg/L, 0.39~2.27 mg/L, respectively, and T-N concentration was 0.80~7.71 mg/L. Furthermore, the national average of T-N concentration in this study was $2.84{\pm}1.42mg/L$, which was similar to the national average of T-N 3.03 mg/L presented by the Ministry of Environment in 2015. Therefore, the nitrogen concentrations suggested in this study can be considered to be resonable values. CONCLUSION: The nitrogen concentrations estimated in this study showed regional differences. Therefore, when estimating the pollutant loads through application of the nonpoint source pollution model, resonable parameter estimation of nitrogen concentration in rainfall is possible by reflecting the regional characteristics.

Feasibility of Tax Increase in Korean Welfare State via Estimation of Optimal Tax burden Ratio (적정조세부담률 추정을 통한 한국 복지국가 증세가능성에 관한 연구)

  • Kim, SeongWook
    • 한국사회정책
    • /
    • v.20 no.3
    • /
    • pp.77-115
    • /
    • 2013
  • The purpose of this study is to present empirical evidence for discussion of financing social welfare via estimating optimal tax burden in the main member countries of the OECD by using Hausman-Taylor method considering endogeneity of explanatory variables. Also, the author produced an international tax comparison index reflecting theoretical hypotheses on revenue-expenditure nexus within a model to compare real tax burden by countries and to examine feasibility of tax increase in Korea. As a result of the analysis, the higher the level of tax burden was, the higher the level of welfare expenditure was, indicating the connection between high burden and high welfare from the aspect of scale. The results also indicated that the subject countries recently entered into the state of low tax burden. Meanwhile, Korea had maintained low burden until the late 1990s but the tax burden soared up since the financial crisis related to the IMF. However, due to the impact of foreign economy and the tax reduction policy, it reentered into the low-burden state after 2009. On the other hand, the degree of social welfare expenditure's reducing tax burden has been gradually enhanced since the crisis. In this context, the current optimal tax burden ratio of Korea as of 2010 may be 25.8%~26.5% of GDP based on input of welfare expenditure variables, a percent that Korea was investigated to be a 'high tax burden-low ITC' country whose tax increase of 0.7~1.4%p may be feasible and that the success of tax system reform for tax increase might be higher probability when compare to others. However, measures of increasing social security contributions and consumption tax were analyzed to be improper from the aspect of managing finance when compared to increase in other tax items, considering the relatively higher ITC. Tax increase is not necessarily required though there may be room for tax increase; the optimal tax burden ratio can be understood as the level that may be achieved on average when compared to other nations, not as the "proper" level. Thus, discussion of tax increase should be accompanied with comprehensive understanding of models of economic developmental difference from nations and institutional & historical attributes included in specific tax mix.

Performance of Investment Strategy using Investor-specific Transaction Information and Machine Learning (투자자별 거래정보와 머신러닝을 활용한 투자전략의 성과)

  • Kim, Kyung Mock;Kim, Sun Woong;Choi, Heung Sik
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.1
    • /
    • pp.65-82
    • /
    • 2021
  • Stock market investors are generally split into foreign investors, institutional investors, and individual investors. Compared to individual investor groups, professional investor groups such as foreign investors have an advantage in information and financial power and, as a result, foreign investors are known to show good investment performance among market participants. The purpose of this study is to propose an investment strategy that combines investor-specific transaction information and machine learning, and to analyze the portfolio investment performance of the proposed model using actual stock price and investor-specific transaction data. The Korea Exchange offers daily information on the volume of purchase and sale of each investor to securities firms. We developed a data collection program in C# programming language using an API provided by Daishin Securities Cybosplus, and collected 151 out of 200 KOSPI stocks with daily opening price, closing price and investor-specific net purchase data from January 2, 2007 to July 31, 2017. The self-organizing map model is an artificial neural network that performs clustering by unsupervised learning and has been introduced by Teuvo Kohonen since 1984. We implement competition among intra-surface artificial neurons, and all connections are non-recursive artificial neural networks that go from bottom to top. It can also be expanded to multiple layers, although many fault layers are commonly used. Linear functions are used by active functions of artificial nerve cells, and learning rules use Instar rules as well as general competitive learning. The core of the backpropagation model is the model that performs classification by supervised learning as an artificial neural network. We grouped and transformed investor-specific transaction volume data to learn backpropagation models through the self-organizing map model of artificial neural networks. As a result of the estimation of verification data through training, the portfolios were rebalanced monthly. For performance analysis, a passive portfolio was designated and the KOSPI 200 and KOSPI index returns for proxies on market returns were also obtained. Performance analysis was conducted using the equally-weighted portfolio return, compound interest rate, annual return, Maximum Draw Down, standard deviation, and Sharpe Ratio. Buy and hold returns of the top 10 market capitalization stocks are designated as a benchmark. Buy and hold strategy is the best strategy under the efficient market hypothesis. The prediction rate of learning data using backpropagation model was significantly high at 96.61%, while the prediction rate of verification data was also relatively high in the results of the 57.1% verification data. The performance evaluation of self-organizing map grouping can be determined as a result of a backpropagation model. This is because if the grouping results of the self-organizing map model had been poor, the learning results of the backpropagation model would have been poor. In this way, the performance assessment of machine learning is judged to be better learned than previous studies. Our portfolio doubled the return on the benchmark and performed better than the market returns on the KOSPI and KOSPI 200 indexes. In contrast to the benchmark, the MDD and standard deviation for portfolio risk indicators also showed better results. The Sharpe Ratio performed higher than benchmarks and stock market indexes. Through this, we presented the direction of portfolio composition program using machine learning and investor-specific transaction information and showed that it can be used to develop programs for real stock investment. The return is the result of monthly portfolio composition and asset rebalancing to the same proportion. Better outcomes are predicted when forming a monthly portfolio if the system is enforced by rebalancing the suggested stocks continuously without selling and re-buying it. Therefore, real transactions appear to be relevant.

A fundamental study on the automation of tunnel blasting design using a machine learning model (머신러닝을 이용한 터널발파설계 자동화를 위한 기초연구)

  • Kim, Yangkyun;Lee, Je-Kyum;Lee, Sean Seungwon
    • Journal of Korean Tunnelling and Underground Space Association
    • /
    • v.24 no.5
    • /
    • pp.431-449
    • /
    • 2022
  • As many tunnels generally have been constructed, various experiences and techniques have been accumulated for tunnel design as well as tunnel construction. Hence, there are not a few cases that, for some usual tunnel design works, it is sufficient to perform the design by only modifying or supplementing previous similar design cases unless a tunnel has a unique structure or in geological conditions. In particular, for a tunnel blast design, it is reasonable to refer to previous similar design cases because the blast design in the stage of design is a preliminary design, considering that it is general to perform additional blast design through test blasts prior to the start of tunnel excavation. Meanwhile, entering the industry 4.0 era, artificial intelligence (AI) of which availability is surging across whole industry sector is broadly utilized to tunnel and blasting. For a drill and blast tunnel, AI is mainly applied for the estimation of blast vibration and rock mass classification, etc. however, there are few cases where it is applied to blast pattern design. Thus, this study attempts to automate tunnel blast design by means of machine learning, a branch of artificial intelligence. For this, the data related to a blast design was collected from 25 tunnel design reports for learning as well as 2 additional reports for the test, and from which 4 design parameters, i.e., rock mass class, road type and cross sectional area of upper section as well as bench section as input data as well as16 design elements, i.e., blast cut type, specific charge, the number of drill holes, and spacing and burden for each blast hole group, etc. as output. Based on this design data, three machine learning models, i.e., XGBoost, ANN, SVM, were tested and XGBoost was chosen as the best model and the results show a generally similar trend to an actual design when assumed design parameters were input. It is not enough yet to perform the whole blast design using the results from this study, however, it is planned that additional studies will be carried out to make it possible to put it to practical use after collecting more sufficient blast design data and supplementing detailed machine learning processes.

Estimation of Structural Deterioration of Sewer using Markov Chain Model (마르코프 연쇄 모델을 이용한 하수관로의 구조적 노후도 추정)

  • Kang, Byong Jun;Yoo, Soon Yu;Zhang, Chuanli;Park, Kyoo Hong
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.43 no.4
    • /
    • pp.421-431
    • /
    • 2023
  • Sewer deterioration models can offer important information on prediction of future condition of the asset to decision makers in their implementing sewer pipe networks management program. In this study, Markov chain model was used to estimate sewer deterioration trend based on the historical structural condition assessment data obtained by CCTV inspection. The data used in this study were limited to Hume pipe with diameter of 450 mm and 600 mm in three sub-catchment areas in city A, which were collected by CCTV inspection projects performed in 1998-1999 and 2010-2011. As a result, it was found that sewers in sub-catchment area EM have deteriorated faster than those in other two sub-catchments. Various main defects were to generate in 29% of 450 mm sewers and 38% of 600 mm in 35 years after the installation, while serious failure in 62% of 450 mm sewers and 74% of 600 mm in 100 years after the installation in sub-catchment area EM. In sub-catchment area SN, main defects were to generate in 26% of 450 mm sewers and 35% of 600 mm in 35 years after the installation, while in sub-catchment area HK main defects were to generate in 27% of 450 mm sewers and 37% of 600 mm in 35 years after the installation. Larger sewer pipes of 600 mm were found to deteriorate faster than smaller sewer pipes of 450 mm by about 12 years. Assuming that the percentage of main defects generation could be set as 40% to estimate the life expectancy of the sewers, it was estimated as 60 years in sub-catchment area SN, 42 years in sub-catchment area EM, 59 years in sub-catchment area HK for 450 mm sewer pipes, respectively. For 600 mm sewer pipes, on the other hand, it was estimated as 43 years, 34 years, 39 years in sub-catchment areas SN, EM, and HK, respectively.

Estimation for Ground Air Temperature Using GEO-KOMPSAT-2A and Deep Neural Network (심층신경망과 천리안위성 2A호를 활용한 지상기온 추정에 관한 연구)

  • Taeyoon Eom;Kwangnyun Kim;Yonghan Jo;Keunyong Song;Yunjeong Lee;Yun Gon Lee
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.2
    • /
    • pp.207-221
    • /
    • 2023
  • This study suggests deep neural network models for estimating air temperature with Level 1B (L1B) datasets of GEO-KOMPSAT-2A (GK-2A). The temperature at 1.5 m above the ground impact not only daily life but also weather warnings such as cold and heat waves. There are many studies to assume the air temperature from the land surface temperature (LST) retrieved from satellites because the air temperature has a strong relationship with the LST. However, an algorithm of the LST, Level 2 output of GK-2A, works only clear sky pixels. To overcome the cloud effects, we apply a deep neural network (DNN) model to assume the air temperature with L1B calibrated for radiometric and geometrics from raw satellite data and compare the model with a linear regression model between LST and air temperature. The root mean square errors (RMSE) of the air temperature for model outputs are used to evaluate the model. The number of 95 in-situ air temperature data was 2,496,634 and the ratio of datasets paired with LST and L1B show 42.1% and 98.4%. The training years are 2020 and 2021 and 2022 is used to validate. The DNN model is designed with an input layer taking 16 channels and four hidden fully connected layers to assume an air temperature. As a result of the model using 16 bands of L1B, the DNN with RMSE 2.22℃ showed great performance than the baseline model with RMSE 3.55℃ on clear sky conditions and the total RMSE including overcast samples was 3.33℃. It is suggested that the DNN is able to overcome cloud effects. However, it showed different characteristics in seasonal and hourly analysis and needed to append solar information as inputs to make a general DNN model because the summer and winter seasons showed a low coefficient of determinations with high standard deviations.

Retrieval of Hourly Aerosol Optical Depth Using Top-of-Atmosphere Reflectance from GOCI-II and Machine Learning over South Korea (GOCI-II 대기상한 반사도와 기계학습을 이용한 남한 지역 시간별 에어로졸 광학 두께 산출)

  • Seyoung Yang;Hyunyoung Choi;Jungho Im
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.5_3
    • /
    • pp.933-948
    • /
    • 2023
  • Atmospheric aerosols not only have adverse effects on human health but also exert direct and indirect impacts on the climate system. Consequently, it is imperative to comprehend the characteristics and spatiotemporal distribution of aerosols. Numerous research endeavors have been undertaken to monitor aerosols, predominantly through the retrieval of aerosol optical depth (AOD) via satellite-based observations. Nonetheless, this approach primarily relies on a look-up table-based inversion algorithm, characterized by computationally intensive operations and associated uncertainties. In this study, a novel high-resolution AOD direct retrieval algorithm, leveraging machine learning, was developed using top-of-atmosphere reflectance data derived from the Geostationary Ocean Color Imager-II (GOCI-II), in conjunction with their differences from the past 30-day minimum reflectance, and meteorological variables from numerical models. The Light Gradient Boosting Machine (LGBM) technique was harnessed, and the resultant estimates underwent rigorous validation encompassing random, temporal, and spatial N-fold cross-validation (CV) using ground-based observation data from Aerosol Robotic Network (AERONET) AOD. The three CV results consistently demonstrated robust performance, yielding R2=0.70-0.80, RMSE=0.08-0.09, and within the expected error (EE) of 75.2-85.1%. The Shapley Additive exPlanations(SHAP) analysis confirmed the substantial influence of reflectance-related variables on AOD estimation. A comprehensive examination of the spatiotemporal distribution of AOD in Seoul and Ulsan revealed that the developed LGBM model yielded results that are in close concordance with AERONET AOD over time, thereby confirming its suitability for AOD retrieval at high spatiotemporal resolution (i.e., hourly, 250 m). Furthermore, upon comparing data coverage, it was ascertained that the LGBM model enhanced data retrieval frequency by approximately 8.8% in comparison to the GOCI-II L2 AOD products, ameliorating issues associated with excessive masking over very illuminated surfaces that are often encountered in physics-based AOD retrieval processes.

Estimation of the Surface Currents using Mean Dynamic Topography and Satellite Altimeter Data in the East Sea (평균역학고도장과 인공위성고도계 자료를 이용한 동해 표층해류 추산)

  • Lee, Sang-Hyun;Byun, Do-Seong;Choi, Byoung-Ju;Lee, Eun-Il
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY
    • /
    • v.14 no.4
    • /
    • pp.195-204
    • /
    • 2009
  • In order to estimate sea surface current fields in the East Sea, we examined characteristics of mean dynamic topography (MDT) fields (or mean surface current field, MSC) generated from three different methods. This preliminary investigation evaluates the accuracy of surface currents estimated from satellite-derived sea level anomaly (SLA) data and three MDT fields in the East Sea. AVISO (Archiving, Validation and Interpretation of Satellite Oceanographic data) provides a MDT field derived from satellite observation and numerical models with $0.25^{\circ}$ horizontal resolution. Steric height field relative to 500 dbar from temperature and salinity profiles in the East Sea supplies another MDT field. Trajectory data of surface drifters (ARGOS) in the East Sea for 14 years provide another MSC field. Absolute dynamic topography (ADT) field is calculated by adding SLA to each MDT. Application of geostrophic equation to three different ADT fields yields three surface geostrophic current fields. Comparisons were made between the estimated surface currents from the three different methods and in-situ current measurements from a ship-mounted ADCP (Acoustic Doppler Current Profiler) in the southwestern East Sea in 2005. For offshore areas more than 50 km away from the land, the correlation coefficients (R) between the estimated versus the measured currents range from 0.58 to 0.73, with 17.1 to $21.7\;cm\;s^{-1}$ root mean square deviation (RMSD). For coastal ocean within 50 km from the land, however, R ranges from 0.06 to 0.46 and RMSD ranges from 15.5 to $28.0\;cm\;s^{-1}$. Results from this study reveal that a new approach in producing MDT and SLA is required to improve the accuracy of surface current estimations for the shallow costal zones of the East Sea.

Estimation of potential distribution of sweet potato weevil (Cylas formicarius) and climate change impact using MaxEnt (MaxEnt를 활용한 개미바구미(Cylas formicarius)의 잠재 분포와 기후변화 영향 모의)

  • Jinsol Hong;Heewon Hong;Sumin Pi;Soohyun Lee;Jae Ha Shin;Yongeun Kim;Kijong Cho
    • Korean Journal of Environmental Biology
    • /
    • v.41 no.4
    • /
    • pp.505-518
    • /
    • 2023
  • The key to invasive pest management lies in preemptive action. However, most current research using species distribution models is conducted after an invasion has occurred. This study modeled the potential distribution of the globally notorious sweet potato pest, the sweet potato weevil(Cylas formicarius), that has not yet invaded Korea using MaxEnt. Using global occurrence data, bioclimatic variables, and topsoil characteristics, MaxEnt showed high explanatory power as both the training and test areas under the curve exceeded 0.9. Among the environmental variables used in this study, minimum temperature in the coldest month (BIO06), precipitation in the driest month (BIO14), mean diurnal range (BIO02), and bulk density (BDOD) were identified as key variables. The predicted global distribution showed high values in most countries where the species is currently present, with a significant potential invasion risk in most South American countries where C. formicarius is not yet present. In Korea, Jeju Island and the southwestern coasts of Jeollanam-do showed very high probabilities. The impact of climate change under shared socioeconomic pathway (SSP) scenarios indicated an expansion along coasts as climate change progresses. By applying the 10th percentile minimum training presence rule, the potential area of occurrence was estimated at 1,439 km2 under current climate conditions and could expand up to 9,485 km2 under the SSP585 scenario. However, the model predicted that an inland invasion would not be serious. The results of this study suggest a need to focus on the risk of invasion in islands and coastal areas.

Business Application of Convolutional Neural Networks for Apparel Classification Using Runway Image (합성곱 신경망의 비지니스 응용: 런웨이 이미지를 사용한 의류 분류를 중심으로)

  • Seo, Yian;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.3
    • /
    • pp.1-19
    • /
    • 2018
  • Large amount of data is now available for research and business sectors to extract knowledge from it. This data can be in the form of unstructured data such as audio, text, and image data and can be analyzed by deep learning methodology. Deep learning is now widely used for various estimation, classification, and prediction problems. Especially, fashion business adopts deep learning techniques for apparel recognition, apparel search and retrieval engine, and automatic product recommendation. The core model of these applications is the image classification using Convolutional Neural Networks (CNN). CNN is made up of neurons which learn parameters such as weights while inputs come through and reach outputs. CNN has layer structure which is best suited for image classification as it is comprised of convolutional layer for generating feature maps, pooling layer for reducing the dimensionality of feature maps, and fully-connected layer for classifying the extracted features. However, most of the classification models have been trained using online product image, which is taken under controlled situation such as apparel image itself or professional model wearing apparel. This image may not be an effective way to train the classification model considering the situation when one might want to classify street fashion image or walking image, which is taken in uncontrolled situation and involves people's movement and unexpected pose. Therefore, we propose to train the model with runway apparel image dataset which captures mobility. This will allow the classification model to be trained with far more variable data and enhance the adaptation with diverse query image. To achieve both convergence and generalization of the model, we apply Transfer Learning on our training network. As Transfer Learning in CNN is composed of pre-training and fine-tuning stages, we divide the training step into two. First, we pre-train our architecture with large-scale dataset, ImageNet dataset, which consists of 1.2 million images with 1000 categories including animals, plants, activities, materials, instrumentations, scenes, and foods. We use GoogLeNet for our main architecture as it has achieved great accuracy with efficiency in ImageNet Large Scale Visual Recognition Challenge (ILSVRC). Second, we fine-tune the network with our own runway image dataset. For the runway image dataset, we could not find any previously and publicly made dataset, so we collect the dataset from Google Image Search attaining 2426 images of 32 major fashion brands including Anna Molinari, Balenciaga, Balmain, Brioni, Burberry, Celine, Chanel, Chloe, Christian Dior, Cividini, Dolce and Gabbana, Emilio Pucci, Ermenegildo, Fendi, Giuliana Teso, Gucci, Issey Miyake, Kenzo, Leonard, Louis Vuitton, Marc Jacobs, Marni, Max Mara, Missoni, Moschino, Ralph Lauren, Roberto Cavalli, Sonia Rykiel, Stella McCartney, Valentino, Versace, and Yve Saint Laurent. We perform 10-folded experiments to consider the random generation of training data, and our proposed model has achieved accuracy of 67.2% on final test. Our research suggests several advantages over previous related studies as to our best knowledge, there haven't been any previous studies which trained the network for apparel image classification based on runway image dataset. We suggest the idea of training model with image capturing all the possible postures, which is denoted as mobility, by using our own runway apparel image dataset. Moreover, by applying Transfer Learning and using checkpoint and parameters provided by Tensorflow Slim, we could save time spent on training the classification model as taking 6 minutes per experiment to train the classifier. This model can be used in many business applications where the query image can be runway image, product image, or street fashion image. To be specific, runway query image can be used for mobile application service during fashion week to facilitate brand search, street style query image can be classified during fashion editorial task to classify and label the brand or style, and website query image can be processed by e-commerce multi-complex service providing item information or recommending similar item.