• Title/Summary/Keyword: Validation Region

Search Result 291, Processing Time 0.031 seconds

A Two-Stage Learning Method of CNN and K-means RGB Cluster for Sentiment Classification of Images (이미지 감성분류를 위한 CNN과 K-means RGB Cluster 이-단계 학습 방안)

  • Kim, Jeongtae;Park, Eunbi;Han, Kiwoong;Lee, Junghyun;Lee, Hong Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.139-156
    • /
    • 2021
  • The biggest reason for using a deep learning model in image classification is that it is possible to consider the relationship between each region by extracting each region's features from the overall information of the image. However, the CNN model may not be suitable for emotional image data without the image's regional features. To solve the difficulty of classifying emotion images, many researchers each year propose a CNN-based architecture suitable for emotion images. Studies on the relationship between color and human emotion were also conducted, and results were derived that different emotions are induced according to color. In studies using deep learning, there have been studies that apply color information to image subtraction classification. The case where the image's color information is additionally used than the case where the classification model is trained with only the image improves the accuracy of classifying image emotions. This study proposes two ways to increase the accuracy by incorporating the result value after the model classifies an image's emotion. Both methods improve accuracy by modifying the result value based on statistics using the color of the picture. When performing the test by finding the two-color combinations most distributed for all training data, the two-color combinations most distributed for each test data image were found. The result values were corrected according to the color combination distribution. This method weights the result value obtained after the model classifies an image's emotion by creating an expression based on the log function and the exponential function. Emotion6, classified into six emotions, and Artphoto classified into eight categories were used for the image data. Densenet169, Mnasnet, Resnet101, Resnet152, and Vgg19 architectures were used for the CNN model, and the performance evaluation was compared before and after applying the two-stage learning to the CNN model. Inspired by color psychology, which deals with the relationship between colors and emotions, when creating a model that classifies an image's sentiment, we studied how to improve accuracy by modifying the result values based on color. Sixteen colors were used: red, orange, yellow, green, blue, indigo, purple, turquoise, pink, magenta, brown, gray, silver, gold, white, and black. It has meaning. Using Scikit-learn's Clustering, the seven colors that are primarily distributed in the image are checked. Then, the RGB coordinate values of the colors from the image are compared with the RGB coordinate values of the 16 colors presented in the above data. That is, it was converted to the closest color. Suppose three or more color combinations are selected. In that case, too many color combinations occur, resulting in a problem in which the distribution is scattered, so a situation fewer influences the result value. Therefore, to solve this problem, two-color combinations were found and weighted to the model. Before training, the most distributed color combinations were found for all training data images. The distribution of color combinations for each class was stored in a Python dictionary format to be used during testing. During the test, the two-color combinations that are most distributed for each test data image are found. After that, we checked how the color combinations were distributed in the training data and corrected the result. We devised several equations to weight the result value from the model based on the extracted color as described above. The data set was randomly divided by 80:20, and the model was verified using 20% of the data as a test set. After splitting the remaining 80% of the data into five divisions to perform 5-fold cross-validation, the model was trained five times using different verification datasets. Finally, the performance was checked using the test dataset that was previously separated. Adam was used as the activation function, and the learning rate was set to 0.01. The training was performed as much as 20 epochs, and if the validation loss value did not decrease during five epochs of learning, the experiment was stopped. Early tapping was set to load the model with the best validation loss value. The classification accuracy was better when the extracted information using color properties was used together than the case using only the CNN architecture.

Construction and estimation of soil moisture site with FDR and COSMIC-ray (SM-FC) sensors for calibration/validation of satellite-based and COSMIC-ray soil moisture products in Sungkyunkwan university, South Korea (위성 토양수분 데이터 및 COSMIC-ray 데이터 보정/검증을 위한 성균관대학교 내 FDR 센서 토양수분 측정 연구(SM-FC) 및 데이터 분석)

  • Kim, Hyunglok;Sunwoo, Wooyeon;Kim, Seongkyun;Choi, Minha
    • Journal of Korea Water Resources Association
    • /
    • v.49 no.2
    • /
    • pp.133-144
    • /
    • 2016
  • In this study, Frequency Domain Reflectometry (FDR) and COSMIC-ray soil moisture (SM) stations were installed at Sungkyunkwan University in Suwon, South Korea. To provide reliable information about SM, soil property test, time series analysis of measured soil moisture, and comparison of measured SM with satellite-based SM product are conducted. In 2014, six FDR stations were set up for obtaining SM. Each of the stations had four FDR sensors with soil depth from 5 cm to 40 cm at 5~10 cm different intervals. The result showed that study region had heterogeneous soil layer properties such as sand and loamy sand. The measured SM data showed strong coupling with precipitation. Furthermore, they had a high correlation coefficient and a low root mean square deviation (RMSD) as compared to the satellite-based SM products. After verifying the accuracy of the data in 2014, four FDR stations and one COSMIC-ray station were additionally installed to establish the Soil Moisture site with FDR and COSMIC-ray, called SM-FC. COSMIC-ray-based SM had a high correlation coefficient of 0.95 compared with mean SM of FDR stations. From these results, the SM-FC will give a valuable insight for researchers into investigate satellite- and model-based SM validation study in South Korea.

Recent Changes in Bloom Dates of Robinia pseudoacacia and Bloom Date Predictions Using a Process-Based Model in South Korea (최근 12년간 아까시나무 만개일의 변화와 과정기반모형을 활용한 지역별 만개일 예측)

  • Kim, Sukyung;Kim, Tae Kyung;Yoon, Sukhee;Jang, Keunchang;Lim, Hyemin;Lee, Wi Young;Won, Myoungsoo;Lim, Jong-Hwan;Kim, Hyun Seok
    • Journal of Korean Society of Forest Science
    • /
    • v.110 no.3
    • /
    • pp.322-340
    • /
    • 2021
  • Due to climate change and its consequential spring temperature rise, flowering time of Robinia pseudoacacia has advanced and a simultaneous blooming phenomenon occurred in different regions in South Korea. These changes in flowering time became a major crisis in the domestic beekeeping industry and the demand for accurate prediction of flowering time for R. pseudoacacia is increasing. In this study, we developed and compared performance of four different models predicting flowering time of R. pseudoacacia for the entire country: a Single Model for the country (SM), Modified Single Model (MSM) using correction factors derived from SM, Group Model (GM) estimating parameters for each region, and Local Model (LM) estimating parameters for each site. To achieve this goal, the bloom date data observed at 26 points across the country for the past 12 years (2006-2017) and daily temperature data were used. As a result, bloom dates for the north central region, where spring temperature increase was more than two-fold higher than southern regions, have advanced and the differences compared with the southwest region decreased by 0.7098 days per year (p-value=0.0417). Model comparisons showed MSM and LM performed better than the other models, as shown by 24% and 15% lower RMSE than SM, respectively. Furthermore, validation with 16 additional sites for 4 years revealed co-krigging of LM showed better performance than expansion of MSM for the entire nation (RMSE: p-value=0.0118, Bias: p-value=0.0471). This study improved predictions of bloom dates for R. pseudoacacia and proposed methods for reliable expansion to the entire nation.

Study of Prediction Model Improvement for Apple Soluble Solids Content Using a Ground-based Hyperspectral Scanner (지상용 초분광 스캐너를 활용한 사과의 당도예측 모델의 성능향상을 위한 연구)

  • Song, Ahram;Jeon, Woohyun;Kim, Yongil
    • Korean Journal of Remote Sensing
    • /
    • v.33 no.5_1
    • /
    • pp.559-570
    • /
    • 2017
  • A partial least squares regression (PLSR) model was developed to map the internal soluble solids content (SSC) of apples using a ground-based hyperspectral scanner that could simultaneously acquire outdoor data and capture images of large quantities of apples. We evaluated the applicability of various preprocessing techniques to construct an optimal prediction model and calculated the optimal band through a variable importance in projection (VIP)score. From the 515 bands of hyperspectral images extracted at wavelengths of 360-1019 nm, 70 reflectance spectra of apples were extracted, and the SSC ($^{\circ}Brix$) was measured using a digital photometer. The optimal prediction model wasselected considering the root-mean-square error of cross-validation (RMSECV), root-mean-square error of prediction (RMSEP) and coefficient of determination of prediction $r_p^2$. As a result, multiplicative scatter correction (MSC)-based preprocessing methods were better than others. For example, when a combination of MSC and standard normal variate (SNV) was used, RMSECV and RMSEP were the lowest at 0.8551 and 0.8561 and $r_c^2$ and $r_p^2$ were the highest at 0.8533 and 0.6546; wavelength ranges of 360-380, 546-690, 760, 915, 931-939, 942, 953, 971, 978, 981, 988, and 992-1019 nm were most influential for SSC determination. The PLSR model with the spectral value of the corresponding region confirmed that the RMSEP decreased to 0.6841 and $r_p^2$ increased to 0.7795 as compared to the values of the entire wavelength band. In this study, we confirmed the feasibility of using a hyperspectral scanner image obtained from outdoors for the SSC measurement of apples. These results indicate that the application of field data and sensors could possibly expand in the future.

Oceanic Application of Satellite Synthetic Aperture Radar - Focused on Sea Surface Wind Retrieval - (인공위성 합성개구레이더 영상 자료의 해양 활용 - 해상풍 산출을 중심으로 -)

  • Jang, Jae-Cheol;Park, Kyung-Ae
    • Journal of the Korean earth science society
    • /
    • v.40 no.5
    • /
    • pp.447-463
    • /
    • 2019
  • Sea surface wind is a fundamental element for understanding the oceanic phenomena and for analyzing changes of the Earth environment caused by global warming. Global research institutes have developed and operated scatterometers to accurately and continuously observe the sea surface wind, with the accuracy of approximately ${\pm}20^{\circ}$ for wind direction and ${\pm}2m\;s^{-1}$ for wind speed. Given that the spatial resolution of the scatterometer is 12.5-25.0 km, the applicability of the data to the coastal area is limited due to complicated coastal lines and many islands around the Korean Peninsula. In contrast, Synthetic Aperture Radar (SAR), one of microwave sensors, is an all-weather instrument, which enables us to retrieve sea surface wind with high resolution (<1 km) and compensate the sparse resolution of the scatterometer. In this study, we investigated the Geophysical Model Functions (GMF), which are the algorithms for retrieval of sea surface wind speed from the SAR data depending on each band such as C-, L-, or X-band radar. We reviewed in the simulation of the backscattering coefficients for relative wind direction, incidence angle, and wind speed by applying LMOD, CMOD, and XMOD model functions, and analyzed the characteristics of each GMF. We investigated previous studies about the validation of wind speed from the SAR data using these GMFs. The accuracy of sea surface wind from SAR data changed with respect to observation mode, GMF type, reference data for validation, preprocessing method, and the method for calculation of relative wind direction. It is expected that this study contributes to the potential users of SAR images who retrieve wind speeds from SAR data at the coastal region around the Korean Peninsula.

Predicting of the $^{14}C$ Activity in Rice Plants Exposed to $^{14}CO_2$ Gas for a Short Period of Time ($^{14}CO_2$가스에 단기간 노출된 벼의 $^{14}C$ 오염 예측)

  • Jun, In;Lim, Kwang-Muk;Keum, Dong-Kwon;Choi, Young-Ho;Han, Moon-Hee
    • Journal of Radiation Protection and Research
    • /
    • v.33 no.4
    • /
    • pp.135-141
    • /
    • 2008
  • This paper describes a dynamic compartment model to predict the time-dependent $^{14}C$ activity in a plant as a result of a direct exposure to an amount of $^{14}CO_2$ for a short period of time, and experimental results for the model validation. In the model, the plant consists of two compartments of the body and ears, and five carbon fluxes between the compartments, which are the function of parameters relating to the growth and photosynthesis of a plant, are considered. Model predictions were made for an investigation into the effects of the exposure time, the elapsed exposure time, and the model parameters on the $^{14}C$ radioactivity of a plant. The present model converged to a region where the specific activity model is applicable when the elapsed time of the exposure was extended up to the harvest time of a plant. The $^{14}C$ activity of a plant was predicted to be the greatest when the exposure had happened in the period between the flowering and ears-maturity on account of the most vigorous photosynthesis rate for the period. Comparison of model predictions with the observed 14C radioactivity of rice plants showed that the present model could predict the $^{14}C$ radioactivity of the rice plants reasonably well.

Odor Modeling of acetaldehyde in Gumi National Industrial Complex (구미국가산업단지의 아세트알데히드 악취모델링)

  • Lee, Eun Ju;Akhtar, Muhammad Saeed;Lim, Kwang-Hee
    • Korean Chemical Engineering Research
    • /
    • v.54 no.1
    • /
    • pp.22-35
    • /
    • 2016
  • In this study CALPUFF modeling was performed to establish a correlation between regions of frequent civil odor complaints near Gumi national industrial complex and odor-emission facilities of synthetic fiber manufacturers in the same area as main acetaldehyde-emission point sources. As a result of the CALPUFF modeling, the maximum concentration of acetaldehyde in Gumi national industrial complex was reduced from O ($10^{-5}g/m^3$) to O ($10^{-6}g/m^3$) upon improving emission facilities of T company so that the maximum concentrations of acetaldehyde frequently appeared in complex 3. In addition, the predicted range of the maximum acetaldehyde concentration in Gumi national industrial complex was also improved in comparison with that prior to improving emission facilities of T company. These maximum concentrations of acetaldehyde obtained to estimate the expected contribution of total acetaldehyde point source by CALPUFF modeling showed the similar values to those measured in 'HAPs investigation in the region of Gumi-Daegu' and were consistent to the trend of civil odor complaints. Therefore, the expected contribution of total acetaldehyde point source was validated. The relative contribution of T company upon improving its emission facilities was predicted to be lowered by more than factor of two, compared to that prior to improving its emission facilities. To the contrary, the relative contribution of W company upon improving emission facilities of T company was predicted to be increased by more than factor of two, compared to that prior to improving emission facilities of T company. This indicates that the contribution of aldehyde point sources of W company was relatively increased upon improving emission facilities of T company.

Calibration and Validation of Ocean Color Satellite Imagery (해양수색 위성자료의 검.보정)

  • ;B. G. Mitchell
    • Journal of Environmental Science International
    • /
    • v.10 no.6
    • /
    • pp.431-436
    • /
    • 2001
  • Variations in phytoplankton concentrations result from changes of the ocean color caused by phytoplankton pigments. Thus, ocean spectral reflectance for low chlorophyll waters are blue and high chlorophyll waters tend to have green reflectance. In the Korea region, clear waters and the open sea in the Kuroshio regions of the East China Sea have low chlorophyll. As one moves even closer In the northwestern part of the East China Sea, the situation becomes much more optically complicated, with contributions not only from higher concentration of phytoplankton, but also from sediments and dissolved materials from terrestrial and sea bottom sources. The color often approaches yellow-brown in the turbidity waters (Case Ⅱ waters). To verify satellite ocean color retrievals, or to develop new algorithms for complex case Ⅱ regions requires ship-based studies. In this study, we compared the chlorophyll retrievals from NASA's SeaWiFS sensor with chlorophyll values determined with standard fluorometric methods during two cruises on Korean NFRDI ships. For the SeaWiFS data, we used the standard NASA SeaWiFS algorithm to estimate the chlorophyll_a distribution around the Korean waters using Orbview/ SeaWiFS satellite data acquired by our HPRT station at NFRDl. We studied In find out the relationship between the measured chlorophyll_a from the ship and the estimated chlorophyll_a from the SeaWiFs satellite data around the northern part of the East China Sea, in February, and May, 2000. The relationship between the measured chlorophyll_a and the SeaWiFS chlorophyll_a shows following the equations (1) In the northern part of the East China Sea. Chlorophyll_a =0.121Ln(X) + 0.504, R²= 0.73 (1) We also determined total suspended sediment mass (55) and compared it with SeaWiFS spectral band ratio. A suspended solid algorithm was composed of in-.situ data and the ratio (L/sub WN/(490 ㎚)L/sub WN/(555 ㎚) of the SeaWiFS wavelength bands. The relationship between the measured suspended solid and the SeaWiFS band ratio shows following the equation (2) in the northern part of the East China Sea. SS = -0.703 Ln(X) + 2.237, R²= 0.62 (2) In the near future, NFRDI will develop algorithms for quantifying the ocean color properties around the Korean waters, with the data from regular ocean observations using its own research vessels and from three satellites, KOMPSAT/OSMl, Terra/MODIS and Orbview/SeaWiFS.

  • PDF

Accuracy of Imputation of Microsatellite Markers from BovineSNP50 and BovineHD BeadChip in Hanwoo Population of Korea

  • Sharma, Aditi;Park, Jong-Eun;Park, Byungho;Park, Mi-Na;Roh, Seung-Hee;Jung, Woo-Young;Lee, Seung-Hwan;Chai, Han-Ha;Chang, Gul-Won;Cho, Yong-Min;Lim, Dajeong
    • Genomics & Informatics
    • /
    • v.16 no.1
    • /
    • pp.10-13
    • /
    • 2018
  • Until now microsatellite (MS) have been a popular choice of markers for parentage verification. Recently many countries have moved or are in process of moving from MS markers to single nucleotide polymorphism (SNP) markers for parentage testing. FAO-ISAG has also come up with a panel of 200 SNPs to replace the use of MS markers in parentage verification. However, in many countries most of the animals were genotyped by MS markers till now and the sudden shift to SNP markers will render the data of those animals useless. As National Institute of Animal Science in South Korea plans to move from standard ISAG recommended MS markers to SNPs, it faces the dilemma of exclusion of old animals that were genotyped by MS markers. Thus to facilitate this shift from MS to SNPs, such that the existing animals with MS data could still be used for parentage verification, this study was performed. In the current study we performed imputation of MS markers from the SNPs in the 500-kb region of the MS marker on either side. This method will provide an easy option for the labs to combine the data from the old and the current set of animals. It will be a cost efficient replacement of genotyping with the additional markers. We used 1,480 Hanwoo animals with both the MS data and SNP data to impute in the validation animals. We also compared the imputation accuracy between BovineSNP50 and BovineHD BeadChip. In our study the genotype concordance of 40% and 43% was observed in the BovineSNP50 and BovineHD BeadChip respectively.

Finite Element Analysis of Ultra High Performance Fiber Reinforced Concrete 50M Composite Box Girder (초고강도 섬유보강 콘크리트 50M 합성 박스거더의 유한요소해석)

  • Makhbal, Tsas-Orgilmaa;Kim, Do-Hyun;Han, Sang-Mook
    • Journal of the Korean Recycled Construction Resources Institute
    • /
    • v.6 no.2
    • /
    • pp.100-107
    • /
    • 2018
  • The material and geometrical nonlinear finite elment analysis of UHPFRC 50M composite box girder was carried out. Constitute law in tension and compressive region of UHPFRC and HPC were modeled based on specimen test. The accuracy of nonlinear FEM analysis was verified by the experimental result of UHPFRC 50M composite girder. The UHPFRC 50M segmental composite box girder which has 1.5% steel fiber of volume fraction, 135MPa compressive strength and 18MPa tensile strength was tested. The post-tensioned UHPFRC composite girder consisted of three segment UHPFRC U-girder and High Strength Concrete reinforced slab. The parts of UHPFRC girder were modeled by 8nodes hexahedron elements and reinforcement bars and tendons were built by 2nodes linear elements by Midas FEA software. The constitutive laws of concrete materials were selected Multi-linear model both of tension and compression function under total strain crack model, which was included in classifying of smeared crack model. The nonlinearity of reinforcement elements and tendon was simulated by Von Mises criteria. The nonlinear static analysis was applied by incremental-iteration method with convergence criteria of Newton-Raphson. The validation of numerical analysis was verified by comparison with experimental result and numerical analysis result of load-deflection response, neutral axis coordinate change, and cracking pattern of girder. The load-deflection response was fitted very well with comparison to the experimental result. The finite element analysis is seen to satisfactorily predict flexural behavioral responses of post-tensioned, reinforced UHPFRC composite box girder.