• Title/Summary/Keyword: value prediction

Search Result 2,422, Processing Time 0.027 seconds

Evaluation for Rock Cleavage Using Distribution of Microcrack Spacings (I) (미세균열의 간격 분포를 이용한 결의 평가(I))

  • Park, Deok-Won
    • The Journal of the Petrological Society of Korea
    • /
    • v.25 no.1
    • /
    • pp.13-27
    • /
    • 2016
  • The characteristics of the rock cleavage inherent in Jurassic granite from Geochang were analysed. The phases of distribution of microcrack spacings were derived from the enlarged photomicrographs(${\times}6.7$) of the thin section. The evaluation for the six directions of rock cleavages was performed using nine parameters such as (1) frequency of microcrack spacing(N), (2) frequency ratio(${\leq}1mm$ and 4 mm >) to total spacing frequency(N:191), (3) spacing ratio(${\leq}1mm$) to total spacing(118.49 mm), (4) mean spacing($S_{mean}$), (5) difference value($S_{mean}-S_{median}$) between mean spacing and median spacing($S_{median}$), (6) density of spacing, (7) median spacing, (8) reduction ratio of spacing frequency to length frequency and (9) magnitude of exponent(${\lambda}$ and b) related to the distribution type of diagram. Especially the close dependence between the above spacing parameters and the parameters from the spacing-cumulative frequency diagrams was derived. The results of correlation analysis between the values of parameters for three rock cleavages and those for three planes are as follows. The values of (I) parameters(1, 2 and 3), (II) parameters(4, 5 and 6), (III) parameter(7), (IV) parameter(8) and (V) parameter(9) show the various orders of H(hardway, H1+H2) < G(grain, G1+G2) < R(rift, R1+R2), R < G < H, R < H < G, G < H < R and H < G < R, respectively. On the contrary, the values of the above four groups(I~IV) of parameters for three planes show reverse orders. This type of correlation analysis is useful for discriminating three quarrying planes. Six spacing-cumulative frequency diagrams were arranged in increasing order on the value of main parameter($S_{mean}-S_{median}$). These diagrams show an order of R2 < R1 < G2 < G1 < H2 < H1 from the related chart. In other words, the above six diagrams can be summarized in order of rift(R1+R2) < grain(G1+G2) < hardway(H1+H2). These results indicate a relative magnitude of rock cleavage related to microcrack spacing. Especially, the above main parameter could provide advanced information for prediction the order of arrangement among the diagrams.

Consolidation Behavior of Soft Ground by Prefabricated Vertical Drains (페이퍼드레인 공법에 의한 연약지반의 압밀거동)

  • Lee, Dal Won;Kang, Yea Mook;Kim, Seong Wan;Chee, In Taeg
    • Korean Journal of Agricultural Science
    • /
    • v.24 no.2
    • /
    • pp.145-155
    • /
    • 1997
  • The large scaled field test by prefabricated vertical drains was performed to evaluate the superiority of vertical discharge capacity for drain materials through compare and analyze the time-settlement behavior with drain spacing and the compression index and consolidation coefficient obtained by laboratory experiments and field monitoring system. 1. The relation of measurement settlement($S_m$) versus design settlement($S_t$) and measurement consolidation ratio($U_m$) versus design consolidation ratio($U_t$) were shown $S_m=(1.0{\sim}1.1)S_t$, $U_m=(1.13{\sim}1.17)U_t$ at 1.0m drain spacing and $S_m=(0.7{\sim}0.8)S_t$, $U_m=(0.92{\sim}0.99)U_t$ at l.5m drain spacing, respectively. 2. The relation of field compressing index($C_{cfield}$) and virgin compression index($V_{cclab.}$) was shown $C_{cfield}=(1.0{\sim}1.2)V_{cclab.}$, But it was nearly same value when considered the error with determination method of virgin compression index and prediction method of total settlement. 3. Field consolidation coefficient was larger than laboratory consolidation coefficient, and the consolidation coefficient ratio($C_h/C_v$) were $C_h=(2.4{\sim}3.0)C_v$. $C_h=(3.5{\sim}4.3)C_v$ at 1.0m and 1.5m drain spacing and increased with increasing of drain spacing. 4. The evaluation of vertical discharge capacity with drain spacing from the results of the consolidation coefficient ratio showed largely superior in case the Mebra drain and Amer drain than other drain materials at 1.0m and 1.5m drain spacing, while the values showed nearly same value in case same drain spacing.

  • PDF

Prediction of Evapotranspiration from Grape Vines in Suwon with the FAO Penman-Monteith Equation (FAO Penman-Monteith 공식을 이용한 수원지역 포도 수체 증발산량 예측)

  • Yun, Seok-Kyu;Hur, Seung-Oh;Kim, Seung-Heui;Park, Seo-Jun;Kim, Jeong-Bae;Choi, In-Myung
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.11 no.3
    • /
    • pp.111-117
    • /
    • 2009
  • Food and Agricultural Organization (FAO) Penman-Monteith (PM) equation is one of the most widely used equations for predicting evapotranspiration (ET) of crops. The ET rate and the base crop coefficients ($K_{cb}$) of the two different grape vines (i.e., Campbell Early and Kyoho) cultivated in Suwon were calculated by using the FAO PM equation. The ET rate of Campbell Early was $2.41\;mm\;day^{-1}$ and that of Kyoho was $2.22\;mm\;day^{-1}$ in August when the leaf area index was 2.2. During this period, the $K_{cb}$ of Campbell Early based on the FAO PM equation was on average 0.49 with the maximum value of 0.72. On the other hand, the $K_{cb}$ of Kyoho was averaged to be 0.45 with the maximum value of 0.64. The seasonal leaf area index for two grape cultivars was measured as 0.15 in April, 0.5 in May, 1.4 in June, 2.2 in July-September, and 1.5 in October. The $K_{cb}$ of Campbell Early showed a seasonal variation, changing from 0.03 in April to 0.11 in May, 0.31 in June, 0.49 in July-September, and 0.33 in October. The magnitudes and the seasonality of $K_{cb}$ of Kyoho were similar to those of Campbell Early.

Product Recommender Systems using Multi-Model Ensemble Techniques (다중모형조합기법을 이용한 상품추천시스템)

  • Lee, Yeonjeong;Kim, Kyoung-Jae
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.2
    • /
    • pp.39-54
    • /
    • 2013
  • Recent explosive increase of electronic commerce provides many advantageous purchase opportunities to customers. In this situation, customers who do not have enough knowledge about their purchases, may accept product recommendations. Product recommender systems automatically reflect user's preference and provide recommendation list to the users. Thus, product recommender system in online shopping store has been known as one of the most popular tools for one-to-one marketing. However, recommender systems which do not properly reflect user's preference cause user's disappointment and waste of time. In this study, we propose a novel recommender system which uses data mining and multi-model ensemble techniques to enhance the recommendation performance through reflecting the precise user's preference. The research data is collected from the real-world online shopping store, which deals products from famous art galleries and museums in Korea. The data initially contain 5759 transaction data, but finally remain 3167 transaction data after deletion of null data. In this study, we transform the categorical variables into dummy variables and exclude outlier data. The proposed model consists of two steps. The first step predicts customers who have high likelihood to purchase products in the online shopping store. In this step, we first use logistic regression, decision trees, and artificial neural networks to predict customers who have high likelihood to purchase products in each product group. We perform above data mining techniques using SAS E-Miner software. In this study, we partition datasets into two sets as modeling and validation sets for the logistic regression and decision trees. We also partition datasets into three sets as training, test, and validation sets for the artificial neural network model. The validation dataset is equal for the all experiments. Then we composite the results of each predictor using the multi-model ensemble techniques such as bagging and bumping. Bagging is the abbreviation of "Bootstrap Aggregation" and it composite outputs from several machine learning techniques for raising the performance and stability of prediction or classification. This technique is special form of the averaging method. Bumping is the abbreviation of "Bootstrap Umbrella of Model Parameter," and it only considers the model which has the lowest error value. The results show that bumping outperforms bagging and the other predictors except for "Poster" product group. For the "Poster" product group, artificial neural network model performs better than the other models. In the second step, we use the market basket analysis to extract association rules for co-purchased products. We can extract thirty one association rules according to values of Lift, Support, and Confidence measure. We set the minimum transaction frequency to support associations as 5%, maximum number of items in an association as 4, and minimum confidence for rule generation as 10%. This study also excludes the extracted association rules below 1 of lift value. We finally get fifteen association rules by excluding duplicate rules. Among the fifteen association rules, eleven rules contain association between products in "Office Supplies" product group, one rules include the association between "Office Supplies" and "Fashion" product groups, and other three rules contain association between "Office Supplies" and "Home Decoration" product groups. Finally, the proposed product recommender systems provides list of recommendations to the proper customers. We test the usability of the proposed system by using prototype and real-world transaction and profile data. For this end, we construct the prototype system by using the ASP, Java Script and Microsoft Access. In addition, we survey about user satisfaction for the recommended product list from the proposed system and the randomly selected product lists. The participants for the survey are 173 persons who use MSN Messenger, Daum Caf$\acute{e}$, and P2P services. We evaluate the user satisfaction using five-scale Likert measure. This study also performs "Paired Sample T-test" for the results of the survey. The results show that the proposed model outperforms the random selection model with 1% statistical significance level. It means that the users satisfied the recommended product list significantly. The results also show that the proposed system may be useful in real-world online shopping store.

Study on sea fog detection near Korea peninsula by using GMS-5 Satellite Data (GMS-5 위성자료를 이용한 한반도 주변 해무탐지 연구)

  • 윤홍주
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.4 no.4
    • /
    • pp.875-884
    • /
    • 2000
  • Sea fog/stratus is very difficult to detect because of the characteristics of air-sea interaction and locality ,and the scantiness of the observed data from the oceans such as ships or ocean buoys. The aim of our study develops new algorism for sea fog detection by using Geostational Meteorological Satellite-5(GMS-5) and suggests the technics of its continuous detection. In this study, atmospheric synoptic patterns on sea fog day of May, 1999 are classified; cold air advection type(OOUTC, May 10, 1999) and warm air advection type(OOUTC, May 12, 1999), respectively, and we collected two case days in order to analyze variations of water vapor at Osan observation station during May 9-10, 1999.So as to detect daytime sea fog/stratus(OOUTC, May 10, 1999), composite image, visible accumulated histogram method and surface albedo method are used. The characteristic value during day showed A(min) .20% and DA < 10% when visible accumulated histogram method was applied. And the sea fog region which is detected is similar in composite image analysis and surface albedo method. Inland observation which visibility and relative humidity is beneath 1Km and 80%, respectively, at OOUTC, May 10,1999; Poryoung for visble accumulated histogram method and Poryoung, Mokp'o and Kangnung for surface albedo method. In case of nighttime sea fog(18UTC, May 10, 1999), IR accumulated histogram method and Maximum brightness temperature method are used, respectively. Maxium brightness temperature method dectected sea fog better than IR accumulated histogram method with the charateristic value that is T_max < T_max_trs, and then T_max is beneath 700hPa temperature of GDAPS(Global Data Assimilation and Prediction System). Sea fog region which is detected by Maxium brighness temperature method was similar to the result of National Oceanic and Atmosheric Administratio/Advanced Very High Resolution Radiometer (NOAA/AVHRR) DCD(Dual Channel Difference), but usually visibility and relative humidity are not agreed well in inland.

  • PDF

Effect of Green Tea and Saw Dust Contents on Dynamic Modulus of Elasticity of Hybrid Composite Boards and Prediction of Static Bending Strength Performances (이종복합보드의 동적탄성률에 미치는 녹차와 톱밥 배합비율의 영향 및 정적 휨 강도성능의 예측)

  • Park, Han-Min;Lee, Soo-Kyeong;Seok, Ji-Hoon;Choi, Nam-Kyeong;Kwon, Chang-Bae;Heo, Hwang-Sun;Byeon, Hee-Seop;Yang, Jae-Kyung;Kim, Jong-Chul
    • Journal of agriculture & life science
    • /
    • v.46 no.2
    • /
    • pp.9-17
    • /
    • 2012
  • In this study, in addition to the green tea - wood fiber hybrid composite boards of previous researches, to make effective use of saw dust of domestic cypress tree with functionalities and application as interior materials, eco-friendly hybrid composite boards were manufactured from wood fiber, green tea and saw dust of cypress tree. We investigated the effect of the component ratio of saw dust and green tea on dynamic MOE (modulus of elasticity). Dynamic MOE was within 1.41~1.65 GPa, and showed the highest value in wood fiber : green tea : saw dust = 50 : 40 : 10 of the component ratio, and had the lowest value in 50 : 30 : 20 of component ratio. These values were 1.4~1.6 times higher than static bending MOE of wood fiber - saw dust - green tea hybrid composite boards, and were 2.0~2.9 times lower than those of green tea - wood fiber hybrid composite boards reported in the previous researches. From the results of correlation regression analyses between dynamic MOE and static strength performances, a very high correlation coefficients were obtained, therefore it was found that static bending strength performances can be estimated with a high reliability from dynamic MOE.

Evaluation of Moisture and Feed Values for Winter Annual Forage Crops Using Near Infrared Reflectance Spectroscopy (근적외선분광법을 이용한 동계사료작물 풀 사료의 수분함량 및 사료가치 평가)

  • Kim, Ji Hea;Lee, Ki Won;Oh, Mirae;Choi, Ki Choon;Yang, Seung Hak;Kim, Won Ho;Park, Hyung Soo
    • Journal of The Korean Society of Grassland and Forage Science
    • /
    • v.39 no.2
    • /
    • pp.114-120
    • /
    • 2019
  • This study was carried out to explore the accuracy of near infrared spectroscopy(NIRS) for the prediction of moisture content and chemical parameters on winter annual forage crops. A population of 2454 winter annual forages representing a wide range in chemical parameters was used in this study. Samples of forage were scanned at 1nm intervals over the wavelength range 680-2500nm and the optical data was recorded as log 1/Reflectance(log 1/R), which scanned in intact fresh condition. The spectral data were regressed against a range of chemical parameters using partial least squares(PLS) multivariate analysis in conjunction with spectral math treatments to reduced the effect of extraneous noise. The optimum calibrations were selected based on the highest coefficients of determination in cross validation($R^2$) and the lowest standard error of cross-validation(SECV). The results of this study showed that NIRS calibration model to predict the moisture contents and chemical parameters had very high degree of accuracy except for barely. The $R^2$ and SECV for integrated winter annual forages calibration were 0.99(SECV 1.59%) for moisture, 0.89(SECV 1.15%) for acid detergent fiber, 0.86(SECV 1.43%) for neutral detergent fiber, 0.93(SECV 0.61%) for crude protein, 0.90(SECV 0.45%) for crude ash, and 0.82(SECV 3.76%) for relative feed value on a dry matter(%), respectively. Results of this experiment showed the possibility of NIRS method to predict the moisture and chemical composition of winter annual forage for routine analysis method to evaluate the feed value.

Conditional Generative Adversarial Network based Collaborative Filtering Recommendation System (Conditional Generative Adversarial Network(CGAN) 기반 협업 필터링 추천 시스템)

  • Kang, Soyi;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.157-173
    • /
    • 2021
  • With the development of information technology, the amount of available information increases daily. However, having access to so much information makes it difficult for users to easily find the information they seek. Users want a visualized system that reduces information retrieval and learning time, saving them from personally reading and judging all available information. As a result, recommendation systems are an increasingly important technologies that are essential to the business. Collaborative filtering is used in various fields with excellent performance because recommendations are made based on similar user interests and preferences. However, limitations do exist. Sparsity occurs when user-item preference information is insufficient, and is the main limitation of collaborative filtering. The evaluation value of the user item matrix may be distorted by the data depending on the popularity of the product, or there may be new users who have not yet evaluated the value. The lack of historical data to identify consumer preferences is referred to as data sparsity, and various methods have been studied to address these problems. However, most attempts to solve the sparsity problem are not optimal because they can only be applied when additional data such as users' personal information, social networks, or characteristics of items are included. Another problem is that real-world score data are mostly biased to high scores, resulting in severe imbalances. One cause of this imbalance distribution is the purchasing bias, in which only users with high product ratings purchase products, so those with low ratings are less likely to purchase products and thus do not leave negative product reviews. Due to these characteristics, unlike most users' actual preferences, reviews by users who purchase products are more likely to be positive. Therefore, the actual rating data is over-learned in many classes with high incidence due to its biased characteristics, distorting the market. Applying collaborative filtering to these imbalanced data leads to poor recommendation performance due to excessive learning of biased classes. Traditional oversampling techniques to address this problem are likely to cause overfitting because they repeat the same data, which acts as noise in learning, reducing recommendation performance. In addition, pre-processing methods for most existing data imbalance problems are designed and used for binary classes. Binary class imbalance techniques are difficult to apply to multi-class problems because they cannot model multi-class problems, such as objects at cross-class boundaries or objects overlapping multiple classes. To solve this problem, research has been conducted to convert and apply multi-class problems to binary class problems. However, simplification of multi-class problems can cause potential classification errors when combined with the results of classifiers learned from other sub-problems, resulting in loss of important information about relationships beyond the selected items. Therefore, it is necessary to develop more effective methods to address multi-class imbalance problems. We propose a collaborative filtering model using CGAN to generate realistic virtual data to populate the empty user-item matrix. Conditional vector y identify distributions for minority classes and generate data reflecting their characteristics. Collaborative filtering then maximizes the performance of the recommendation system via hyperparameter tuning. This process should improve the accuracy of the model by addressing the sparsity problem of collaborative filtering implementations while mitigating data imbalances arising from real data. Our model has superior recommendation performance over existing oversampling techniques and existing real-world data with data sparsity. SMOTE, Borderline SMOTE, SVM-SMOTE, ADASYN, and GAN were used as comparative models and we demonstrate the highest prediction accuracy on the RMSE and MAE evaluation scales. Through this study, oversampling based on deep learning will be able to further refine the performance of recommendation systems using actual data and be used to build business recommendation systems.

Observation of Ice Gradient in Cheonji, Baekdu Mountain Using Modified U-Net from Landsat -5/-7/-8 Images (Landsat 위성 영상으로부터 Modified U-Net을 이용한 백두산 천지 얼음변화도 관측)

  • Lee, Eu-Ru;Lee, Ha-Seong;Park, Sun-Cheon;Jung, Hyung-Sup
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.6_2
    • /
    • pp.1691-1707
    • /
    • 2022
  • Cheonji Lake, the caldera of Baekdu Mountain, located on the border of the Korean Peninsula and China, alternates between melting and freezing seasonally. There is a magma chamber beneath Cheonji, and variations in the magma chamber cause volcanic antecedents such as changes in the temperature and water pressure of hot spring water. Consequently, there is an abnormal region in Cheonji where ice melts quicker than in other areas, freezes late even during the freezing period, and has a high-temperature water surface. The abnormal area is a discharge region for hot spring water, and its ice gradient may be used to monitor volcanic activity. However, due to geographical, political and spatial issues, periodic observation of abnormal regions of Cheonji is limited. In this study, the degree of ice change in the optimal region was quantified using a Landsat -5/-7/-8 optical satellite image and a Modified U-Net regression model. From January 22, 1985 to December 8, 2020, the Visible and Near Infrared (VNIR) band of 83 Landsat images including anomalous regions was utilized. Using the relative spectral reflectance of water and ice in the VNIR band, unique data were generated for quantitative ice variability monitoring. To preserve as much information as possible from the visible and near-infrared bands, ice gradient was noticed by applying it to U-Net with two encoders, achieving good prediction accuracy with a Root Mean Square Error (RMSE) of 140 and a correlation value of 0.9968. Since the ice change value can be seen with high precision from Landsat images using Modified U-Net in the future may be utilized as one of the methods to monitor Baekdu Mountain's volcanic activity, and a more specific volcano monitoring system can be built.

Prediction of Key Variables Affecting NBA Playoffs Advancement: Focusing on 3 Points and Turnover Features (미국 프로농구(NBA)의 플레이오프 진출에 영향을 미치는 주요 변수 예측: 3점과 턴오버 속성을 중심으로)

  • An, Sehwan;Kim, Youngmin
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.1
    • /
    • pp.263-286
    • /
    • 2022
  • This study acquires NBA statistical information for a total of 32 years from 1990 to 2022 using web crawling, observes variables of interest through exploratory data analysis, and generates related derived variables. Unused variables were removed through a purification process on the input data, and correlation analysis, t-test, and ANOVA were performed on the remaining variables. For the variable of interest, the difference in the mean between the groups that advanced to the playoffs and did not advance to the playoffs was tested, and then to compensate for this, the average difference between the three groups (higher/middle/lower) based on ranking was reconfirmed. Of the input data, only this year's season data was used as a test set, and 5-fold cross-validation was performed by dividing the training set and the validation set for model training. The overfitting problem was solved by comparing the cross-validation result and the final analysis result using the test set to confirm that there was no difference in the performance matrix. Because the quality level of the raw data is high and the statistical assumptions are satisfied, most of the models showed good results despite the small data set. This study not only predicts NBA game results or classifies whether or not to advance to the playoffs using machine learning, but also examines whether the variables of interest are included in the major variables with high importance by understanding the importance of input attribute. Through the visualization of SHAP value, it was possible to overcome the limitation that could not be interpreted only with the result of feature importance, and to compensate for the lack of consistency in the importance calculation in the process of entering/removing variables. It was found that a number of variables related to three points and errors classified as subjects of interest in this study were included in the major variables affecting advancing to the playoffs in the NBA. Although this study is similar in that it includes topics such as match results, playoffs, and championship predictions, which have been dealt with in the existing sports data analysis field, and comparatively analyzed several machine learning models for analysis, there is a difference in that the interest features are set in advance and statistically verified, so that it is compared with the machine learning analysis result. Also, it was differentiated from existing studies by presenting explanatory visualization results using SHAP, one of the XAI models.