• Title/Summary/Keyword: Sample Selection Model

Search Result 197, Processing Time 0.027 seconds

An Analysis of Relationship between the Level of Satisfaction of Domestic Products and Purchase Intention of Imported Organic Products (국내산 친환경농산물 만족도와 수입산 유기농산물 구입의향 관계 분석)

  • Han, Jae-Hwan;Jeong, Hak-Kyun
    • Korean Journal of Organic Agriculture
    • /
    • v.29 no.2
    • /
    • pp.159-171
    • /
    • 2021
  • The purpose of this paper is to analyze the relationship between the level of satisfaction of domestic Environment-friendly agricultural products and purchase intent of imported organic products. To accomplish the objective of the study a consumer survey was administered for quantitative analysis regarding consumption patterns. The bivariate probit with sample selection model was employed for empirical analysis on the relationship. The estimation results showed that to increase continuously the consumption, it is necessary to improve the quality satisfaction compared to the price, and that it is also necessary to increase the reliability of the certification system and the awareness that the consumption is helpful for health promotion to increase the quality satisfaction compared to price. In addition, it was concluded that in order to induce the purchase of domestic organic products rather than imported organic products, efforts to improve the safety of domestic products, remove the risk of residual pesticides, and increase the reliability of domestic products compared to imported products are needed. Therefore, to reduce the proportion of purchases of imported organic products and increase the consumption of domestic products, raising awareness that the consumption is conducive to health promotion, enhancing the safety of domestic products, and providing accurate information on the safety of imported products are required.

Development of Calibration Model for Firmness Evaluation of Apple Fruit using Near-infrared Reflectance Spectroscopy (사과 경도의 비파괴측정을 위한 검량식 개발 및 정확도 향상을 위한 연구)

  • 손미령;조래광
    • Food Science and Preservation
    • /
    • v.6 no.1
    • /
    • pp.29-36
    • /
    • 1999
  • Using Fuji apple fruits cultivated in Kyungpook prefecture, the calibration model for firmness evaluation of fruits by near infrared(NIR) reflectance spectroscopy was developed, and the various influence factors such as instrument variety, measuring method, sample group, apple peel and selection of firmness point were investigated. Spectra of sample were recorded in wavelength range of 1100∼2500nm using NIR spectrometer (InfraAlyzer 500), and data were analyzed by stepwise multiple linear regression of IDAS program. The accuracy of calibration model was the highest when using sample group with wide range, and the firmness mean values obtained in graph by texture analyser(TA) were used as standard data. Chemometrics models were developed using a calibration set of 324 samples and an independent validation set of 216 samples to evaluate the predictive ability of the models. The correlation coefficients and standard error of prediction were 0.84 and 0.094kg, respectively. Using developed calibration model, it was possible to monitor the firmness change of fruits during storage frequently. Time, which was reached to firmness high value in graph by TA, is possible to use as new parameter for freshness of fruit surface during storage.

  • PDF

The Effect of Crop Diversification on Agricultural Income (작목다각화가 농업소득에 미치는 영향)

  • Choi, Do Hyeong;Choi, Eunji;Lee, Seong Woo
    • Journal of Korean Society of Rural Planning
    • /
    • v.27 no.4
    • /
    • pp.1-12
    • /
    • 2021
  • The purpose of this study is to analyze the effect of crop diversification on farm households' agricultural income. Abundant literature have explored the determinants and efficient strategies for crop diversification. Yet, there is a paucity of research studies that empirically test the effectiveness of crop diversification as a profitable farm management strategy. Utilizing the 2015 Agricultural Census, this study adopts a quasi-experimental research design to compare the outcomes between farm households that opted for crop diversification and farm households that did not engage in such a strategy. In doing so, this study applies the Heckman Selection Model and the decomposition technique to address the problem of selection bias and to identify the causal effect. Our empirical results show that farms that implement diversification are more likely to earn higher agricultural income than non-diversified farms, although the difference would not be much substantial. This study concludes with several policy proposals to stabilize agricultural income in conjunction with crop diversification.

Effect of Experimental Layout on Model Selection under Variance Components Models: A Simulation Study (분산성분모형에서 요인의 배치구조가 모형선택법에 미치는 영향에 대한 실험연구)

  • Lee, Yonghee
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.5
    • /
    • pp.1035-1046
    • /
    • 2015
  • Variance components models incorporate various random factors in the form of linear models. There are two experimental Layouts for the classification of factors under variance components models: nested classification and crossed classification. We consider two-way variance components models and investigate the effect of experimental Layout on the performance of model selection criteria AIC and BIC. The effect of experimental Layout is studied through a simulation study with various combinations of parameters in a systematic fashion. The simulation study shows differences in performance of model selection methods between the two classification. There is a particular tendency to prefer the smaller model than the true model when the variance component of a nested factor becomes relatively larger than a nesting factor that is persistent even when the sample size is not small.

The Determinants of R&D and Product Innovation Pattern in High-Technology Industry and Low-Technology Industry: A Hurdle Model and Heckman Sample Selection Model Approach (고기술산업과 저기술산업의 제품혁신패턴 및 연구개발 결정요인 분석: Hurdle 모형과 Heckman 표본선택모형을 중심으로)

  • Lee, Yunha;Kang, Seung-Gyu;Park, Jaemin
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.20 no.10
    • /
    • pp.76-91
    • /
    • 2019
  • There have been many studies to examine the patterns in innovations reflecting industry-specific characteristics from an evolutionary economics perspective. The purpose of this study is to identify industry-specific differences in product innovation patterns and determinants of innovation performance. For this, Korean manufacturing is classified into high-tech industries and low-tech industries according to technology intensity. It is also pointed out that existing research does not reflect the decision-making process of firms' R&D implementations. In order to solve this problem, this study presents a Heckman sample selection model and a double-hurdle model as alternatives, and analyzes data from 1,637 firms in the 2014 Survey on Technology of SMEs. As a result, it was confirmed that the determinants and patterns of manufacturing in small and medium-size enterprise (SME) product innovation are significantly different between high-tech and low-tech industries. Also, through an extended empirical model, we found that there exist a sample selection bias and a hurdle-like threshold in the decision-making process. In this study, the industry-specific features and patterns of product innovation are examined from a multi-sided perspective, and it is meaningful that the decision-making process for manufacturing SMEs' R&D performance can be better understood.

Default Prediction for Real Estate Companies with Imbalanced Dataset

  • Dong, Yuan-Xiang;Xiao, Zhi;Xiao, Xue
    • Journal of Information Processing Systems
    • /
    • v.10 no.2
    • /
    • pp.314-333
    • /
    • 2014
  • When analyzing default predictions in real estate companies, the number of non-defaulted cases always greatly exceeds the defaulted ones, which creates the two-class imbalance problem. This lowers the ability of prediction models to distinguish the default sample. In order to avoid this sample selection bias and to improve the prediction model, this paper applies a minority sample generation approach to create new minority samples. The logistic regression, support vector machine (SVM) classification, and neural network (NN) classification use an imbalanced dataset. They were used as benchmarks with a single prediction model that used a balanced dataset corrected by the minority samples generation approach. Instead of using prediction-oriented tests and the overall accuracy, the true positive rate (TPR), the true negative rate (TNR), G-mean, and F-score are used to measure the performance of default prediction models for imbalanced dataset. In this paper, we describe an empirical experiment that used a sampling of 14 default and 315 non-default listed real estate companies in China and report that most results using single prediction models with a balanced dataset generated better results than an imbalanced dataset.

The Development of a Tool for Selection of LAN Switch with QoS (QoS를 고려한 LAN 스위치 선정 도구 개발)

  • Lee, Phil-Jai;Lee, Jong-Moo;Shin, In-Chul
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.10
    • /
    • pp.2533-2543
    • /
    • 1997
  • It is necessary to understand and apply the concept of Quality of Service(QoS) for the objective selection among the computer network equipment. Because ITU-T E.800 recommendation covers the service quality of provider's viewpoint and the satisfaction of user, it can be used to evaluate and select the product of computer network systems. This paper is concerned with the development of an evaluation model using QoS and software tool for selection of the most suitable LAN switch. We apply the Analytic Hierarchy Process(AHP) method of Saaty which has been in a multiple criteria framework for an effective group decision process to the selection of LAN switch. The sample data are collected and processed from a questionnaire of professionals in the network field. And we implement a prototype tool for the selection of LAN switch according to the suggested selection model and analyse the result. The result of our research is expected to be a useful tool for decision making to evaluate and select the LAN switch and also can be applied to the decision making of evaluation and selection related to the product of computer network with QoS.

  • PDF

A Comparative Study on Prediction Performance of the Bankruptcy Prediction Models for General Contractors in Korea Construction Industry

  • Seung-Kyu Yoo;Jae-Kyu Choi;Ju-Hyung Kim;Jae-Jun Kim
    • International conference on construction engineering and project management
    • /
    • 2011.02a
    • /
    • pp.432-438
    • /
    • 2011
  • The purpose of the present thesis is to develop bankruptcy prediction models capable of being applied to the Korean construction industry and to deduce an optimal model through comparative evaluation of final developed models. A study population was selected as general contractors in the Korean construction industry. In order to ease the sample securing and reliability of data, it was limited to general contractors receiving external audit from the government. The study samples are divided into a bankrupt company group and a non-bankrupt company group. The bankruptcy, insolvency, declaration of insolvency, workout and corporate reorganization were used as selection criteria of a bankrupt company. A company that is not included in the selection criteria of the bankrupt company group was selected as a non-bankrupt company. Accordingly, the study sample is composed of a total of 112 samples and is composed of 48 bankrupt companies and 64 non-bankrupt companies. A financial ratio was used as early predictors for development of an estimation model. A total of 90 financial ratios were used and were divided into growth, profitability, productivity and added value. The MDA (Multivariate Discriminant Analysis) model and BLRA (Binary Logistic Regression Analysis) model were used for development of bankruptcy prediction models. The MDA model is an analysis method often used in the past bankruptcy prediction literature, and the BLRA is an analysis method capable of avoiding equal variance assumption. The stepwise (MDA) and forward stepwise method (BLRA) were used for selection of predictor variables in case of model construction. Twenty two variables were finally used in MDA and BLRA models according to timing of bankruptcy. The ROC-Curve Analysis and Classification Analysis were used for analysis of prediction performance of estimation models. The correct classification rate of an individual bankruptcy prediction model is as follows: 1) one year ago before the event of bankruptcy (MDA: 83.04%, BLRA: 93.75%); 2) two years ago before the event of bankruptcy (MDA: 77.68%, BLRA: 78.57%); 3) 3 years ago before the event of bankruptcy (MDA: 84.82%, BLRA: 91.96%). The AUC (Area Under Curve) of an individual bankruptcy prediction model is as follows. : 1) one year ago before the event of bankruptcy (MDA: 0.933, BLRA: 0.978); 2) two years ago before the event of bankruptcy (MDA: 0.852, BLRA: 0.875); 3) 3 years ago before the event of bankruptcy (MDA: 0.938, BLRA: 0.975). As a result of the present research, accuracy of the BLRA model is higher than the MDA model and its prediction performance is improved.

  • PDF

Finding Biomarker Genes for Type 2 Diabetes Mellitus using Chi-2 Feature Selection Method and Logistic Regression Supervised Learning Algorithm

  • Alshamlan, Hala M
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.2
    • /
    • pp.9-13
    • /
    • 2021
  • Type 2 diabetes mellitus (T2D) is a complex diabetes disease that is caused by high blood sugar, insulin resistance, and a relative lack of insulin. Many studies are trying to predict variant genes that causes this disease by using a sample disease model. In this paper we predict diabetic and normal persons by using fisher score feature selection, chi-2 feature selection and Logistic Regression supervised learning algorithm with best accuracy of 90.23%.

Classification Prediction Error Estimation System of Microarray for a Comparison of Resampling Methods Based on Multi-Layer Perceptron (다층퍼셉트론 기반 리 샘플링 방법 비교를 위한 마이크로어레이 분류 예측 에러 추정 시스템)

  • Park, Su-Young;Jeong, Chai-Yeoung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.14 no.2
    • /
    • pp.534-539
    • /
    • 2010
  • In genomic studies, thousands of features are collected on relatively few samples. One of the goals of these studies is to build classifiers to predict the outcome of future observations. There are three inherent steps to build classifiers: a significant gene selection, model selection and prediction assessment. In the paper, with a focus on prediction assessment, we normalize microarray data with quantile-normalization methods that adjust quartile of all slide equally and then design a system comparing several methods to estimate 'true' prediction error of a prediction model in the presence of feature selection and compare and analyze a prediction error of them. LOOCV generally performs very well with small MSE and bias, the split sample method and 2-fold CV perform with small sample size very pooly. For computationally burdensome analyses, 10-fold CV may be preferable to LOOCV.