• Title/Summary/Keyword: sample selection bias

Search Result 68, Processing Time 0.027 seconds

An application and development of an activity lesson guessing a population ratio by sampling with replacement in 'Closed box' ('닫힌 상자'에서의 복원추출에 의한 모비율 추측 활동수업 개발 및 적용)

  • Lee, Gi Don
    • The Mathematical Education
    • /
    • v.57 no.4
    • /
    • pp.413-431
    • /
    • 2018
  • In this study, I developed an activity oriented lesson to support the understanding of probabilistic and quantitative estimating population ratios according to the standard statistical principles and discussed its implications in didactical respects. The developed activity lesson, as an efficient physical simulation activity by sampling with replacement, simulates unknown populations and real problem situations through completely closed 'Closed Box' in which we can not see nor take out the inside balls, and provides teaching and learning devices which highlight the representativeness of sample ratios and the sampling variability. I applied this activity lesson to the gifted students who did not learn estimating population ratios and collected the research data such as the activity sheets and recording and transcribing data of students' presenting, and analyzed them by Qualitative Content Analysis. As a result of an application, this activity lesson was effective in recognizing and reflecting on the representativeness of sample ratios and recognizing the random sampling variability. On the other hand, in order to show the sampling variability clearer, I discussed appropriately increasing the total number of the inside balls put in 'Closed Box' and the active involvement of the teachers to make students pay attention to controlling possible selection bias in sampling processes.

Exports and Firm Innovation (수출이 기업혁신에 미치는 영향)

  • Yim, Jeong-Dae
    • Korea Trade Review
    • /
    • v.44 no.3
    • /
    • pp.227-252
    • /
    • 2019
  • This study explores the effects of exports on the innovation of Korean firms listed on two Korean stock markets, the Korean Stock Exchange and the Korean Securities Dealers Quotations, between 1999 and 2016. By matching exporting firms to non-exporting ones with propensity score matching, this study accounts for a problem from sample selection bias that may arise from differences in firm-characteristics between the two groups. From the study results, first, both export participation and export volume significantly increase subsequent innovation performance, as measured by the number of patent applications. This result seems to support the "learning by exporting" hypothesis for Korean listed firms. Second, both export participation and export volume narrow innovation scope, proxied as the number of unique International Patent Classification (IPC) codes of the patent applied, the degree to which patents are concentrated in a particular class, and the degree of proximity in the patents. The findings of innovation scope suggest a possible explanation that the learning effect appears in familiar technology fields that firms have previously held, rather than in unfamiliar ones. Third, these results are robust using alternative proxies in the innovation scope, Tobit regressions to consider the non-trivial portion of sample firms with patent applications equal to zeros, and generalized method of moments (GMM) to control for the persistence of innovation measures hearing over years. Finally, the two main results are more pronounced in large firms than in small and medium-sized ones. As for Chaebol firms, however, these results do not appear.

ASSESSING CALIBRATION ROBUSTNESS FOR INTACT FRUIT

  • Guthrie, John A.;Walsh, Kerry B.
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • 2001.06a
    • /
    • pp.1154-1154
    • /
    • 2001
  • Near infra-red (NIR) spectroscopy has been used for the non-invasive assessment of intact fruit for eating quality attributes such as total soluble solids (TSS) content. However, little information is available in the literature with respect to the robustness of such calibration models validated against independent populations (however, see Peiris et al. 1998 and Guthrie et al. 1998). Many studies report ‘prediction’ statistics in which the calibration and prediction sets are subsets of the same population (e. g. a three year calibration validated against a set from the same population, Peiris et al. 1998; calibration and validation subsets of the same initial population, Guthrie and Walsh 1997 and McGlone and Kawano 1998). In this study, a calibration was developed across 84 melon fruit (R$^2$= 0.86$^{\circ}$Brix, SECV = 0.38$^{\circ}$Brix), which predicted well on fruit excluded from the calibration set but taken from the same population (n = 24, SEP = 0.38$^{\circ}$Brix with 0.1$^{\circ}$Brix bias), relative to an independent group (same variety and farm but different harvest date) (n = 24, SEP= 0.66$^{\circ}$ Brix with 0.1$^{\circ}$Brix bias). Prediction on a different variety, different growing district and time was worse (n = 24, SEP = 1.2$^{\circ}$Brix with 0.9$^{\circ}$Brix bias). Using an ‘in-line’ unit based on a silicon diode array spectrometer, as described in Walsh et al. (2000), we collected spectra from fruit populations covering different varieties, growing districts and time. The calibration procedure was optimized in terms of spectral window, derivative function and scatter correction. Performance of a calibration across new populations of fruit (different varieties, growing districts and harvest date) is reported. Various calibration sample selection techniques (primarily based on Mahalanobis distances), were trialled to structure the calibration population to improve robustness of prediction on independent sets. Optimization of calibration population structure (using the ISI protocols of neighbourhood and global distances) resulted in the elimination of over 50% of the initial data set. The use of the ISI Local Calibration routine was also investigated.

  • PDF

The Determinants of R&D and Product Innovation Pattern in High-Technology Industry and Low-Technology Industry: A Hurdle Model and Heckman Sample Selection Model Approach (고기술산업과 저기술산업의 제품혁신패턴 및 연구개발 결정요인 분석: Hurdle 모형과 Heckman 표본선택모형을 중심으로)

  • Lee, Yunha;Kang, Seung-Gyu;Park, Jaemin
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.20 no.10
    • /
    • pp.76-91
    • /
    • 2019
  • There have been many studies to examine the patterns in innovations reflecting industry-specific characteristics from an evolutionary economics perspective. The purpose of this study is to identify industry-specific differences in product innovation patterns and determinants of innovation performance. For this, Korean manufacturing is classified into high-tech industries and low-tech industries according to technology intensity. It is also pointed out that existing research does not reflect the decision-making process of firms' R&D implementations. In order to solve this problem, this study presents a Heckman sample selection model and a double-hurdle model as alternatives, and analyzes data from 1,637 firms in the 2014 Survey on Technology of SMEs. As a result, it was confirmed that the determinants and patterns of manufacturing in small and medium-size enterprise (SME) product innovation are significantly different between high-tech and low-tech industries. Also, through an extended empirical model, we found that there exist a sample selection bias and a hurdle-like threshold in the decision-making process. In this study, the industry-specific features and patterns of product innovation are examined from a multi-sided perspective, and it is meaningful that the decision-making process for manufacturing SMEs' R&D performance can be better understood.

Impact of Digital Divide on Online Political Participation: With Focus on the Gap of Operational Skills of Digital Device Users (온라인 정치참여에서 디지털 정보격차의 영향: 디지털 기기 이용자의 기기 운용 기술 격차를 중심으로)

  • Jang, Changki;Sung, WookJoon
    • Informatization Policy
    • /
    • v.27 no.1
    • /
    • pp.36-54
    • /
    • 2020
  • This study empirically analyzes the impact of digital divide between digital device usage motivation and operational skills on online political participation. The analysis was performed using the National Information Society Agency's 2018 digital divide survey data from September to December 2018 and applying the Heckman selection model to control the sample selection bias that may occur between internet users and non-users. The result shows the gap in motivation and device operational skills of individual citizens using digital devices has significant impact on online political participation. In socio-economic terms, it shows the age, education level and regional factors also have significant impact on online political participation, while gender and income levels do not. This study holds significance in that there are different patterns of digital divide between digital devices, identifying the motivation to use a digital device as an important factor for mobile device users, and the device operational skills, for personal computer users.

한 인구학도의 회고

  • 김택일
    • Korea journal of population studies
    • /
    • v.11 no.1
    • /
    • pp.1-13
    • /
    • 1988
  • This study examines the sampling bias that may have resulted from the large number of missing observations. Despite well-designed and reliable sampling procedures, the observed sample values in DSFH(Demographic Survey on Changes in Family and Household Structure, Japan) included many missing observations. The head administerd survey method of DSFH resulted in a large number of missing observations regarding characteristics of elderly non-head parents and their children. In addition, the response probability of a particular item in DSFH significantly differs by characteristics of elderly parents and their children. Furthermore, missing observations of many items occurred simultaneously. This complex pattern of missing observations critically limits the ability to produce an unbiased analysis. First, the large number of missing observations is likely to cause a misleading estimate of the standard error. Even worse, the possible dependency of missing observations on their latent values is likely to produce biased estimates of covariates. Two models are employed to solve the possible inference biases. First, EM algorithm is used to infer the missing values based on the knowledge of the association between the observed values and other covariates. Second, a selection model was employed given the suspicion that the probability of missing observations of proximity depends on its unobserved outcome.

  • PDF

Reference values for pulp oxygen saturation as a diagnostic tool in endodontics: a systematic review and meta-analysis

  • Paula Lambert;Sergio Augusto Quevedo Miguens Jr;Caroline Solda;Juliana Tomaz Sganzerla;Leandro Azambuja Reichert;Carlos Estrela;Fernando Branco Barletta
    • Restorative Dentistry and Endodontics
    • /
    • v.45 no.4
    • /
    • pp.48.1-48.11
    • /
    • 2020
  • Objectives: This systematic review aimed to identify mean oxygen saturation values (SpO2) using pulse oximetry in permanent maxillary anterior teeth. Materials and Methods: The MEDLINE, Scientific Electronic Library Online, Cochrane Central Register of Controlled Trials, EMBASE, and Literatura Latino Americana em Ciências da Saúde electronic databases were searched. Combinations and variations of "oximetry" AND "dental pulp test" were used as search terms. Studies reporting means and standard deviations of SpO2 values were included. Two reviewers independently extracted data following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses checklist. Heterogeneity was assessed using the I2 statistic, and all analyses were performed using R software. Study quality was assessed using the Quality Assessment of Diagnostic Accuracy Studies-2 tool and the Newcastle-Ottawa scale. Results: Of the 251 studies identified, 19 met the eligibility criteria and were included (total sample, 4,541 teeth). In the meta-analysis, the mean SpO2 values were 84.94% (95% confidence interval [CI], 84.85%-85.04%) for the central incisors, 89.29% (95% CI, 89.22%-89.35%) for the lateral incisors, and 89.20% (95% CI, 89.05%-89.34%) for the canines. The studies were predominantly low-quality due to the high risk of bias associated with the index test, unclear risk regarding patient selection, and concerns about outcome assessment. Conclusions: Although most studies were low-quality, the oxygen saturation levels in normal pulp could be established (minimum saturation, 77.52%). Despite the risk of bias of the included studies, the reference values reported herein are clinically relevant for assessments of changes in pulp status.

Detecting Protest Responses (지불거부응답의 판별)

  • OH, Hyungna
    • KDI Journal of Economic Policy
    • /
    • v.34 no.1
    • /
    • pp.135-168
    • /
    • 2012
  • This study analyzes ways to detect protest responses (hereafter, PR zero-bid) in the contingent valuation method (CVM). In order to distinguish PR zero-bids from true zero-bids (non-PR zero bids), this study adopts the concept of the implicit willingness to pay employing the Hicksian compensating surplus and the Taylor's 1st order approximation. When a respondent proposes a zero-bid (i.e., WTP=0) and chooses a PR filtering item to indicate that her implicit WTP is not necessary zero, her response is identified as a PR zero bid. PR filtering items falling into the PR zero bids category include the uncertainty of information, distrust in the government and project achievement, disagreement to project plans, discontent with the fairness of public works and their payment method and animosity against the CVM itself. The empirical analysis shows that PR zero bids take place systematically in particular respondent groups: respondents who have never used similar facilities before nor plans to use the facility provided by the public project, the employed, and low income groups. In conclusion, the study suggests that a CVM questionnaire needs to be designed carefully to minimize problems associated with PR zero bids and the potential risks of having sample selection bias should be concerned.

  • PDF

Consumers' Subjective Risk Perceptions of Tab Water and Stated Preferences for Safe Drinking Water (소비자들의 수돗물에 대한 주관적 위험인지와 안전한 음용수에 대한 진술선호 분석)

  • Eom, Young Sook
    • Environmental and Resource Economics Review
    • /
    • v.15 no.2
    • /
    • pp.147-175
    • /
    • 2006
  • This paper attempts to incorporate three important factors-perceptions, behavior and valuation-in analysing consumers' responses to health risks from environmental pollutants. Using a survey sample of 500 consumers in the Chonbuk province area, this paper empirically investigated determinants of risk perceptions from using tap water as drinking water. Most consumers were considerably concerned about health risks from drinking tap water. Moreover, those subjective concerns were not random, but were systematically related to individuals' demographic variables such as age, gender, and family size. Those subjective beliefs also influenced respondents' purchase intentions on safer water bottles, in response to a contingent behavior question of presenting two types of water bottles. The technical risk information provided in the survey had significant effects on purchase intentions only when it was interacted with respondents' actual averting practice. In addition, the sample selection effects were present by eliminating respondents who decided not to purchase either of two types of water bottles. The potential selection bias had impacts on the coefficients of the price difference variable, and subsequently the estimates of the price increments for health risk reductions.

  • PDF

The Effect of Obesity on Medical Costs and Health Service Uses (비만이 의료비와 의료이용에 미친 영향 분석)

  • Kim, Da-Yang;Kwak, Jin-Mi;Choi, So-Young;Lee, Kwang-Soo
    • The Korean Journal of Health Service Management
    • /
    • v.11 no.3
    • /
    • pp.65-78
    • /
    • 2017
  • Objectives : Obesity is a worldwide health concern due to an increasing obese population. This study proposed to analyze the differences in medical costs and care utilization between obese and normal group using propensity score matching. Methods : Data were collected from the sample cohort database by the Korea National Health Insurance Corporation. Propensity score matching(PSM) was applied to control selection bias, and factors affecting obesity were used as covariates in PSM. Results : The results showed higher medical costs and care utilization in the obese group than the normal group. According to gender and medical type, there were differences in the relationships between obesity and medical charges and utilization. In particular, the differences in the female population were larger in both outpatients and inpatients than the male population. Conclusions : It is important to manage obesity, because obesity has a negative effect on national health insurance costs. These findings suggest directions for future research.