Search | Korea Science

표본의 대표성, 비편향성 그리고 효율성

김규성
- Proceedings of the Korean Statistical Society Conference
- /
- 2004.11a
- /
- pp.149-154
- /
- 2004
이 논문에서는 표본조사에서 자주 사용되는 표본의 대표성, 비편향성, 그리고 효율성에 개넘에 대하여 고찰하였다. 표본의 대표성은 조사단위의 포함확률로 표현되며 조사모집단의 포함범위와 연관이 있는 반면, 비편향성과 효율성은 표집설계와 추정량에 관련된 개념이다. 비편향성과 효율성은 표본의 대표성을 전제로 하며 가중치 부여로 나타난다
PDF

Graph Learning System for Analyzing Bias among News Using Keyword Distance Model (주제어 문장거리를 이용한 뉴스 편향성 분석 그래프 학습)

Cho Chanwoo;Cho Chanhyung
- Annual Conference on Human and Language Technology
- /
- 2023.10a
- /
- pp.533-538
- /
- 2023
문서에서 저자의 의도와 주제, 그 안에 포함된 감성을 분석하는 것은 자연어 연구의 핵심적인 주제이다. 이와 유사하게 특정 글에 포함된 정치적 문화적 편향을 분석하는 것 역시 매우 의미 있는 연구주제이다. 우리는 최근 발생한 한 사건에 대하여 여러 신문사와 해당 신문사에서 생산한 기사를 중심으로 해당 글의 정치적 편향을 정량화 하는 방법을 제시한다. 그 방법은 선택된 주제어들의 문장 공간에서의 거리를 중심으로 그래프를 생성하고, 생성된 그래프의 기계학습을 통하여 편향과 특징을 분석하였다. 그리고 그 그래프들의 시간적 변화를 추적하여 특정 신문사에서 특정 사건에 대한 입장이 시간적으로 어떻게 변화하였는지를 동적으로 보여주는 그래프 애니메이션 시스템을 개발하였다. 실험을 위하여 최근 이슈에 대하여 12개의 신문사에서 약 2000여 개의 기사를 수집하였다. 그 결과, 약 82%의 정확도로 일반적으로 알려진 정치적 편향을 예측할 수 있었다. 또한, 학습 데이터에 쓰이지 않은 신문기사를 활용하여도 같은 정도의 정확도를 보임을 알 수 있었다. 우리는 이를 통하여 신문기사에서의 정치적 편향은 작성자나 신문사의 특성이 아니라 주제어들의 문장 공간에서의 거리 관계로 특성화할 수 있음을 보였다. 할 수 있다.
PDF

지배적 피드백 루프에 대한 인지적 편향

김병관;김동환
- Proceedings of the Korean System Dynamics Society
- /
- 2000.07a
- /
- pp.135-152
- /
- 2000
지배적 피드백 루프는 구조가 시스템의 행동을 유발한다는 점에 있어서 매우 중요한 개념이다. 본 논문에서는 지배적 피드백 루프의 전환을 완만한 전환(continuous shifts)과 급격한 전환(discrete shifts)의 두 가지로 분류하였다. 본 연구에서는 지배적 피드백 루프의 전환에 대한 인지적 편향을 세 가지의 가설로 분류하여 제시하였다. 이에는 1) 완만한 전환에 대한 인식의 실패, 2) 의사결정 자들의 급격한 전환에 의존하는 경향, 3) 지배적 피드백 루프의 인식에 있어서 수준변수와 변화율 변수간의 차이 등이 포함된다. 마지막으로 본 논문에서는 지배적 피드백 루프에 의한 인지적 편향이 의사결정과정의 시간지연과 정책 개입의 시기에 대하여 어떠한 시사점을 주는지에 관하여 논의하였다.

Measurement of Political Polarization in Korean Language Model by Quantitative Indicator (한국어 언어 모델의 정치 편향성 검증 및 정량적 지표 제안)

Jeongwook Kim;Gyeongmin Kim;Imatitikua Danielle Aiyanyo;Heuiseok Lim
- Annual Conference on Human and Language Technology
- /
- 2022.10a
- /
- pp.16-21
- /
- 2022
사전학습 말뭉치는 위키백과 문서 뿐만 아니라 인터넷 커뮤니티의 텍스트 데이터를 포함한다. 이는 언어적 관념 및 사회적 편향된 정보를 포함하므로 사전학습된 언어 모델과 파인튜닝한 언어 모델은 편향성을 내포한다. 이에 따라 언어 모델의 중립성을 평가할 수 있는 지표의 필요성이 대두되었으나, 아직까지 언어 인공지능 모델의 정치적 중립성에 대해 정량적으로 평가할 수 있는 척도는 존재하지 않는다. 본 연구에서는 언어 모델의 정치적 편향도를 정량적으로 평가할 수 있는 지표를 제시하고 한국어 언어 모델에 대해 평가를 수행한다. 실험 결과, 위키피디아로 학습된 언어 모델이 가장 정치 중립적인 경향성을 나타내었고, 뉴스 댓글과 소셜 리뷰 데이터로 학습된 언어 모델의 경우 정치 보수적, 그리고 뉴스 기사를 기반으로 학습된 언어 모델에서 정치 진보적인 경향성을 나타냈다. 또한, 본 논문에서 제안하는 평가 방법의 안정성 검증은 각 언어 모델의 정치적 편향 평가 결과가 일관됨을 입증한다.
PDF

The Study on the impact of optimistic bias and control illusion in COVID 19 Preventive Behavior (COVID 19 방역행동에 있어서 낙관적 편견과 통제성 편향의 영향에 관한 연구)

Jeong, Hyeonju
- Journal of the Korea Convergence Society
- /
- v.13 no.2
- /
- pp.223-233
- /
- 2022
In addition to optimistic bias which can be a biased phenomenon in perceived susceptibility, including illusion of control which is a distorted phenomenon, the current study attempted to demonstrate the influential relationship between these two important variables and COVID 19 personal preventive behaviors and social distancing practice. Conducting Survey utilizing online pannel from Macromill Embrain, the present study performed regression analysis, setting personal preventive behavioral variables such as mask wearing, hand washing, using hand sanitizer as independent variable, and analyzed how these independent variables influence control illusion and optimistic bias. As a result, COVID 19 personal preventive behavior didn't have direct effect on optimistic bias and control illusion except for hand washing. Finding, also, showed that control illusion affected optimistic bias, and the relation between these variables was different depending on demographic variable such as gender and age.
https://doi.org/10.15207/JKCS.2022.13.02.223 인용 PDF KSCI

Approximate Variance of Least Square Estimators for Regression Coefficient under Inclusion Probability Proportional to Size Sampling (포함확률비례추출에서 회귀계수 최소제곱추정량의 근사분산)

Kim, Kyu-Seong
- Communications for Statistical Applications and Methods
- /
- v.19 no.1
- /
- pp.23-32
- /
- 2012
This paper deals with the bias and variance of regression coefficient estimators in a finite population. We derive approximate formulas for the bias, variance and mean square error of two estimators when we select a fixed-size inclusion probability proportional to the size sample and then estimate regression coefficients by the ordinary least square estimator as well as the weighted least square estimator based on the selected sample data. Necessary and sufficient conditions for the comparison of the two estimators in terms of variance and mean square error are suggested. In addition, a simple example is introduced to numerically compare the variance and mean square error of the two estimators.
https://doi.org/10.5351/CKSS.2012.19.1.023 인용 PDF KSCI

Representative of Sample and Efficiency of Estimation (표본의 대표성과 추정의 효율성)

Kim, Kyu-Seong
- Survey Research
- /
- v.6 no.1
- /
- pp.39-62
- /
- 2005
In this paper we investigate some concepts frequently called in sample surveys such as 'representative of sample' as well as 'consistency', 'unbiasedness', and 'efficiency' in estimation. The first is strongly related with sampling procedure including coverage rate of survey population, response rate in establishment survey, and recruit rate of final samples. The others, however, are concerned with both sampling design and corresponding estimators simultaneously. Whereas both consistency and unbiasedness are based on the representative sample, efficiency does not depend on the representative sample. The representative of sample can be increased by raising the rate of coverage, response and recruit as well. Consistency may be investigated according to variables of interest and auxiliary variables. The well-known raing-ratio weighting method is a method to increase consistency of auxiliary variables by means of matching population size in each cell. Efficiency is not directly related with the representative of sample, and allocation methods such as proportional and Neyman allocation in stratified sampling and post-stratification are all methods to increase the efficiency of estimation under the condition of satisfying the representative of sample.
PDF

Systematic Forecasting Bias of Exit Poll: Analysis of Exit Poll for 2010 Local Elections (출구조사의 체계적인 예측 편향에 대한 분석: 2010년 지방선거 출구조사를 중심으로)

Kim, Young-Won;Choi, Yun-Jung
- Survey Research
- /
- v.12 no.3
- /
- pp.25-48
- /
- 2011
In this paper, we overview the sample design, sampling error, non-response rate and prediction errors of the exit poll conducted for 2010 local elections and discusses how to detect a prediction bias in exit poll. To investigate the bias problem in exit poll in regional(Si-Do) level, we analyze exit poll data for 2007 presidential election and 2006 local elections as well as 2010 local elections in Korea. The measure of predictive accuracy A proposed by Martin et al.(2005) is used to assess the exit poll bias. The empirical studies based on three exit polls clearly show that there exits systematic bias in exit poll and the predictive bias of candidates affiliated to conservative party (such as Hannara-Dang) is serious in the specific regions. The result of this study on systematic bias will be very useful to improving the exit poll methodology in Korea.
PDF

Weighting Effect on the Weighted Mean in Finite Population (유한모집단에서 가중평균에 포함된 가중치의 효과)

Kim, Kyu-Seong
- Survey Research
- /
- v.7 no.2
- /
- pp.53-69
- /
- 2006
Weights can be made and imposed in both sample design stage and analysis stage in a sample survey. While in design stage weights are related with sample data acquisition quantities such as sample selection probability and response rate, in analysis stage weights are connected with external quantities, for instance population quantities and some auxiliary information. The final weight is the product of all weights in both stage. In the present paper, we focus on the weight in analysis stage and investigate the effect of such weights imposed on the weighted mean when estimating the population mean. We consider a finite population with a pair of fixed survey value and weight in each unit, and suppose equal selection probability designs. Under the condition we derive the formulas of the bias as well as mean square error of the weighted mean and show that the weighted mean is biased and the direction and amount of the bias can be explained by the correlation between survey variate and weight: if the correlation coefficient is positive, then the weighted mein over-estimates the population mean, on the other hand, if negative, then under-estimates. Also the magnitude of bias is getting larger when the correlation coefficient is getting greater. In addition to theoretical derivation about the weighted mean, we conduct a simulation study to show quantities of the bias and mean square errors numerically. In the simulation, nine weights having correlation coefficient with survey variate from -0.2 to 0.6 are generated and four sample sizes from 100 to 400 are considered and then biases and mean square errors are calculated in each case. As a result, in the case or 400 sample size and 0.55 correlation coefficient, the amount or squared bias of the weighted mean occupies up to 82% among mean square error, which says the weighted mean might be biased very seriously in some cases.
PDF

Coverage Rates for Households by Landline Telephone Frames in Korea (국내 유선 전화조사에서 표본추출틀의 포함률)

Hong, Sung-Joon;Park, So-Hyung;Kim, Sun-Woong
- Survey Research
- /
- v.10 no.1
- /
- pp.33-56
- /
- 2009
Landline telephone surveys of the population of households or individuals in Korea often use telephone directories as sampling frames. Recently, the frequency of unlisted numbers in the directories has been increased and the number of households without landline phones has become larger with a spread of mobile phones. Landline telephone coverage has currently reached to a level that raises concerns about the currently due to a coverage bias on the statistics in question. In this paper, we first present the distribution of telephone ownership in Korea and make a comparison with some selected countries. Second, we describe the characteristics of telephone directories. Next, we directly or indirectly estimate the telephone coverage rates of the frames, and show that it may nationally be lower than 65.6% based on additional information. We conclude with remarks about future studies to reduce coverage bias, including the developments of efficient random digit dialing sampling methods.
PDF

Search Result 98, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)