• Title/Summary/Keyword: 표본추출방법

Search Result 609, Processing Time 0.024 seconds

Estimation of Population Mean Using Modified Systematic Sampling and Least Squares Method (변형된 계통추출과 최소제곱법을 이용한 모평균 추정)

  • 김혁주
    • The Korean Journal of Applied Statistics
    • /
    • v.17 no.1
    • /
    • pp.105-117
    • /
    • 2004
  • In this paper, a new method is developed for estimating the mean of a population which has a linear trend. This method involves drawing a sample by the modified systematic sampling, and then estimating the population mean with an adjusted estimator, not with the sample mean itself. We use the method of least squares in determining the adjusted estimator. The proposed method is shown to be more and more efficient as the linear trend becomes stronger. It turns out to be relatively efficient as compared with the conventional methods if $\sigma$$^2$the variance of the random error term in the infinite superpopulation model, is not very large.

Face Recognition using Dimension Reduction Features based on Partial Least Squares (부분 최소제곱법 기반한 차원 축소 특징을 이용한 얼굴 인식)

  • Lee, Chang-Beom;Kim, Do-Hyang;Park, Hyuk-Ro;Baek, Jangsun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2004.05a
    • /
    • pp.745-748
    • /
    • 2004
  • 얼굴 이미지의 대부분은 표본의 수보다 특징 변수의 수가 많기 때문에 이러한 점을 고려한 특징 추출 방법이 필요하다. 본 논문에서는 부분 최소제곱법을 이용하여 특징 벡터의 차원을 축소하는 방법을 제안한다. 전통적인 차원 축소 방법인 주성분 분석은 클래스의 정보를 고려하지 않고 최대 변이를 가지는 성분을 추출하기 때문에, 클래스의 구분에 필요한 특징을 필수적으로 추출하지 못한다. 이에 비해, 부분 최소제곱법은 클래스 변수에 대한 정보를 포함하여 성분을 추출한다. 그러므로, 분류를 하는데 있어서는 주성분 분석에 의해 추출된 성분보다는 부분 최소제곱법에 의해 추출된 성분이 보다 더 예측적이다. 맨체스터와 ORL 얼굴 데이터베이스를 이용하여 실험한 결과, 분류와 차원 축소 측면에서 주성분 분석 방법보다는 부분 최소제곱법을 이용한 방법이 그 성능이 우수함을 알 수 있었다.

  • PDF

Development of Sample Survey Design for the Industrial Research and Development Statistics (표본조사에 의한 기업 연구개발활동 통계 작성방안)

  • Cho, Seong-Pyo;Park, Sun-Young;Han, Ki-In;Noh, Min-Sun
    • Journal of Technology Innovation
    • /
    • v.17 no.2
    • /
    • pp.1-23
    • /
    • 2009
  • The Survey on the Industrial Research and Development(R&D) is the primary source of information on R&D performed by Korea industrial sector. The results of the survey are used to assess trends in R&D expenditures. Government agencies, corporations, and research organizations use the data to investigate productivity determinants, formulate tax policy, and compare individual company performance with industry averages. Recently, Korea Industrial Technology Association(KOITA) has collected the data by complete enumeration. Koita has, currently, considered sample survey because the number of R&D institutions in industry has been dramatically increased. This study develops survey design for the industrial research and development(R&D) statistics by introducing a sample survey. Companies are divided into 8 groups according to the amount of R&D expenditures and firm size or type. We collect the sample from 24 or 8 sampling strata and compare the results with those of complete enumeration survey. The estimates from 24 sampling strata are not significantly different to the results of complete enumeration survey. We propose the survey design as follows: Companies are divided into 11 groups including the companies of which R&D expenditures are unknown. All large companies are included in the survey and medium and small companies are sampled from 70% and 3%. Simple random sampling (SRS) is applied to the small company partition since they show uniform distribution in R&D expenditures. The independent probability proportionate to size (PPS) sampling procedure may be applied to those companies identified as 'not R&D performers'. When respondents do not provide the requested information, estimates for the missing data are made using imputation algorithms. In the future study, new key variables should be developed in survey questionnaires.

  • PDF

A study on Link Travel Time Estimating Methodology for Traffic Information Service (Determination of an Adequate Sample Size) (교통정보제공을 위한 구간통행시간 산출 방법론 연구 (적정표본수 결정방법을 중심으로))

  • 이영인;이정희
    • Journal of Korean Society of Transportation
    • /
    • v.20 no.3
    • /
    • pp.55-67
    • /
    • 2002
  • 구간검지체계를 기반으로 한 첨단교통정보제공시스템(Advanced Traveler Information Systems)은 그 기능 수행시 다음의 중요 고려사항을 지닌다. 첫째는 제공 정보의 신뢰성이며, 둘째는 정보수집비용에 관련한 수집자료수의 한계이다. 본 논문에서는 이러한 한계성 극복을 위해 보다 대표성 있는 교통정보 형태의 설정 및 통계적으로 신뢰성 있는 정보산출을 위해 요구되는 적정표본수의 결정에 대한 연구를 수행하였다. 도시고속도로(올림픽대로)와 도시간선도로(천호대로)의 실측 구간통행시간분포 분석결과 단일교차로 구간의 경우 다른 구간들의 단일봉(unimodal)의 정규분포형태와는 다른 두 개의 봉우리를 지닌 분포형태(bimodal)가 나타났다. 따라서 이러한 구간은 기존과는 다른 새로운 교통정보 형태가 필요하며, 본 논문에서는 모든 통과차량들의 평균통행시간으로 정의되는 한 개의 대표치가 아닌 신호주기에 의한 정지여부에 따라 분리되는 주행시간과 지체시간 또는 주행속도와 통행속도 개념의 세분화된 정보형태를 설정하였다. 또한 중심극한정리를 기초로 한 통계적인 표본수 결정식을 이용하여 설정된 신뢰수준 하에서의 정보산출을 위해 요구되는 적정 표본수를 산출하였다. 그 결과, 교통이 혼잡할수록 요구되는 표본수는 적어지는 것으로 나타났다. 우선 적정 표본수 만큼의 표본추출을 하고 제안된 정보산출 방법에 의해 교통정보를 산출한 후 실측치와의 오차를 비교하였다. 그 결과 산출된 교통정보는 신뢰수준 95%와 허용오차 5㎞/h를 만족하였다. 다음으로 구간검지체계를 이용하여 정보를 산출하는 타시스템 교통정보와의 오차율을 비교하였다. 그 결과, 실측치와 본 연구의 산출방법에 의한 교통정보, 로티스교통정보 및 차량번호판 인식시스템의 교통정보와의 비교 결과 제안된 교통정보형태의 타당성을 볼 수 있었다.

A Study on the Efficiency of the BLS Nonresponse Adjustment According to the Correlation and Sample Size (상관관계와 표본 크기에 따른 BLS 무응답 보정의 효율성 비교)

  • Kim, Seok;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.6
    • /
    • pp.1301-1313
    • /
    • 2009
  • Efficiency and sensitivity of BLS adjustment method have been studied and the method is known to provide more accurate estimate of total by using properly adjusted weights of samples. However, BLS methods provide different efficiencies according to the magnitudes of correlation coefficients and the sizes of samples in strata. In this paper we study the efficiency of the BLS adjustment according to the sample sizes and correlations in strata. For this study, 2007 monthly labor survey data is used.

A Recognition of Electric Pole and Wire on Power Distribution Facility Map (배전설비도면의 전주 및 전선 인식)

  • 이봉재;김계영;한칠성;조선구
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2000.10b
    • /
    • pp.446-448
    • /
    • 2000
  • 본 논문에서는 배전설비도면의 주요 기호인 전주와 전선인식 방법에 관하여 기술한다. 본 논문에서는 원형성에 근거하여 전주후보를 추출한 후 이들 사이의 연결성에 근거하여 전선을 인식한 다음, 전주후보들 중에서 전주를 확인함으로서 전주와 전선을 인식하는 방법을 제안한다. 제안된 방법은 한국전력공사의 배전설비도면들 중에서 무작위로 추출한 표본 약 30매를 대상으로 실험하고 그 결과를 제시한다.

  • PDF

Korean women wage analysis using selection models (표본 선택 모형을 이용한 국내 여성 임금 데이터 분석)

  • Jeong, Mi Ryang;Kim, Mijeong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.5
    • /
    • pp.1077-1085
    • /
    • 2017
  • In this study, we have found the major factors which affect Korean women's wage analysing the data provided by 2015 Korea Labor Panel Survey (KLIPS). In general, wage data is difficult to analyze because random sampling is infeasible. Heckman sample selection model is the most widely used method for analysing the data with sample selection. Heckman proposed two kinds of selection models: the one is the model with maximum likelihood method and the other is the Heckman two stage model. Heckman two stage model is known to be robust to the normal assumption of bivariate error terms. Recently, Marchenko and Genton (2012) proposed the Heckman selectiont model which generalizes the Heckman two stage model and concluded that Heckman selection-t model is more robust to the error assumptions. Employing the two models, we carried out the analysis of the data and we compared those results.

Mean Estimation in Two-phase Sampling (이중추출에서 모평균 추정)

  • 김규성;김진석;이선순
    • The Korean Journal of Applied Statistics
    • /
    • v.14 no.1
    • /
    • pp.13-24
    • /
    • 2001
  • In this paper, we investigated mean estimation methods in two-phase sampling. Under the fixed expected cost we reviewed the optimal sample sizes, minimum variances and approximate unbiased variance estimators for usual ratio estimator, stratified sample mean with proportional allocation and Rao's allocation of the second phase sample. Also we proposed combined ratio estimator, which uses both ratio estimation and stratification and derived optimal sample size, minimum variance and unbiased variance estimator. Through a limited simulation study, we compared estimators by design effects and came to know that ratio estimator is more efficient than stratified sample mean in some cases and inefficient in the other cases, but combined ratio estimator is more efficient than others in most cases.

  • PDF

The Investigate of Human Strength Demand of Information Electrical the Kind of Occupation (정보전기 직종의 인력 수요에 대한 고찰)

  • Kim, Soo-Yong;Lee, Seung-Ho
    • Journal of Engineering Education Research
    • /
    • v.11 no.4
    • /
    • pp.58-63
    • /
    • 2008
  • This thesis investigated way of employment, education course of a training school of electrical company. And student more than. Faced a human power demand in an education demand and a field rehearsal student demand and and analyzed it. The sample extraction used industrial classification, work of scale, Assignment sample extraction way (quota Sampling). All data called at a silver phone and, and the investigated, The data parser analyzed the statistics that used Microsoft Excel.

A Study on Estimation of Vehicle Miles Traveled (자동차주행거리 추정방안 연구)

  • Ahn, Won-Chul;Park, Dong-Joo;Heo, Tae-Young;Yeon, Ji-Youn;Kim, Chan-Sung
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.13 no.6
    • /
    • pp.64-76
    • /
    • 2014
  • This study identified the causes of errors that could take place in the estimation process of vehicle miles traveled and quantified the effects of each of those causes on the estimation accuracy of vehicle miles traveled via error rate to propose an efficient way to estimate vehicle miles traveled. The study proceeded as follows: first, the study established survey data of vehicle miles traveled in the pilot test areas to test the accuracy of a method to estimate vehicle miles traveled. Second, the causes of errors with the estimation of vehicle miles traveled were categorized into errors with the sample size, sampling methods, and homogeneous link setting methods. In addition, many different methodologies were set to minimize errors with the estimation of vehicle miles traveled according to each of the causes. Third, error rates of estimation of vehicle miles traveled were compared and analyzed according to each of the methodologies. Finally, a toy network was established to propose a way of estimating vehicle miles traveled by taking the local characteristics into consideration. The study finds its significance in that it proposed an efficient way to estimate vehicle miles traveled through an experiment and planning approach and made use of survey data of vehicle miles traveled to test estimation accuracy. The proposed way of estimating vehicle miles traveled by taking into account the local characteristics will make a contribution to the estimation of vehicle miles traveled by the areas in future along with the level of data offered in the study.