• Title/Summary/Keyword: two populations (두 모집단)

A New Sample Design for the Fishery Household Economy Survey (어가경제조사를 위한 새로운 표본설계)

  • Ryu, Je-Bok;Kim, Yeong-Won;Park, Jin-U
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2002.11a
    • /
    • pp.35-42
    • /
    • 2002
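  • In this study, we designed a sample for the Fishery Household Economy Survey, taking the fishery households enumerated in the 2000 Fishery Census as the population. All fishery households were divided into subpopulation 1, which comprises full-time and type-1 part-time fishery households, and subpopulation 2, which consists of type-2 part-time fishery households. In the new sample design, we determined the optimal cluster size and, for stratification, used the decision tree model provided in SAS Enterprise Miner. The sample was allocated to strata by Neyman allocation, and two estimation methods were presented.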
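The Neyman allocation mentioned in this abstract can be sketched as follows. This is a minimal illustration, not the authors' procedure: the function name `neyman_allocation` and the stratum sizes and standard deviations are hypothetical, and the sketch assumes equal per-unit sampling cost in every stratum. Each stratum h receives a share of the total sample proportional to N_h * S_h, where N_h is the stratum size and S_h the stratum standard deviation.

```python
def neyman_allocation(total_n, stratum_sizes, stratum_sds):
    """Split total_n across strata in proportion to N_h * S_h.

    Returns unrounded allocations; in practice they are rounded
    and capped at the stratum sizes.
    """
    if len(stratum_sizes) != len(stratum_sds):
        raise ValueError("stratum_sizes and stratum_sds must have equal length")
    products = [n_h * s_h for n_h, s_h in zip(stratum_sizes, stratum_sds)]
    total = sum(products)
    if total <= 0:
        raise ValueError("sum of N_h * S_h must be positive")
    return [total_n * p / total for p in products]


# Hypothetical strata (illustrative numbers, not taken from the paper).
sizes = [1000, 2000, 500]   # N_h
sds = [10.0, 5.0, 40.0]     # S_h
allocation = neyman_allocation(100, sizes, sds)
print(allocation)
```

In this toy example the small but highly variable third stratum receives the largest share, which is the point of Neyman allocation compared with proportional allocation.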

A Case Study of Basic Data Science Education using Public Big Data Collection and Spreadsheets for Teacher Education (교사교육을 위한 공공 빅데이터 수집 및 스프레드시트 활용 기초 데이터과학 교육 사례 연구)

  • Hur, Kyeong
    • Journal of The Korean Association of Information Education
    • /
    • v.25 no.3
    • /
    • pp.459-469
    • /
    • 2021
  • This paper presents a case study of basic data science practice education for field teachers and pre-service teachers. Spreadsheet software was used as the data collection and analysis tool. The training then covered statistics for data processing, predictive hypotheses, and predictive model verification. In addition, an educational case was proposed for collecting and processing thousands of public big data records and verifying a population prediction hypothesis and prediction model. A 34-hour, 17-week curriculum using a spreadsheet tool was presented with the contents of this basic data science education. As a tool for data collection, processing, and analysis, spreadsheets, unlike Python, do not carry the burden of learning programming languages and data structures, and they have the advantage of letting students learn the theory of processing and analyzing qualitative and quantitative data visually. As a result of this educational case study, three predictive hypothesis test cases were presented and analyzed. First, quantitative public data were collected to verify a hypothesis predicting a difference in mean values between groups of the population. Second, qualitative public data were collected to verify a hypothesis predicting an association within the qualitative data of the population. Third, quantitative public data were collected to verify a regression prediction model based on a hypothesis predicting a correlation within the quantitative data of the population. Finally, the effectiveness of this education case in data science education was analyzed through a satisfaction analysis of pre-service and field teachers.

Approximate Variance of Least Square Estimators for Regression Coefficient under Inclusion Probability Proportional to Size Sampling (포함확률비례추출에서 회귀계수 최소제곱추정량의 근사분산)

  • Kim, Kyu-Seong
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.1
    • /
    • pp.23-32
    • /
    • 2012
  • This paper deals with the bias and variance of regression coefficient estimators in a finite population. We derive approximate formulas for the bias, variance, and mean square error of two estimators when a fixed-size sample is selected with inclusion probability proportional to size and the regression coefficients are then estimated from the selected sample by the ordinary least squares estimator and by the weighted least squares estimator. Necessary and sufficient conditions for comparing the two estimators in terms of variance and mean square error are suggested. In addition, a simple example is introduced to compare the variance and mean square error of the two estimators numerically.
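The two estimators compared in this abstract can be illustrated with a small sketch for a single regressor. The abstract does not state the weights used in the weighted estimator; the code below assumes the common survey-sampling choice of weights equal to the inverse inclusion probabilities, and the function names and data are hypothetical.

```python
def weighted_line_fit(x, y, w):
    """Fit y = intercept + slope * x by weighted least squares with weights w."""
    sw = sum(w)
    x_bar = sum(wi * xi for wi, xi in zip(w, x)) / sw
    y_bar = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxy = sum(wi * (xi - x_bar) * (yi - y_bar) for wi, xi, yi in zip(w, x, y))
    sxx = sum(wi * (xi - x_bar) ** 2 for wi, xi in zip(w, x))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def ols_fit(x, y):
    """Ordinary least squares: every sampled unit gets weight 1."""
    return weighted_line_fit(x, y, [1.0] * len(x))


def wls_fit(x, y, inclusion_probs):
    """Weighted least squares with weights 1 / pi_i (an assumption of this sketch)."""
    return weighted_line_fit(x, y, [1.0 / p for p in inclusion_probs])


# Hypothetical sample: larger units were more likely to be selected.
x = [1.0, 2.0, 3.0, 4.0]
y = [2.9, 5.2, 6.8, 9.5]
pi = [0.1, 0.2, 0.3, 0.4]
print("OLS slope, intercept:", ols_fit(x, y))
print("WLS slope, intercept:", wls_fit(x, y, pi))
```

When the data lie exactly on a line, the two fits coincide; they differ only when the residuals interact with the unequal weights, which is the situation the paper's variance and mean square error comparison addresses.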

A minimum combination t-test method for testing differences in population means based on a group of samples of size one (크기가 1인 표본들로 구성된 집단에 기반한 모평균의 차이를 검정하기 위한 최소 조합 t-검정 방법)

  • Heo, Miyoung;Lim, Changwon
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.2
    • /
    • pp.301-309
    • /
    • 2017
  • Differences in population means can usually be tested when two or more samples are drawn from each of N populations. However, the mean difference cannot be tested if only one sample is drawn from each population, since a sample mean does not exist. By dividing a group of samples drawn one at a time into two groups and computing a sample mean for each, however, we can identify heterogeneity that may exist within the group by comparing the group means. We therefore propose a minimum combination t-test method that tests the mean difference using the combinations by which the group can be divided into two groups. The method tests differences between means to check for heterogeneity in a group of extracted samples. We verified the performance of the method by a simulation study and obtained results through a real data analysis.

A Stratified and Two Sample Stratified Conditional Unrelated Question Model (층화 및 층화 이표본 조건부 무관질문모형)

  • Lee, Gi-Sung
    • Journal of the Korean Data Analysis Society
    • /
    • v.20 no.6
    • /
    • pp.2883-2893
    • /
    • 2018
  • We suggest a stratified conditional unrelated question randomized response model to estimate a sensitive character A more efficiently when the population is composed of several strata. In this model, only the respondents who answered "yes" through a randomization device consisting of a less sensitive character B and a question forcing a "yes" answer respond to the suggested model, and we deal with two allocation problems, proportional allocation and optimal allocation. We extend the suggested model to a two-sample stratified conditional unrelated question model to cover the case where the unrelated character is unknown, and derive the minimal variance through the optimal sample size of stratum h. Finally, we show that the suggested model is more efficient than stratified unrelated question models and the stratified model of Carr et al. (1982) under some given conditions, and show numerically that the smaller the values of $\pi_{h2}$ and $\pi_{hy}$, the more efficient the suggested model.
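For orientation, the basic unrelated question estimator on which models of this kind build can be sketched as follows. This is the simple, non-conditional version with a known unrelated proportion, not the conditional model proposed in the paper. It assumes each respondent answers the sensitive question with probability p and the unrelated question otherwise, so that P(yes) = p * pi_A + (1 - p) * pi_B. All names and numbers are hypothetical.

```python
def unrelated_question_estimate(yes_proportion, p_sensitive, pi_unrelated):
    """Estimate pi_A by solving P(yes) = p * pi_A + (1 - p) * pi_B for pi_A."""
    if not 0.0 < p_sensitive <= 1.0:
        raise ValueError("p_sensitive must be in (0, 1]")
    return (yes_proportion - (1.0 - p_sensitive) * pi_unrelated) / p_sensitive


def stratified_estimate(stratum_weights, stratum_estimates):
    """Combine per-stratum estimates with weights W_h = N_h / N."""
    if abs(sum(stratum_weights) - 1.0) > 1e-9:
        raise ValueError("stratum weights must sum to 1")
    return sum(w * e for w, e in zip(stratum_weights, stratum_estimates))


# Hypothetical two-stratum survey (illustrative values only).
p = 0.7                    # probability of being asked the sensitive question
pi_b = 0.5                 # known proportion for the unrelated question
observed_yes = [0.29, 0.36]
per_stratum = [unrelated_question_estimate(lam, p, pi_b) for lam in observed_yes]
print(stratified_estimate([0.6, 0.4], per_stratum))
```

The estimate can fall outside [0, 1] in small samples because it is a linear transformation of the observed "yes" proportion; the sketch leaves it untruncated.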

Study on Levels of Mathematically Gifted Students' Understanding of Statistical Samples through Comparison with Non-Gifted Students (일반학급 학생들과의 비교를 통한 수학영재학급 학생들의 표본 개념 이해 수준 연구)

  • Ko, Eun-Sung;Lee, Kyeong-Hwa
    • Journal of Gifted/Talented Education
    • /
    • v.21 no.2
    • /
    • pp.287-307
    • /
    • 2011
  • The purpose of this study is to investigate the levels of mathematically gifted students' understanding of statistical samples through comparison with non-gifted students. For this purpose, a rubric for understanding of samples was developed based on the students' responses to tasks: no recognition of a part of the population (level 0), consideration of samples as subsets of the population (level 1), consideration of samples as a quasi-proportional, small-scale version of the population (level 2), recognition of the importance of unbiased samples (level 3), and recognition of the effect of random sampling (level 4). Based on the rubric, each student's level of understanding of samples was identified. t-tests were conducted to test for statistically significant differences between mathematically gifted students and non-gifted students. For both elementary and middle school students, the t-tests show a statistically significant difference between mathematically gifted and non-gifted students. The table of frequencies for each level, however, shows that the mathematically gifted students' levels of understanding were not concentrated at the high levels but overlapped with those of the non-gifted students.

A STUDY ON THE RACIAL CLASSIFICATION OF ASIAN CHUM, ONCORHYNCHUS KETA(WALBAUM) BASED ON SCALE CHARACTERISTICS (인상(鱗相)에 의한 아시아계 백연어, Oncorhynchus keta(Walbaum)의 계통판정에 관한 연구)

  • KANG Yong Joo
    • Korean Journal of Fisheries and Aquatic Sciences
    • /
    • v.7 no.2
    • /
    • pp.91-97
    • /
    • 1974
  • Two scale characters, the width and the circuli count of the first-year band, were used in a discriminant function analysis to see how effectively they would separate geographical chum stocks from the western North Pacific. A total of 476 scale samples were taken from spawning adults that ascended rivers of Hokkaido, Japan, in 1956, and of Kamchatka, the U.S.S.R., in 1957. The scale characters were examined for conformity to the statistical requirements of a discriminant function. The examinations verified that the two characters can be used in a discriminant function analysis to classify chum taken on the high seas to their most probable origin. A discriminant function computed from the two characters correctly classified 78.5 percent of the Hokkaido and Kamchatka chum. Of the two characters, the number of circuli alone could classify fish to their origin with nearly the same probability of correct classification as the discriminant function based on both characters.

Sea, wind, or bird: Origin of Fagus multinervis (Fagaceae) inferred from chloroplast DNA sequences (엽록체 염기서열을 통한 너도밤나무(너도밤나무과)의 기원 추론)

  • Oh, Sang-Hun
    • Korean Journal of Plant Taxonomy
    • /
    • v.45 no.3
    • /
    • pp.213-220
    • /
    • 2015
  • To elucidate the origin and patterns of establishment of insular plants on Ulleungdo Island, maternally inherited chloroplast DNA, which is useful for tracing seed movements, was used. Fagus multinervis, an endemic species that dominated broadleaf deciduous forests on Ulleungdo Island, is an excellent model for such a study. To understand the diversity and spatial distribution of the chloroplast haplotypes of F. multinervis, nucleotide sequences of the psbA-trnH region were determined from 144 individuals sampled throughout the island. Results of a phylogenetic analysis of the region with close relatives of F. multinervis suggest that F. multinervis is sister to a clade of F. japonica and F. engleriana. No haplotype variation was found within F. multinervis. This remarkably low cpDNA haplotype diversity is in contrast to the findings of previous allozyme studies of F. multinervis populations that showed high genetic diversity on Ulleungdo Island. Repeated colonization during the early stage of establishment via birds that migrated from a source area where the F. multinervis cpDNA haplotype was geographically structured may have resulted in the observed pattern of haplotype diversity. Alternatively, long-distance dispersal of seeds of the progenitor of F. multinervis via birds or typhoons to Ulleungdo may have been a single event, whereas the immigration of pollen from the mainland likely occurred frequently. Comparative phylogeographic studies of other species endemic to Ulleungdo Island and their close relatives on the neighboring mainland are necessary for a more complete understanding of the evolution of the island's native species.

Fourth Graders Engaged in Sampling: A Case Study (초등학교 4학년 학생들의 표집활동 분석: 사례연구)

  • Park, Min-Sun;Ko, Eun-Sung
    • School Mathematics
    • /
    • v.16 no.3
    • /
    • pp.503-518
    • /
    • 2014
  • This study examines fourth graders engaged in three concrete activities involving sampling from finite populations. The first was a survey of popular foods for school meals. The second had them take samples from a box containing white and black marbles to predict how many white and black marbles were in the box. The final activity required them to predict how many times the Korean letter '가' would appear in a Korean story book. The results show that the participants can experience and notice different ideas related to samples and sampling in different activities. In the first activity, they acknowledged that samples are useful for obtaining information about populations, and that a population survey is difficult and not overly useful. In the second activity, they recognized that samples cannot be identical to their population but that the information from a group of samples is similar to the information of the population. In the last activity, they devised some ideas about random sampling, even though the ideas were immature.

Random Digit Dialing Telephone Survey and Major Findings (RDD 전화조사와 주요결과)

  • Kang, H.C.;Han, S.T.;Kim, J.Y.;Jung, Y.C.;Huh, M.H.
    • Survey Research
    • /
    • v.9 no.1
    • /
    • pp.1-22
    • /
    • 2008
  • Telephone directories are still being used as the sampling frame in almost all fixed-line telephone surveys in Korea, causing potentially serious coverage error. RDD (random digit dialing) sampling is an obvious alternative for solving the problem. The aim of this paper is twofold: 1) to propose an RDD methodology suited to the telephone system of Korea and 2) to identify socio-demographic and socio-psychological differences between listed-number and unlisted-number respondents. Major findings of an RDD telephone survey conducted experimentally are as follows. 1) Population coverage by telephone directories is 60% or less. 2) Unlisted-number households have statistically higher income than listed-number households. 3) Unlisted-number households have smaller family sizes than listed-number households. 4) Unlisted-number respondents are more sensitive about confidentiality and leaks. 5) Unlisted-number respondents are more liberal than listed-number respondents. These facts suggest that directory-based telephone surveys tend to be biased in socio-economic aspects.
