• Title/Summary/Keyword: 표본이질성

Search Result 26, Processing Time 0.028 seconds

The Effects of Household Characteristics and Poverty Duration on Poverty Exit Rate -Examining the Effects of Duration Dependency and Sample Heterogeneity - (가구특성과 빈곤지속기간이 빈곤탈피율에 미치는 영향 -지속기간의존성과 표본이질성에 대한 검증을 포함하여-)

  • Kim, Hwanjoon
    • Korean Journal of Social Welfare Studies
    • /
    • v.44 no.3
    • /
    • pp.301-322
    • /
    • 2013
  • By analyzing wave 1~11 (1998~2008) of Korean Labor and Income Panel Study(KLIPS) database, this study examines the effects of household characteristics and poverty duration on poverty exit. A special concern is to decide whether the decrease of poverty exit rates comes from true duration dependency or from the sample heterogeneity as poverty duration progresses. I also analyzed how the effects of independent variables are changed when unobserved heterogeneity is controlled. The results show that duration dependency disappears after controlling observed household characteristics and unobserved individual heterogeneity. This finding confirms that the apparent relationship between poverty exit rate and poverty duration is in fact a spurious association due to the sample heterogeneity rather than true duration dependency. In addition, the effects of household characteristics on poverty exit rate become more stronger when unobserved heterogeneity is controlled. Socioeconomic factors affecting poverty exit rates are such as householders' age, education, household composition, number of family members, labor force participation, and work status.

Statistical methods for testing tumor heterogeneity (종양 이질성을 검정을 위한 통계적 방법론 연구)

  • Lee, Dong Neuck;Lim, Changwon
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.3
    • /
    • pp.331-348
    • /
    • 2019
  • Understanding the tumor heterogeneity due to differences in the growth pattern of metastatic tumors and rate of change is important for understanding the sensitivity of tumor cells to drugs and finding appropriate therapies. It is often possible to test for differences in population means using t-test or ANOVA when the group of N samples is distinct. However, these statistical methods can not be used unless the groups are distinguished as the data covered in this paper. Statistical methods have been studied to test heterogeneity between samples. The minimum combination t-test method is one of them. In this paper, we propose a maximum combinatorial t-test method that takes into account combinations that bisect data at different ratios. Also we propose a method based on the idea that examining the heterogeneity of a sample is equivalent to testing whether the number of optimal clusters is one in the cluster analysis. We verified that the proposed methods, maximum combination t-test method and gap statistic, have better type-I error and power than the previously proposed method based on simulation study and obtained the results through real data analysis.

A minimum combination t-test method for testing differences in population means based on a group of samples of size one (크기가 1인 표본들로 구성된 집단에 기반한 모평균의 차이를 검정하기 위한 최소 조합 t-검정 방법)

  • Heo, Miyoung;Lim, Changwon
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.2
    • /
    • pp.301-309
    • /
    • 2017
  • It is often possible to test for differences in population means when two or more samples are extracted from each N population. However, it is not possible to test for the mean difference if one sample is extracted from each population since a sample mean does not exist. But, by dividing a group of samples extracted one by one into two groups and generating a sample mean, we can identify a heterogeneity that may exist within the group by comparing the differences of the groups' mean. Therefore, we propose a minimum combination t-test method that can test the mean difference by the number of combinations that can be divided into two groups. In this paper, we proposed a method to test differences between means to check heterogeneity in a group of extracted samples. We verified the performance of the method by simulation study and obtained the results through real data analysis.

Using Mixed Logit Model and Latent Class Model to Analyze Preference Heterogeneity in Choice Experiment Data (선택실험법 자료에서의 선호이질성 분석을 위한 혼합로짓모형 및 잠재계층모형의 활용)

  • Yoo, Byong Kook
    • Environmental and Resource Economics Review
    • /
    • v.21 no.4
    • /
    • pp.921-945
    • /
    • 2012
  • Conditional Logit (CL) model is widely used since its model estimation and interpretation of results of the model is relatively easy, on the other hand, it has the limit of preference heterogeneity of respondents being not fully considered. In this study we used the two models, Mixed Logit (ML) Model and Latent Class Model (LCM) to explain preference heterogeneity of respondents for protection for Boryeong Dam wetland. As a result of the examination for heterogeneity in Boryeong city and six metropolitan areas, we found there was significant difference between two regions. While there was explicit preference heterogeneity within respondents in Boryeong city, we found little heterogeneity within respondents in six metropolitan areas. Thus in the case of six metropolitan areas, CL model can be used for parameter estimation while in the case of Boryeong city, WTP estimates are based on parameter estimates from ML model to reflect the heterogeneity within respondents. Additionally, ML model with interaction and 2-class LCM for respondents in Boryeong city were used to explain the sources of the heterogeneity. The ML model with interaction has advantage of explaining individual unobserved heterogeneity. However The comarison between these two models reflects the fact that LCM provided added information that was not conveyed in the ML model with interaction. Thus, Preference heterogeneity within respondents in this study may be better explained by class level through LCM rather than indiviual level through ML model.

  • PDF

Investigation of Heterogeneity Measure for Nonstationary Regional Frequency Analysis (비정상성 지역빈도해석을 위한 지역구분에 따른 이질성 척도 검토)

  • Ahn, Hyunjun;Shin, Ju-Young;Jung, Tae-Ho;Heo, Jun-Haeng
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2018.05a
    • /
    • pp.340-340
    • /
    • 2018
  • 전 세계적으로 기후변화로 인해 나타나는 이상기후의 영향을 고려하기 위해서 수문빈도해석분야에서는 비정상성 빈도해석에 관한 연구가 활발히 진행 중이다. 자료의 비정상성을 고려하여 빈도해석을 수행하는 방법은 다양하게 연구되어오고 있는데, 그중 시간에 따른 자료의 변화를 고려할 수 있도록 기존 모형의 매개변수에 시간을 고려할 수 있는 변수를 더하여 모형을 구축하는 기법이 비정상성 빈도해석기법으로 널리 활용되고 있다. 한편, 이러한 비정상성 가정에 관련한 연구들은 주로 지점빈도해석 기법을 중심으로 개발되어왔을 뿐, 아직 지역빈도해석기법을 대상으로 시도된 비정상성 연구는 미비한 실정이다. 지역빈도해석은 수문학적 동질지역이라는 가정을 바탕으로 표본의 확장을 통해 지점빈도해석보다 비교적 안정적인 빈도해석을 수행할 수 있는 기법으로 널리 알려져 있다. 따라서 지역빈도해석에서 수문학적 동질지역의 구분은 지역빈도해석 절차 중 가장 중요한 절차라고 할 수 있다. 이러한 수문학적 동질지역 구분을 위해서는 지점별로 가지고 있는 위치 정보나 수문 자료의 통계값과 같은 해당 지점을 대표할 수 있는 인자들이 필요하다. 본 연구에서는 모의실험을 통해 경향성이 나타나는 가상의 지점 자료를 생성한 뒤, 지역구분을 통해 자료의 비정상성이 나타나는 지역의 지역구분 결과를 살펴보고 이질성 척도(heterogenity measure)를 산정하였다. 이를 바탕으로 비정상성 지역빈도해석에서 이질성 척도의 적용성을 검토하고자 한다. 본 연구의 결과는 추후 기후변화의 영향이 나타나는 수문학적 동질 및 비 동질지역의 분석 및 비정상성 지역빈도해석을 위한 기초자료로 활용될 것으로 기대된다.

  • PDF

Testing Independence in Contingency Tables with Clustered Data (집락자료의 분할표에서 독립성검정)

  • 정광모;이현영
    • The Korean Journal of Applied Statistics
    • /
    • v.17 no.2
    • /
    • pp.337-346
    • /
    • 2004
  • The Pearson chi-square goodness-of-fit test and the likelihood ratio tests are usually used for testing independence in two-way contingency tables under random sampling. But both of these tests may provide false results for the contingency table with clustered observations. In this case we consider the generalized linear mixed model which includes random effects of clustering in addition to the fixed effects of covariates. Both the heterogeneity between clusters and the dependency within a cluster can be explained via generalized linear mixed model. In this paper we introduce several types of generalized linear mixed model for testing independence in contingency tables with clustered observations. We also discuss the fitting of these models through a real dataset.

Application of Population Index Flood Model for Regional Frequency Analysis (지역빈도해석을 위한 모홍수지수모형의 적용)

  • Kim, Hanbeen;Joo, Kyungwon;Kim, Taereem;Heo, Jun-Haeng
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2018.05a
    • /
    • pp.299-299
    • /
    • 2018
  • 지역빈도해석은 수문관측자료의 보유기간이 짧은 지점 또는 미계측 지점에 대하여 보다 정확하며 신뢰할 수 있는 설계수문량을 산정하기 위해 널리 사용되고 있는 방법이다. 지역빈도해석에서 사용되는 가장 대표적인 모형인 홍수지수모형 (index flood model)은 각 지점의 표본평균을 홍수지수로 정의하고 이를 이용하여 설계수문량을 산정하는 방법이다. 모홍수지수모형 (population index flood model)은 표본평균을 홍수지수로 사용함으로써 발생하는 설계수문량의 왜곡과 오차를 극복하기 위해 제안된 방법으로 홍수지수를 미지의 모분포로 가정한 후 설계수문량을 산정한다. 본 연구에서는 모홍수지수모형을 국내 강우관측자료에 적용하여 지역빈도해석을 수행하고자 한다. 먼저, 이질성척도(heterogeneity)를 통해 지역동질성이 확인된 지역에 대하여 GEV 분포형을 적용한 비정상성 모홍수지수모형을 적용해 지역빈도해석을 수행하고 확률강우량을 산정하였다. 또한, 기존의 지점빈도해석 및 L-moment 기반의 지역빈도해석 결과와 비교를 통해 모홍수지수모형의 적용성을 확인하였다.

  • PDF

Analysis of Relationship between Housing Tenure and Birth in Newlywed Couples by Using Panel Data (패널자료를 이용한 신혼가구의 주택점유형태와 출산 관계 연구)

  • Shin, Hyungsub
    • Land and Housing Review
    • /
    • v.13 no.3
    • /
    • pp.39-55
    • /
    • 2022
  • In this study, we investigate the interrelationship between housing tenure and childbirth by exploiting the correlation probability effect method that accounts for household heterogeneity. Using the newlywed household panel from 2011 to 2022, we find that home ownership has a positive impact on childbirth in newlyweds. Specifically, newlywed households with housing tenure show a 6.2%p higher birth rate and a 5.7%p higher second childbirth than newlywed households living in rented houses. For the case of first childbirth, we employ the probability effect probit model since the endogeneity was not detected between housing tenure and birth rate. We document the differential effects of housing tenure on childbirth in that the first childbirth rate is higher for households without housing tenures. The negative effects on first childbirth could be attributed to the economic burden due to initial housing ownership, while housing tenure could eventually provide housing stability, leading to positive effects on more than one childbirth. Finally, we identify that households with childbirth over the last year show a 4.2%p and 3.9%p lower probabilities of housing tenure in the total sample and second childbirth sample, respectively. This suggests that the increased living cost due to childbirth could delay home ownership.

거시경제변수(巨視經濟變數)와 주가(株價) - 한국주식시장(韓國株式市場)에서의 실증분석 -

  • Jeong, Gi-Ung
    • The Korean Journal of Financial Management
    • /
    • v.8 no.2
    • /
    • pp.111-129
    • /
    • 1991
  • 본 논문에서는 재정가격결정모형(裁定價格決定模型)(Arbitrage Pricing Model)을 기초로 우리나라 주식시장에 영향을 주는 거시경제변수가 무엇인가를 찾고자 하였다. 방법론면에서는 과거변수(過去變數)(lagged variables)에 의해서만 기대치를 형성시키는 AIRMA(Autoregressike Integrated with Moving Average) 방법을 이용하기보다는 마코프속성(屬性)(Markov Property)을 갖는 상태공간모형(狀態空間模型) (State Space Model)을 이용하여 보다 합리적인 거시경제 요인의 이노베이션을 하였다. 또한 단순한 요인분석(要因分析)(factor analysis)에 의한 요인추출은 요인의 표본의존성(標本依存性)(Sample dependency)이 심하므로 그룹간 요인분석(inter-battery factor analysis)을 행하여 추정(推定)된 요인(要因)(요인값 : factor score)과 요인수를 결정하여 관련 거시경제변수를 선택한다. 그룹간 요인분석을 위한 그룹을 형성할 때 그룹내에서는 동질성을 그룹간에는 이질성을 최대한 살리는 것이 필요한데, 이를 위해 군집분석(群集分析)(Cluster Analysis)을 사용한 것이 특징이다. 결론적으로 우리나라 주식시장에 영향을 미치는 거시경제요인(巨視經濟要因)으로 단위노동비율, 제조업제품재고지수, 채권프리미엄, 수출물가지수, 정부부문 통화공급, 회사채수익률, 종합주가지수 등 7가지가 있는 것으로 분석되고 있다.

  • PDF

The Workflow for Computational Analysis of Single-cell RNA-sequencing Data (단일 세포 RNA 시퀀싱 데이터에 대한 컴퓨터 분석의 작업과정)

  • Sung-Hun WOO;Byung Chul JUNG
    • Korean Journal of Clinical Laboratory Science
    • /
    • v.56 no.1
    • /
    • pp.10-20
    • /
    • 2024
  • RNA-sequencing (RNA-seq) is a technique used for providing global patterns of transcriptomes in samples. However, it can only provide the average gene expression across cells and does not address the heterogeneity within the samples. The advances in single-cell RNA sequencing (scRNA-seq) technology have revolutionized our understanding of heterogeneity and the dynamics of gene expression at the single-cell level. For example, scRNA-seq allows us to identify the cell types in complex tissues, which can provide information regarding the alteration of the cell population by perturbations, such as genetic modification. Since its initial introduction, scRNA-seq has rapidly become popular, leading to the development of a huge number of bioinformatic tools. However, the analysis of the big dataset generated from scRNA-seq requires a general understanding of the preprocessing of the dataset and a variety of analytical techniques. Here, we present an overview of the workflow involved in analyzing the scRNA-seq dataset. First, we describe the preprocessing of the dataset, including quality control, normalization, and dimensionality reduction. Then, we introduce the downstream analysis provided with the most commonly used computational packages. This review aims to provide a workflow guideline for new researchers interested in this field.