• 제목/요약/키워드: Data Heterogeneity

검색결과 604건 처리시간 0.031초

미국 소득분포의 지역적 수렴에 대한 공간자료 분석(1969∼1999년) - 베타-수렴에 대한 비판적 검토 - (Spatial Data Analysis for the U.S. Regional Income Convergence,1969-1999: A Critical Appraisal of $\beta$-convergence)

  • Sang-Il Lee
    • 대한지리학회지
    • /
    • 제39권2호
    • /
    • pp.212-228
    • /
    • 2004
  • 본 연구는 지역간 소득분포의 수렴/발산의 주요 측면인 베타-수렴을 공간자료분석에 의거하여 비판적으로 검토하고 있다. 베타-수렴에 대한 통상적인 접근법은 두 가지 측면에서 문제점을 갖고 있다. 첫째, 회귀분석 결과 도출되는 잔차의 공간적 자기상관을 고려하지 못한다. 둘째, 베타-수렴의 국지적 변이, 즉 공간적 이질성을 탐색할 어떠한 절차도 제공하지 못한다. 이러한 비판적 검토를 바탕으로, 다양한 공간자료분석 기법들, 즉, 공간적 자기회기 모델(spatial autoregressive models), 이변량 국지통지(bivariate local statistics)를 이용한 탐색적 공간자료분석(ESDA: exploratory spatial data analysis) 기법, 그리고 지리적 가중회귀분석(GWR: geographically weighted regression)을 사용하여 1969-1999년 간의 미국 노동시장지역에 대한 소득 자료를 분석하였다. 주요 결과는 다음과 같다. 첫째, OSL모델을 적용한 결과 베타-수렴은 단지 부분적으로만 드러났고, 베타-수렴 계수도 시기별로 상당한 편차를 보였다. 둘째, 공간적 자기회기 모델의 분석 결과 OLS에 의해 유의한 것으로 나타난 베타-수렴 계수가 99% 신뢰수준에서 유의하지 않은 것으로 드러났다. 셋째, 탐색적 공간자료분석과 지리적 가중회귀분석의 결과는 베타-수렴의 경향에 상당한 정도의 공간적 이질성이 존재한다는 점을 보여주고 있다. 또한 이 공간적 이질성의 양상이 시기별로도 다양하게 드러남이 관찰되었다.

Enhancing LoRA Fine-tuning Performance Using Curriculum Learning

  • Daegeon Kim;Namgyu Kim
    • 한국컴퓨터정보학회논문지
    • /
    • 제29권3호
    • /
    • pp.43-54
    • /
    • 2024
  • 최근 언어모델을 활용하기 위한 연구가 활발히 이루어지며, 큰 규모의 언어모델이 다양한 과제에서 혁신적인 성과를 달성하고 있다. 하지만 실제 현장은 거대 언어모델 활용에 필요한 자원과 비용이 한정적이라는 한계를 접하면서, 최근에는 주어진 자원 내에서 모델을 효과적으로 활용할 수 있는 방법에 주목하고 있다. 대표적으로 학습 데이터를 난이도에 따라 구분한 뒤 순차적으로 학습하는 방법론인 커리큘럼 러닝이 주목받고 있지만, 난이도를 측정하는 방법이 복잡하거나 범용적이지 않다는 한계를 지닌다. 따라서, 본 연구에서는 신뢰할 수 있는 사전 정보를 통해 데이터의 학습 난이도를 측정하고, 이를 다양한 과제에 쉽게 활용할 수 있는 데이터 이질성 기반 커리큘럼 러닝 방법론을 제안한다. 제안방법론의 성능 평가를 위해 국가 R&D 과제 전문 문서 중 정보통신 분야 전문 문서 5,000건, 보건의료전문 문서 데이터 4,917건을 적용하여 실험을 수행한 결과, 제안 방법론이 LoRA 미세조정과 전체 미세조정 모두에서 전통적인 미세조정에 비해 분류 정확도 측면에서 우수한 성능을 나타냄을 확인했다.

스트리밍 빅데이터의 프라이버시 보호 동반 실용적 분석을 통한 지식 활용과 재사용 연구 (Research of Knowledge Management and Reusability in Streaming Big Data with Privacy Policy through Actionable Analytics)

  • 백주련;이영숙
    • 디지털산업정보학회논문지
    • /
    • 제12권3호
    • /
    • pp.1-9
    • /
    • 2016
  • The current meaning of "Big Data" refers to all the techniques for value eduction and actionable analytics as well management tools. Particularly, with the advances of wireless sensor networks, they yield diverse patterns of digital records. The records are mostly semi-structured and unstructured data which are usually beyond of capabilities of the management tools. Such data are rapidly growing due to their complex data structures. The complex type effectively supports data exchangeability and heterogeneity and that is the main reason their volumes are getting bigger in the sensor networks. However, there are many errors and problems in applications because the managing solutions for the complex data model are rarely presented in current big data environments. To solve such problems and show our differentiation, we aim to provide the solution of actionable analytics and semantic reusability in the sensor web based streaming big data with new data structure, and to empower the competitiveness.

P.intermedia의 유전자 이종성과 가족내 전이에 관한 연구 (TRANSMISSION OF PREVOTELLA INTERMEDIA BY GENOMIC DAN FINGERPRINTING)

  • 이승민;김각균;정종평
    • Journal of Periodontal and Implant Science
    • /
    • 제25권1호
    • /
    • pp.89-98
    • /
    • 1995
  • P. intermedia are considered an important pathogen in adult periodontitis, rapidly progressing periodontitis, refractory periodontitis, pregnancy gingivitis, acute necrotizing ulcerative gingivitis, pubertal gingivitis. So far 2 DNA homology groups and 3 serotypes of P. intermedia have been reported but there is no data available as yet regarding genetic diversity for the species P. intermedia. The purpose of this study is to investigate, using bacterial DNA restriction endonuclease analysis, genetic diversity between individual strains of P. intermedia which are indistinguishable by serotyping and biotyping, occurrence of an intrafamilial transmission and genetic heterogeneity between P. intermedia strains isolated within a patient and within the same serotypes. The families who have had no systemic disease, no experience of periodontal treatment for the previous 1 year and no experience of antibiotics for the previous 6 months were selected and subgingival plaque was collected at 4 sites in each person and incubated in the anaerobic chamber. P. intermedia were identified by colony shape, gram stain, biochemical test, SK-I03(Sunstar Inc.) test and IIF using monoclonal antibody was perfomed for the determination of serotypes. P. intermedia strains were grown in BHI broth and whole genomic DNA was extracted and digested by restriction endonuclease. The resulting DNA fragments were separated by agarose gel electrophoresis, stained and photographed under UV. As the results of this study, intrafamilial vertial transmissions could be assessed in 2 families and horizintal transmissions in another 2 families. There were different DNA digest patterns within a patient, so P. intermedia showed that individuals could be colonized by multiple clonal types at anyone time. And different serotypes could be found within a patient and in the same serotype within a patient, obvius genetic heterogeneity could not be assessed. But in the same serotype in different famies, there were differences in the DNA digest patterns.

  • PDF

크기가 1인 표본들로 구성된 집단에 기반한 모평균의 차이를 검정하기 위한 최소 조합 t-검정 방법 (A minimum combination t-test method for testing differences in population means based on a group of samples of size one)

  • 허미영;임창원
    • 응용통계연구
    • /
    • 제30권2호
    • /
    • pp.301-309
    • /
    • 2017
  • 일반적으로 각 N개의 모집단에서 2개 이상의 표본이 추출되었을 때, $H_0:{\mu}_1={\cdots}={\mu}_N$의 가설에 대하여 검정할 수 있지만 각 모집단으로부터 표본이 한 개씩 추출된다면 ${\bar{X}}$가 존재하지 않으므로 모평균의 차이 검정은 불가능하다. 하지만 하나씩 추출된 표본으로 구성된 집단을 두 집단으로 나누어 임의의 평균을 생성함으로써 평균의 차이를 비교한다면 표본들 사이에 존재할 수 있는 이질성을 파악할 수 있다. 따라서 우리는 두 집단으로 나눌 수 있는 조합의 수만큼 평균 차이를 검정할 수 있는 최소 조합 t-검정 방법을 제안하고자 한다. 최종적으로 본 논문에서는 한 개씩 추출된 표본들 사이의 이질성을 확인하기 위하여 평균 차이를 검정할 수 있는 방법을 제안하였고 모의실험 연구를 통해 성능을 확인하였고 실제 자료 분석을 통해 결과를 도출하였다.

Detecting response patterns of zooplankton to environmental parameters in shallow freshwater wetlands: discovery of the role of macrophytes as microhabitat for epiphytic zooplankton

  • Choi, Jong-Yun;Kim, Seong-Ki;Jeng, Kwang-Seuk;Joo, Gea-Jae
    • Journal of Ecology and Environment
    • /
    • 제38권2호
    • /
    • pp.133-143
    • /
    • 2015
  • Freshwater macrophytes improve the structural heterogeneity of microhabitats in water, often providing an important habitat for zooplankton. Some studies have focused on the overall influence of macrophytes on zooplankton, but the effects of macrophyte in relation to different habitat characteristics of zooplankton (e.g., epiphytic and pelagic) have not been intensively studied. We hypothesized that different habitat structures (i.e., macrophyte habitat) would strongly affect zooplankton distribution. We investigated zooplankton density and diversity, macrophyte characteristics (dry weight and species number), and environmental parameters in 40 shallow wetlands in South Korea. Patterns in the data were analyzed using a self-organizing map (SOM), which extracts information through competitive and adaptive properties. A total of 20 variables (11 environmental parameters and 9 zooplankton groups) were patterned onto the SOM. Based on a U-matrix, 3 clusters were identified from the model. Zooplankton assemblages were positively related to macrophyte characteristics (i.e., dry weight and species number). In particular, epiphytic species (i.e., epiphytic rotifers and cladocerans) exhibited a clear relationship with macrophyte characteristics, while large biomass and greater numbers of macrophyte species supported high zooplankton assemblages. Consequently, habitat heterogeneity in the macrophyte bed was recognized as an important factor to determine zooplankton distribution, particularly in epiphytic species. The results indicate that macrophytes are critical for heterogeneity in lentic freshwater ecosystems, and the inclusion of diverse plant species in wetland construction or restoration schemes is expected to generate ecologically healthy food webs.

Nephron Heterogeneity of Renin Release in Rat Kidney Slices: Effects of L-Isoproterenol, Angiotensin II and TMB-8

  • Seul, Kyung-Hwan;Kim, Suhn-Hee;Koh, Gou-Young;Cho, Kyung-Woo
    • The Korean Journal of Physiology
    • /
    • 제25권1호
    • /
    • pp.61-67
    • /
    • 1991
  • In order to determine possible relationships between the renin-angiotensin system and nephron heterogeneity, we compared the response of renin release and the angiotensin-converting enzyme (ACE) activity from different areas of the rat kidney. We used the renal cortical slices from the capsular surface to the juxtamedullary junction. Slices from outer one-third of the cortex were designated as outer cortical slices (OC), middle one-third as midcortical slices (MC), and inner one-third as inner cortical slices (IC). The renal renin content markedly decreased from OC and MC to IC. The basal lenin release was higher in OC than in MC or IC. On the contrary the percent change of renin release in response to L-isoproterenol was significantly higher in MC than in OC or IC. By TMB-8, the renin release in MC by $231{\pm}21%$ was higher than OC by $171{\pm}19%$ or IC by $$162{\pm}19. Angiotensin II suppressed renin release in OC and MC by $68{\pm}2,\;71{\pm}4%$ respectively, but only $40{\pm}7%$ in IC. The ACE activity was higher in IC than in OC, MC, medulla and papilla. The present data indicate that renin content and basal lenin release gradulally decreased from outer (OC) to inner (IC) cortex. The renin release in response to beta-adrenergic agonist, L-isoproterenol and intracellular calcium antagonist, TMB-8 were higher in MC than in OC and IC, but angiotensin II suppressed renin release less in IC than in OC and MC. It is suggested that juxtaglomerular cells of outer, mid-and inner cortices show a difference in renin release response to the stimuli.

  • PDF

농촌주민의 지역사회조직 참여 실태 분석 (Socio-demographic Heterogeneity of Community Participation in Rural, Korea)

  • 박덕병;조영숙
    • 한국지역사회생활과학회지
    • /
    • 제16권2호
    • /
    • pp.61-73
    • /
    • 2005
  • This study aims to examine the socio-demographic heterogeneity of community participation in rural Korea. Data was collected through interviews with 1,870 rural householders and housewives who have lived in Up or Myen as an administrative unit of rural communities, and analyzed by the SPSS/PC Win V.10 program. The statistical techniques used for this study were frequency and percentile. The major findings of this study were as follows. Firstly, the extent to which rural people have participated in community organizations were: cooperative groups, $80.8\%$; religious groups, $20.6\%$; learning groups, $12.7\%$; political groups, $9.8\%;$ civil groups $6.7\%$; and voluntary groups, $5.3\%$. Whereas the numbers were high for community participation in groups related to agricultural production, participation in civil and voluntary groups were lower. Secondly, it showed that people who lived in urbanized and high population density areas were more likely to participate in community groups. The diversity of community organizations was different according to the level of rurality. Thirdly, farm householders were more likely to participate in religious, civil and voluntary groups than non-farm householders. Fourthly, people with higher education, females, those in the 40 to 50 age groups were more likely to participate in community organizations. Fifthly, even though men are more likely to participate in political parties, women were more likely then men to agree that women should participate in political parties. This empirical study could support the results of Sundeen (1988) and Wilson and Musick (1997) in that education was related positively to community participation. In addition, we concluded that community participation in a rural development process has two main considerations: philosophical and pragmatic. This implies that there is room for government to enable and facilitate 'true' community participation. That can be done through policy reform which creates a permissive environment for community decision-making and input, in addition to simply supporting community development through financial assistance.

  • PDF

성격유형별 소집단 협동학습이 유아의 과학활동에 미치는 효과 (The Effects of Small Group's Cooperative Learning According to Personality Types on Young Children's Science Activities)

  • 강상;신지혜
    • 한국보육지원학회지
    • /
    • 제9권1호
    • /
    • pp.201-220
    • /
    • 2013
  • 본 연구는 협력적인 탐구과정이 요구되는 과학활동에 초점을 맞추어, 성격 유형별 소집단과학협동학습이 유아의 과학적 능력에 어떠한 영향을 미치는지 알아보고자 하였다. 이를 위해 전라북도 J시에 소재한 S유치원과 J유치원 만 5세를 대상으로 K-ABC 인지능력 검사와 MMTIC 성격유형 검사를 통해 각 기관별로 15명씩 총 30명을 EI지표에 따라 E(외향성)집단과 I(내향성) 집단의 성격유형 동질집단과 EI 혼합집단인 이질집단으로 구성하였다. 자료 분석은 과학적 태도는 공변량분석(ANCOVA), 과학적 지식 발달은 빈도 분석을 하였다. 연구결과 첫째, 소집단 협동학습에서 성격 유형별 동질집단과 이질집단 간 과학적 지식발달에 차이가 나타났다. 둘째, 소집단 협동학습에서 성격 유형별 동질집단과 이질집단 간과학적 태도에도 차이가 나타났다. Scheffe 사후검증을 실시한 결과 E동질집단과 I동질집단 간에 유의한 차이가 있었으나 I동질집단과 이질집단, E동질집단과 이질집단 간에는 차이가 없었고, I동질집단이 과학적 태도 향상에 가장 효과적인 집단구성이었다.

Diagnostic Accuracy of the Quidel Sofia Rapid Influenza Fluorescent Immunoassay in Patients with Influenza-like Illness: A Systematic Review and Meta-analysis

  • Lee, Jonghoo;Song, Jae-Uk;Kim, Yee Hyung
    • Tuberculosis and Respiratory Diseases
    • /
    • 제84권3호
    • /
    • pp.226-236
    • /
    • 2021
  • Background: Although the Quidel Sofia rapid influenza fluorescent immunoassay (FIA) is widely used to identify influenza A and B, the diagnostic accuracy of this test remains unclear. Thus, the objective of this study was to determine the diagnostic performance of this test compared to reverse transcriptase-polymerase chain reaction. Methods: A systematic literature search was performed using MEDLINE, EMBASE, and the Cochrane Central Register. Pooled sensitivity, specificity, diagnostic odds ratio (DOR), and a hierarchical summary receiver-operating characteristic curve (HSROC) of this test for identifying influenza A and B were determined using meta-analysis. A sensitivity subgroup analysis was performed to identify potential sources of heterogeneity within selected studies. Results: We identified 17 studies involving 8,334 patients. Pooled sensitivity, specificity, and DOR of the Quidel Sofia rapid influenza FIA for identifying influenza A were 0.78 (95% confidence interval [CI], 0.71-0.83), 0.99 (95% CI, 0.98-0.99), and 251.26 (95% CI, 139.39-452.89), respectively. Pooled sensitivity, specificity, and DOR of this test for identifying influenza B were 0.72 (95% CI, 0.60-0.82), 0.98 (95% CI, 0.96-0.99), and 140.20 (95% CI, 55.92-351.54), respectively. The area under the HSROC for this test for identifying influenza A was similar to that for identifying influenza B. Age was considered a probable source of heterogeneity. Conclusion: Pooled sensitivities of the Quidel Sofia rapid influenza FIA for identifying influenza A and B did not quite meet the target level (≥80%). Thus, caution is needed when interpreting data of this study due to substantial betweenstudy heterogeneity.