• 제목/요약/키워드: Improvement of statistics quality

검색결과 296건 처리시간 0.026초

Enhancement of a language model using two separate corpora of distinct characteristics

  • 조세형;정태선
    • 한국지능시스템학회논문지
    • /
    • 제14권3호
    • /
    • pp.357-362
    • /
    • 2004
  • 언어 모델은 음성 인식이나 필기체 문자 인식 등에서 다음 단어를 예측함으로써 인식률을 높이게 된다. 그러나 언어 모델은 그 도메인에 따라 모두 다르며 충분한 분량의 말뭉치를 수집하는 것이 거의 불가능하다. 본 논문에서는 N그램 방식의 언어모델을 구축함에 있어서 크기가 제한적인 말뭉치의 한계를 극복하기 위하여 두개의 말뭉치, 즉 소규모의 구어체 말뭉치와 대규모의 문어체 말뭉치의 통계를 이용하는 방법을 제시한다. 이 이론을 검증하기 위하여 수십만 단어 규모의 방송용 말뭉치에 수백만 이상의 신문 말뭉치를 결합하여 방송 스크립트에 대한 퍼플렉시티를 30% 향상시킨 결과를 획득하였다.

Identification of Stearoyl-CoA Desaturase (SCD) Gene Interactions in Korean Native Cattle Based on the Multifactor-dimensionality Reduction Method

  • Oh, Dong-Yep;Jin, Me-Hyun;Lee, Yoon-Seok;Ha, Jae-Jung;Kim, Byung-Ki;Yeo, Jung-Sou;Lee, Jea-Young
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제26권9호
    • /
    • pp.1218-1228
    • /
    • 2013
  • Fat quality is determined by the composition of fatty acids. Genetic relationships between this composition and single nucleotide polymorphisms (SNPs) in the stearoyl-CoA desaturase1 (SCD1) gene were examined using 513 Korean native cattle. Single and epistatic effects of 7 SNP genetic variations were investigated, and the multifactor dimensionality reduction (MDR) method was used to investigate gene interactions in terms of oleic acid (C18:1), mono-unsaturated fatty acids (MUFAs) and marbling score (MS). The g.6850+77 A>G and g.14047 C>T SNP interactions were identified as the statistically optimal combination (C18:1, MUFAs and MS permutation p-values were 0.000, 0.000 and 0.001 respectively) of two-way gene interactions. The interaction effects of g.6850+77 A>G, g.10213 T>C and g.14047 C>T reflected the highest training-balanced accuracy (63.76%, 64.70% and 61.85% respectively) and was better than the individual effects for C18:1, MUFAs and MS. In addition, the superior genotype groups were AATTCC, AGTTCC, GGTCCC, AGTCCT, GGCCCT and AGCCTT. These results suggest that the selected SNP combination of the SCD1 gene and superior genotype groups can provide useful inferences for the improvement of the fatty acid composition in Korean native cattle.

A Restricted Partition Method to Detect Single Nucleotide Polymorphisms for a Carcass Trait in Hanwoo

  • Lee, Ji-Hong;Kim, Dong-Chul;Kim, Jong-Joo;Lee, Jea-Young
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제24권11호
    • /
    • pp.1525-1528
    • /
    • 2011
  • The purpose of this study was to detect SNPs that were responsible for a carcass trait in Hanwoo populations. A non-parametric model applying a restricted partition method (RPM) was used, which exploited a partitioning algorithm considering statistical criteria for multiple comparison testing. Phenotypic and genotypic data were obtained from the Hanwoo Improvement Center, National Agricultural Cooperation Federation, Korea, in which the pedigree structure comprised 229 steers from 16 paternal half-sib proven sires that were born in Namwon or Daegwanryong livestock testing station between spring of 2002 and fall of 2003. A carcass trait, longissimus dorsi muscle area for each steer was measured after slaughter at approximately 722 days. Three SNPs (19_1, 18_4 and 28_2) near the microsatellite marker ILSTS035 on BTA6, around which the quantitative trait loci (QTL) for meat quality were previously detected, were used in this study. The RPM analyses resulted in two significant interaction effects between SNPs (19_1 and 18_4) and (19_1 and 28_2) at ${\alpha}$ = 0.05 level. However, under a general linear (parametric) model no interaction effect between any pair of the three SNPs was detected, while only one main effect for SNP19_1 was found for the trait. Also, under another non-parametric model using a multifactor dimensionality reduction (MDR) method, only one interaction effect of the two SNPs (19_1 and 28_2) explained the trait significantly better than the parametric model with the main effect of SNP19_1. Our results suggest that RPM is a good alternative to model choices that can find associations of the interaction effects of multiple SNPs for quantitative traits in livestock species.

Multifactor Dimensionality Reduction (MDR) Analysis to Detect Single Nucleotide Polymorphisms Associated with a Carcass Trait in a Hanwoo Population

  • Lee, Jea-Young;Kwon, Jae-Chul;Kim, Jong-Joo
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제21권6호
    • /
    • pp.784-788
    • /
    • 2008
  • Studies to detect genes responsible for economic traits in farm animals have been performed using parametric linear models. A non-parametric, model-free approach using the 'expanded multifactor-dimensionality reduction (MDR) method' considering high dimensionalities of interaction effects between multiple single nucleotide polymorphisms (SNPs), was applied to identify interaction effects of SNPs responsible for carcass traits in a Hanwoo beef cattle population. Data were obtained from the Hanwoo Improvement Center, National Agricultural Cooperation Federation, Korea, and comprised 299 steers from 16 paternal half-sib proven sires that were delivered in Namwon or Daegwanryong livestock testing stations between spring of 2002 and fall of 2003. For each steer at approximately 722 days of age, the Longssimus dorsi muscle area (LMA) was measured after slaughter. Three functional SNPs (19_1, 18_4, 28_2) near the microsatellite marker ILSTS035 on BTA6, around which the QTL for meat quality were previously detected, were assessed. Application of the expanded MDR method revealed the best model with an interaction effect between the SNPs 19_1 and 28_2, while only one main effect of SNP19_1 was statistically significant for LMA (p<0.01) under a general linear mixed model. Our results suggest that the expanded MDR method better identifies interaction effects between multiple genes that are related to polygenic traits, and that the method is an alternative to the current model choices to find associations of multiple functional SNPs and/or their interaction effects with economic traits in livestock populations.

생활방사선안전 관련 일반지식 측정도구 개발 및 실태분석 (An Analysis and Development of the Measurement on General Knowledge Related to the Safety of Living Radiation)

  • 최경호;서혜영
    • 융합정보논문지
    • /
    • 제12권4호
    • /
    • pp.205-211
    • /
    • 2022
  • 우리 인간의 주변에는 다양한 방사성 물질이 존재하고 있다. 최근 들어서는 삶의 질 향상과 함께 건강에 대한 관심도 높아지면서 방사선을 활용한 검사 등 또한 많아지고 있다. 본 연구에서는 이런 방사선을 생활 관련 방사선으로 정의하고, 이에 대한 지식을 측정할 수 있는 측정도구를 개발해 보았다. 그 결과 신뢰성이 확보되는 18개 문항을 개발하였다. 나아가 이를 이용하여 생활방사선 안전 관련 지식에 대한 실태를 분석해 보았다. 그 결과 방사선 관련 교육을 받은 그룹이 그렇지 않은 그룹에 비하여 통계적으로 유의하게 높은 점수를 받는 것으로 나타났다. 그리고 상관분석 및 회귀분석을 통해서 보았을 때 평소 안전 관련 관심도가 높을수록 생활방사선 안전 관련 지식이 높은 것으로 분석되었다. 이를 토대로 현대인들의 안전을 위하여 학교 교육과정에서 방사선 안전 관련 교수-학습이 이루어져할 필요성이 제안되었다.

클러스터링 기법을 활용한 해외건설 필요정보 우선순위 수요 조사 평가 (Priority Demand Assessment for Overseas Construction Information Using Clustering Method)

  • 최원영;곽승진
    • 한국비블리아학회지
    • /
    • 제29권4호
    • /
    • pp.57-68
    • /
    • 2018
  • 국내 건설시장의 침체가 예상되는 상황에서 지속적으로 국내 중소엔지니어링 기업의 해외시장으로의 진출을 지원하기 위해 해외건설엔지니어링 정보시스템(OVICE)이 운영되고 있다. 이에 본 연구에서는 기존 연구를 통해 수행된 전문가 설문조사를 통한 필요정보 우선순위와 정보시스템 사용자 통계를 비교하여 정보시스템의 정보제공 방향성을 제시하여 정보서비스 품질을 높이고자 하였다. 정보시스템의 이용 통계 분석에 있어서, 우선순위를 매기기 힘든 통계분석의 효율성을 높이기 위해 K-means Clustering 기법을 분석에 활용하였다. 그 결과 기존 설문결과와 정보시스템 이용 통계의 차이를 분석하여 정보시스템의 정보제공에서의 보완점과 함께 설문조사 과정에서 부각되지 않았던 중요한 콘텐츠를 찾아낼 수 있었다.

제 4~5번 요추 추간판 탈출 정도와 요통의 한의학적 치료 효과의 상관성 연구 (The Study on Correlation between the Degree of Herniated Intervertebral Lumbar Disc at L4~5 Level and Improvement of Low Back Pain Treated by Korean Medicine Therapy)

  • 유형진;이현호;정성현;조경상;이기언;이동현;김상민
    • 한방재활의학과학회지
    • /
    • 제26권2호
    • /
    • pp.105-121
    • /
    • 2016
  • Objectives The purpose of this study was to compare the effects between the degree of herniated intervertebral lumbar disc (HIVD) at L4-5 level and improvement of low back pain treated by Korean Medicine therapy. Methods 567 patients who received inpatient treatment from May 2014 to December 2015 in the Daejeon-Jaseng of Korean Medicine Hospital were divided into 6 groups by the degree of HIVD at L4-5 level confirmed with a Lumbar spine magnetic resonance imaging. All patients received a combination of treatment including acupunture, chuna manual therapy, pharmacopunture, herbal medication. They were compared and analyzed on the basis of improvement between measuring Numeric Rating Scale (NRS), Oswestry Disability Index (ODI), EuroQol-5 Dimension Index (EQ5D Index) as they were hospitalized and as they were discharged. The statistically significance was evaluated by SPSS 23.0 for windows. Results After treatment, Normal stage on Intervertebral Lumbar Disc at L4-5 level group's Numeric Rating Scale (NRS), Oswestry Disability Index (ODI), EuroQol-5 Dimension Index (EQ5D Index) improvement was $1.30{\pm}1.62$, $4.52{\pm}11.82$ and $0.04{\pm}0.11$ respectively. Bulging group's improvement was $3.25{\pm}2.81$, $8.28{\pm}13.02$ and $0.09{\pm}0.17$ respectively. Spinal canal occupying ratio (SOR) less than 20 group's improvement was $2.15{\pm}1.92$, $11.79{\pm}17.81$ and $0.13{\pm}0.23$ respectively. SOR 20 to less than 40 stage group's improvement was $2.13{\pm}1.92$. $10.79{\pm}15.93$ and $0.10{\pm}0.26$ respectively. SOR 40 to less than 60 group's improvement was $2.16{\pm}2.24$, $9.80{\pm}16.62$ and $0.15{\pm}0.25$ respectively. Surgery group's improvement was $2.47{\pm}2.21$, $11.64{\pm}18.53$ and $0.15{\pm}0.27$ respectively (p<0.03). But there was no statistically significance between 6 group's improvement after treatment (p>0.05). Conclusions After inpatient treatment by Korean Medicine therapy, Most patient's pain, disability and Health Related Quality of Life was improved significantly. But there was no statistically correlation between the degree of HIVD at L4-5 level and improvement of low back pain. So We think that future research of higher quality and correct statistics shall be necessary.

Spillover Effects of FDI on Technology Innovation of Vietnamese Enterprises

  • HOANG, Duc Than;DO, Anh Duc;TRINH, Mai Van
    • The Journal of Asian Finance, Economics and Business
    • /
    • 제8권1호
    • /
    • pp.655-663
    • /
    • 2021
  • This paper aims to develop a conceptual framework for determinants of spillover effects of FDI on technology innovation of Vietnamese enterprises. The research proposes a logistic regression model for assessing how enterprises' ability to implement technological innovation is affected by the presence of FDI enterprises as well as other factors that show the change through the indirect influence of FDI such as the size of the enterprise, the type of enterprise, and the skill level of the labor force or its research and development activities. Five forms of technology innovation are considered: improving production process; product quality improvement; product expansion; expanding business activities into a new field of production; and changing business activities into a new field of production. General Statistics Office of Vietnam provided survey data to collect information from 3,166 enterprises in the manufacturing and processing industry in Hanoi, which were valid for analysis. The results show that all variables of enterprise type, size, R&D, and industry have a positive impact on the selection of one of the innovation forms. Several recommendations are further suggested to take advantage of the positive effects and minimizing the negative effects of FDI for technological innovation of Vietnamese enterprises.

분산 분석을 이용한 자동차 안전벨트 준정적 해석과 인장시험 상관성 개선 (Quasi-static Analysis of Vehicle Seatbelt Using Analysis of Variance and Improvement of Tensile Test Correlation)

  • 이광섭;어영우;김삼성;김두용;송택림;이경상
    • 한국자동차공학회논문집
    • /
    • 제24권3호
    • /
    • pp.273-278
    • /
    • 2016
  • This study makes a relative comparison of the results of tensile test and quasi-static analysis using AGL(Adjuster Guide Loop) model that plays a role in adjusting the height of shoulder belt, of the components of the vehicle seatbelt system and attempts to propose a method of reducing the error rate of the quasi-static analysis technique effectively. This study selects two major factors affecting the result of an analysis, draws the result of analysis through the method of experimental design, one of the statistical techniques and understands the contribution rate of the major factors affecting the result of the analysis through ANOVA(Analysis of Variance).

암 환자 돌봄제공자의 돌봄부담감과 대처방식이 소진에 미치는 영향 (Influence of Caring Burden and the Way of Coping on Burnout in Caregivers of Cancer Patients)

  • 허수빈;신소영
    • 한국직업건강간호학회지
    • /
    • 제28권2호
    • /
    • pp.114-123
    • /
    • 2019
  • Purpose: The aims of this study were to identify the effects of caring burden and the way of coping on burnout in caregivers of cancer patients. Methods: One-hundred and forty family caregivers of cancer patients who visited the cancer center at one tertiary hospital in metropolitan city B were included. The data collection was conducted from August 1st to October 1st, 2018, using a structured, self-reported questionnaire. The collected data were analyzed using descriptive statistics, t-test, one-way ANOVA, Pearson correlation coefficients, and multiple regression. Results: In the multiple regression analysis, the subject's gender (${\beta}=.12$, p=.028) and caring burden (${\beta}=.74$, p<.001) had a significant effect on burnout. The explanatory power of the subject's gender, education level, religion, caring time, number of family caregivers, monthly income, economic burden, expectation for treatment, caring burden, the way of aggressive coping, and the way of passive coping with burnout was 63.8% (F=23.28, p<.001). Conclusion: Reducing the caring burden in family caregivers of cancer patients will ultimately contribute to reducing burnout, thereby contributing to an improvement in the psychological well-being and quality of life of family members, as well as positively contributing to the recovery of patients.