• Title/Summary/Keyword: Source of Random

Search Result 354, Processing Time 0.02 seconds

Estimation of Heritability and Genetic Parameter for Growth and Body Traits of Pig (종돈의 성장 및 체형 형질에 대한 유전력 및 유전모수 추정에 관한 연구)

  • Kang, Hyun-Sung;Nam, Ki-Chang;Kim, Kyung-Tai;Na, Chong-Sam;Seo, Kang-Seok
    • Journal of Animal Science and Technology
    • /
    • v.54 no.2
    • /
    • pp.83-87
    • /
    • 2012
  • The purpose of this study was to estimate genetic parameters for productive traits in swine. Productive traits were considered on average daily gain (ADG), body height (BH) and body length (BL). Genetic analysis was consisted of 18,668 heads for productive traits which were based on on-farm performance tested from May, 2007 to Apr, 2011. For estimating genetic parameters on productive traits, single best model was fitted after finding source of variance on fixed and random effects and estimated with a multiple trait animal model by using DF-REML (Derivative-Free Restricted Maximum Likelihood). The estimated heritabilities of Duroc, Berkshire, Landrace and Yorkshire 0.22-0.58 for the average daily gain, 0.34-0.41 for the body height and 0.4-0.52 for the body length, respectively. Phenotypic correlations of average daily gain with body height and body length for the four breeds were 0.42-0.48, 0.53-0.58, 0.34-0.46 and 0.47-0.56, respectively. Phenotypic correlations of body height with body length were 0.41, 0.57, 0.52, 0.59, respectively. The estimated genetic correlation coefficients of average daily gain with body height and body length estimated for the four breeds were 0.34-0.47, 0.70-0.75, 0.17-0.38 and 0.50-0.53, respectively. The estimated genetic correlation coefficients of body height with body length were 0.57, 0.69, 0.61 and 0.71, respectively.

Development of a Classification Method for Forest Vegetation on the Stand Level, Using KOMPSAT-3A Imagery and Land Coverage Map (KOMPSAT-3A 위성영상과 토지피복도를 활용한 산림식생의 임상 분류법 개발)

  • Song, Ji-Yong;Jeong, Jong-Chul;Lee, Peter Sang-Hoon
    • Korean Journal of Environment and Ecology
    • /
    • v.32 no.6
    • /
    • pp.686-697
    • /
    • 2018
  • Due to the advance in remote sensing technology, it has become easier to more frequently obtain high resolution imagery to detect delicate changes in an extensive area, particularly including forest which is not readily sub-classified. Time-series analysis on high resolution images requires to collect extensive amount of ground truth data. In this study, the potential of land coverage mapas ground truth data was tested in classifying high-resolution imagery. The study site was Wonju-si at Gangwon-do, South Korea, having a mix of urban and natural areas. KOMPSAT-3A imagery taken on March 2015 and land coverage map published in 2017 were used as source data. Two pixel-based classification algorithms, Support Vector Machine (SVM) and Random Forest (RF), were selected for the analysis. Forest only classification was compared with that of the whole study area except wetland. Confusion matrixes from the classification presented that overall accuracies for both the targets were higher in RF algorithm than in SVM. While the overall accuracy in the forest only analysis by RF algorithm was higher by 18.3% than SVM, in the case of the whole region analysis, the difference was relatively smaller by 5.5%. For the SVM algorithm, adding the Majority analysis process indicated a marginal improvement of about 1% than the normal SVM analysis. It was found that the RF algorithm was more effective to identify the broad-leaved forest within the forest, but for the other classes the SVM algorithm was more effective. As the two pixel-based classification algorithms were tested here, it is expected that future classification will improve the overall accuracy and the reliability by introducing a time-series analysis and an object-based algorithm. It is considered that this approach will contribute to improving a large-scale land planning by providing an effective land classification method on higher spatial and temporal scales.

Sentiment Analysis of Korean Reviews Using CNN: Focusing on Morpheme Embedding (CNN을 적용한 한국어 상품평 감성분석: 형태소 임베딩을 중심으로)

  • Park, Hyun-jung;Song, Min-chae;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.59-83
    • /
    • 2018
  • With the increasing importance of sentiment analysis to grasp the needs of customers and the public, various types of deep learning models have been actively applied to English texts. In the sentiment analysis of English texts by deep learning, natural language sentences included in training and test datasets are usually converted into sequences of word vectors before being entered into the deep learning models. In this case, word vectors generally refer to vector representations of words obtained through splitting a sentence by space characters. There are several ways to derive word vectors, one of which is Word2Vec used for producing the 300 dimensional Google word vectors from about 100 billion words of Google News data. They have been widely used in the studies of sentiment analysis of reviews from various fields such as restaurants, movies, laptops, cameras, etc. Unlike English, morpheme plays an essential role in sentiment analysis and sentence structure analysis in Korean, which is a typical agglutinative language with developed postpositions and endings. A morpheme can be defined as the smallest meaningful unit of a language, and a word consists of one or more morphemes. For example, for a word '예쁘고', the morphemes are '예쁘(= adjective)' and '고(=connective ending)'. Reflecting the significance of Korean morphemes, it seems reasonable to adopt the morphemes as a basic unit in Korean sentiment analysis. Therefore, in this study, we use 'morpheme vector' as an input to a deep learning model rather than 'word vector' which is mainly used in English text. The morpheme vector refers to a vector representation for the morpheme and can be derived by applying an existent word vector derivation mechanism to the sentences divided into constituent morphemes. By the way, here come some questions as follows. What is the desirable range of POS(Part-Of-Speech) tags when deriving morpheme vectors for improving the classification accuracy of a deep learning model? Is it proper to apply a typical word vector model which primarily relies on the form of words to Korean with a high homonym ratio? Will the text preprocessing such as correcting spelling or spacing errors affect the classification accuracy, especially when drawing morpheme vectors from Korean product reviews with a lot of grammatical mistakes and variations? We seek to find empirical answers to these fundamental issues, which may be encountered first when applying various deep learning models to Korean texts. As a starting point, we summarized these issues as three central research questions as follows. First, which is better effective, to use morpheme vectors from grammatically correct texts of other domain than the analysis target, or to use morpheme vectors from considerably ungrammatical texts of the same domain, as the initial input of a deep learning model? Second, what is an appropriate morpheme vector derivation method for Korean regarding the range of POS tags, homonym, text preprocessing, minimum frequency? Third, can we get a satisfactory level of classification accuracy when applying deep learning to Korean sentiment analysis? As an approach to these research questions, we generate various types of morpheme vectors reflecting the research questions and then compare the classification accuracy through a non-static CNN(Convolutional Neural Network) model taking in the morpheme vectors. As for training and test datasets, Naver Shopping's 17,260 cosmetics product reviews are used. To derive morpheme vectors, we use data from the same domain as the target one and data from other domain; Naver shopping's about 2 million cosmetics product reviews and 520,000 Naver News data arguably corresponding to Google's News data. The six primary sets of morpheme vectors constructed in this study differ in terms of the following three criteria. First, they come from two types of data source; Naver news of high grammatical correctness and Naver shopping's cosmetics product reviews of low grammatical correctness. Second, they are distinguished in the degree of data preprocessing, namely, only splitting sentences or up to additional spelling and spacing corrections after sentence separation. Third, they vary concerning the form of input fed into a word vector model; whether the morphemes themselves are entered into a word vector model or with their POS tags attached. The morpheme vectors further vary depending on the consideration range of POS tags, the minimum frequency of morphemes included, and the random initialization range. All morpheme vectors are derived through CBOW(Continuous Bag-Of-Words) model with the context window 5 and the vector dimension 300. It seems that utilizing the same domain text even with a lower degree of grammatical correctness, performing spelling and spacing corrections as well as sentence splitting, and incorporating morphemes of any POS tags including incomprehensible category lead to the better classification accuracy. The POS tag attachment, which is devised for the high proportion of homonyms in Korean, and the minimum frequency standard for the morpheme to be included seem not to have any definite influence on the classification accuracy.

Comparision of Medical Care Utilization Patterns between Beneficiaries of Medical Aid and Medical Insurance (의료보호대상자의 의료이용양상)

  • Kim, Bok-Youn;Kim, Seok-Beom;Kim, Chang-Yoon;Kang, Pock-Soo;Chung, Jong-Hak
    • Journal of Yeungnam Medical Science
    • /
    • v.8 no.2
    • /
    • pp.185-201
    • /
    • 1991
  • A household survey was conducted to compare the patterns of morbidity and medical care utilization between medical aid beneficiaries and medical insurance beneficiaries. The study population included 285 medical aid beneficiaries that were completely surveyed and 386 medical insurance benficiaries selected by simple random sampling from a Dong(Township) in Taegu. Well-trained surveyers mainly interviewed housewives with a structured questionnaire. The morbidity rates of acute illness during the 15-day period, were 63 per 1,000 medical aid beneficiaries and 62 per 1,000 medical insurance beneficiaries. The rates for chronic illness were 123 per 1,000 medical aid beneficiaries and 73 per 1,000 medical insurance beneficiaries. The most common type of acute illness in medical aid and medical insurance beneficiaries was respiratory disease. In medical aid beneficiaries, musculoskeletal disease was most common, but in medical insurance beneficiaries, gastrointestinal disease was most common. The mean duration of acute illness of medical aid beneficiaries was 3.8 days and that of medical insurance beneficiaries was 6.8 days. During the one year period, mean duration of medical aid beneficiaries chronic illnesses was 11.5 months which was almost twice as long compared to medical insurance beneficiaries. Pharmacy was most preferrable facility among the acute illness patient in medical aid beneficiaries, but acute cases of medical insurance beneficiaries visited the clinic most commonly. Chronic cases of both groups visited the clinic most frequently. There were some findings suggesting that much unmet need existed among the medical aid beneficiaries. In acute cases, the average number of days of medical aid users utilized medical facilities was less than medical insurance users. On the other hand, the length of medical care utilization of chronic cases was reversed. Geographical accessibility was the most important factors in utilization of medical facilities. Almost half of the study population answered the questions about source of funds on medical security correctly. Most respondents considered that the objective of medical security was afford ability. The chief complaint on hospital utilization was the complicated administrative procedures. These findings suggest that there were some problems in the medical aid system, especially in the referral system.

  • PDF