• Title/Summary/Keyword: Classifier algorithm

Search Result 722, Processing Time 0.018 seconds

Estimation of Genetic Parameters for Linear Type and Conformation Traits in Hanwoo Cows (한우 암소의 선형 및 외모심사형질에 대한 유전모수 추정)

  • Lee, Ki-Hwan;Koo, Yang-Mo;Kim, Jung-Il;Song, Chi-Eun;Jeoung, Yeoung-Ho;Noh, Jae-Kwang;Ha, Yu-Na;Cha, Dae-Hyeop;Son, Ji-Hyun;Park, Byong-Ho;Lee, Jae-Gu;Lee, Jung-Gyu;Lee, Ji-Hong;Do, Chang-Hee;Choi, Tae-Jeong
    • Journal of agriculture & life science
    • /
    • v.51 no.6
    • /
    • pp.89-105
    • /
    • 2017
  • This study utilized 32,312 records of 17 linear type and 10 conformation traits(including final scores) of Hanwoo cows in the KAIA(Korea Animal Improvement Association) ('09~'10), with 60,556 animals in the pedigree file. Traits included stature, body length, strength, body depth, angularity, shank thickness, rump angle, rump length, pin bone width, thigh thickness, udder volume, teat length, teat placement, foot angle, hock angle, rear leg back view, body balance, breed characteristic, head development, forequarter quality, back line, rump, thigh development, udder development, leg line, and final score. Genetic and residual(co) variances were estimated using bi-trait pairwise analyses with EM-REML algorithm. Herd-year-classifier, year at classification, and calving stage were considered as fixed effects with classification months as a covariate. The heritability estimates ranged from 0.03(teat placement) to 0.42(body length). Rump length had the highest positive genetic correlation with pin bone width(0.96). Moreover, stature, body length, strength, and body depth had the highest positive genetic correlations with rump length, pin bone width, and thigh thickness(0.81-0.94). Stature, body length, strength, body depth, rump length, pin bone width, and thigh thickness traits also had high positive genetic correlations.

A Study on Differences of Contents and Tones of Arguments among Newspapers Using Text Mining Analysis (텍스트 마이닝을 활용한 신문사에 따른 내용 및 논조 차이점 분석)

  • Kam, Miah;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.3
    • /
    • pp.53-77
    • /
    • 2012
  • This study analyses the difference of contents and tones of arguments among three Korean major newspapers, the Kyunghyang Shinmoon, the HanKyoreh, and the Dong-A Ilbo. It is commonly accepted that newspapers in Korea explicitly deliver their own tone of arguments when they talk about some sensitive issues and topics. It could be controversial if readers of newspapers read the news without being aware of the type of tones of arguments because the contents and the tones of arguments can affect readers easily. Thus it is very desirable to have a new tool that can inform the readers of what tone of argument a newspaper has. This study presents the results of clustering and classification techniques as part of text mining analysis. We focus on six main subjects such as Culture, Politics, International, Editorial-opinion, Eco-business and National issues in newspapers, and attempt to identify differences and similarities among the newspapers. The basic unit of text mining analysis is a paragraph of news articles. This study uses a keyword-network analysis tool and visualizes relationships among keywords to make it easier to see the differences. Newspaper articles were gathered from KINDS, the Korean integrated news database system. KINDS preserves news articles of the Kyunghyang Shinmun, the HanKyoreh and the Dong-A Ilbo and these are open to the public. This study used these three Korean major newspapers from KINDS. About 3,030 articles from 2008 to 2012 were used. International, national issues and politics sections were gathered with some specific issues. The International section was collected with the keyword of 'Nuclear weapon of North Korea.' The National issues section was collected with the keyword of '4-major-river.' The Politics section was collected with the keyword of 'Tonghap-Jinbo Dang.' All of the articles from April 2012 to May 2012 of Eco-business, Culture and Editorial-opinion sections were also collected. All of the collected data were handled and edited into paragraphs. We got rid of stop-words using the Lucene Korean Module. We calculated keyword co-occurrence counts from the paired co-occurrence list of keywords in a paragraph. We made a co-occurrence matrix from the list. Once the co-occurrence matrix was built, we used the Cosine coefficient matrix as input for PFNet(Pathfinder Network). In order to analyze these three newspapers and find out the significant keywords in each paper, we analyzed the list of 10 highest frequency keywords and keyword-networks of 20 highest ranking frequency keywords to closely examine the relationships and show the detailed network map among keywords. We used NodeXL software to visualize the PFNet. After drawing all the networks, we compared the results with the classification results. Classification was firstly handled to identify how the tone of argument of a newspaper is different from others. Then, to analyze tones of arguments, all the paragraphs were divided into two types of tones, Positive tone and Negative tone. To identify and classify all of the tones of paragraphs and articles we had collected, supervised learning technique was used. The Na$\ddot{i}$ve Bayesian classifier algorithm provided in the MALLET package was used to classify all the paragraphs in articles. After classification, Precision, Recall and F-value were used to evaluate the results of classification. Based on the results of this study, three subjects such as Culture, Eco-business and Politics showed some differences in contents and tones of arguments among these three newspapers. In addition, for the National issues, tones of arguments on 4-major-rivers project were different from each other. It seems three newspapers have their own specific tone of argument in those sections. And keyword-networks showed different shapes with each other in the same period in the same section. It means that frequently appeared keywords in articles are different and their contents are comprised with different keywords. And the Positive-Negative classification showed the possibility of classifying newspapers' tones of arguments compared to others. These results indicate that the approach in this study is promising to be extended as a new tool to identify the different tones of arguments of newspapers.