• Title/Summary/Keyword: high-density SNP chip (고밀도 SNP 칩)

SNP Marker Selection for Dog Breed Identification from Genotypes of High-density SNP Array and Machine Learning (고밀도 SNP 칩 유전자형 데이터 기계학습 기반 반려견 품종 식별 유전마커 선발)

  • Hyung-Yong Kim;Bong-Hwan Choi;Taeyun Oh;Byeong-Chul Kang
    • Journal of agriculture & life science / v.53 no.4 / pp.93-101 / 2019
  • The dog (Canis lupus familiaris) is a member of the genus Canis, which forms part of the wolf-like canids, and it has evolved into diverse domestic breeds over the past 100 thousand years. Practical dog breed identification has become an important part of the pet industry, for example in genealogical certificates. Using 226 dogs from 11 breeds and 23K SNP genotypes, we selected a minimal set of SNPs for breed identification using machine learning algorithms, including multiclass classification and feature selection. Over 100 random splits of the data into 70% for training and 30% for testing, we evaluated the accuracy of 9 classifiers and 2 feature selection methods. Linear SVM combined with PCA-weighted feature selection showed the best classification accuracy. Finally, we selected a set of SNP markers that could identify the 11 breeds with approximately 90% accuracy using 40 SNPs. This marker set is expected to be useful for dog breed and disease management when integrated with disease markers.
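
The evaluation protocol described in this abstract (rank SNPs on the training split, keep the top k, classify the held-out 30%, repeat over 100 random splits) can be sketched roughly as follows. This is only an illustration under loud assumptions: the genotypes are synthetic, a nearest-centroid classifier stands in for the linear SVM, and a between-breed variance score stands in for the PCA-weighted feature selection. All function names and parameters (`make_genotypes`, `rank_snps`, `evaluate`, and so on) are hypothetical and are not taken from the paper.

```python
import random


def avg(values):
    values = list(values)
    return sum(values) / len(values)


def make_genotypes(n_breeds=3, dogs_per_breed=20, n_snps=30, n_informative=6, seed=0):
    """Synthetic 0/1/2 genotypes (assumption, not the paper's data).

    The first n_informative SNPs are fixed in exactly one breed each;
    the remaining SNPs are random noise unrelated to breed.
    """
    rng = random.Random(seed)
    X, y = [], []
    for breed in range(n_breeds):
        for _ in range(dogs_per_breed):
            row = [2 if j % n_breeds == breed else 0 for j in range(n_informative)]
            row += [rng.choice((0, 1, 2)) for _ in range(n_snps - n_informative)]
            X.append(row)
            y.append(breed)
    return X, y


def breed_means(X, y):
    """Mean genotype per breed for every SNP; doubles as the class centroids."""
    groups = {}
    for row, label in zip(X, y):
        groups.setdefault(label, []).append(row)
    return {label: [avg(col) for col in zip(*rows)] for label, rows in groups.items()}


def rank_snps(X, y):
    """Rank SNP indices by the variance of their per-breed means, highest first.

    Stand-in for PCA-weighted feature selection.
    """
    centroids = list(breed_means(X, y).values())
    scores = []
    for j in range(len(X[0])):
        vals = [c[j] for c in centroids]
        m = avg(vals)
        scores.append(avg((v - m) ** 2 for v in vals))
    return sorted(range(len(scores)), key=lambda j: -scores[j])


def predict(centroids, row, snps):
    """Nearest-centroid prediction using only the selected SNPs (stand-in for linear SVM)."""
    def dist(label):
        return sum((row[j] - centroids[label][j]) ** 2 for j in snps)
    return min(centroids, key=dist)


def evaluate(X, y, k_values=(1, 6), n_repeats=100, train_frac=0.7, seed=1):
    """Mean test accuracy per marker-set size k over repeated random 70/30 splits."""
    rng = random.Random(seed)
    idx = list(range(len(X)))
    acc = {k: [] for k in k_values}
    for _ in range(n_repeats):
        rng.shuffle(idx)
        cut = round(train_frac * len(idx))
        train, test = idx[:cut], idx[cut:]
        X_train = [X[i] for i in train]
        y_train = [y[i] for i in train]
        # Ranking and centroids use training dogs only, so the test split stays unseen.
        ranking = rank_snps(X_train, y_train)
        centroids = breed_means(X_train, y_train)
        for k in k_values:
            snps = ranking[:k]
            hits = sum(predict(centroids, X[i], snps) == y[i] for i in test)
            acc[k].append(hits / len(test))
    return {k: avg(v) for k, v in acc.items()}


if __name__ == "__main__":
    X, y = make_genotypes()
    for k, accuracy in evaluate(X, y).items():
        print(f"{k} SNPs: mean accuracy {accuracy:.3f}")
```

Ranking the SNPs inside each split, rather than once on the full data set, keeps the held-out dogs from influencing marker selection, which would otherwise inflate the reported accuracy.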

Current Status of Cattle Genome Sequencing and Analysis using Next Generation Sequencing (차세대유전체해독 기법을 이용한 소 유전체 해독 연구현황)

  • Choi, Jung-Woo;Chai, Han-Ha;Yu, Dayeong;Lee, Kyung-Tai;Cho, Yong-Min;Lim, Dajeong
    • Journal of Life Science / v.25 no.3 / pp.349-356 / 2015
  • Thanks to recent advances in next-generation sequencing (NGS) technology, diverse livestock species have been dissected at the genome-wide sequence level. As for cattle, there are currently four Korean indigenous breeds registered with the Domestic Animal Diversity Information System of the Food and Agriculture Organization of the United Nations: Hanwoo, Chikso, Heugu, and Jeju Heugu. These native genetic resources were recently whole-genome resequenced using various NGS technologies, providing an enormous amount of single nucleotide polymorphism information across the genomes. These NGS applications further provided biological insights, such as that Korean native cattle are genetically distant from some cattle breeds of European origin. In addition, NGS technology was successfully applied to detect structural variations, particularly copy number variations, which were usually difficult to identify at the genome-wide level with reasonable accuracy. Despite this success, those recent studies also showed an inherent limitation in that they sequenced only a representative individual of each breed. To elucidate the biological implications of the sequenced data, further confirmatory studies should follow that sequence or validate the population of each breed. Because NGS sequencing prices have consistently dropped, various population genomic theories can now be applied to sequencing data obtained from the population of each breed of interest. Few such population studies are available yet for the Korean native cattle breeds, but this situation should soon improve with the recent initiative for NGS sequencing of diverse native livestock resources, including the Korean native cattle breeds.