• Title/Summary/Keyword: SNP Haplotype

Search Result 127, Processing Time 0.024 seconds

Tag-SNP selection and online database construction for haplotype-based marker development in tomato (유전자 단위 haplotype을 대변하는 토마토 Tag-SNP 선발 및 웹 데이터베이스 구축)

  • Jeong, Hye-ri;Lee, Bo-Mi;Lee, Bong-Woo;Oh, Jae-Eun;Lee, Jeong-Hee;Kim, Ji-Eun;Jo, Sung-Hwan
    • Journal of Plant Biotechnology
    • /
    • v.47 no.3
    • /
    • pp.218-226
    • /
    • 2020
  • This report describes methods for selecting informative single nucleotide polymorphisms (SNPs), and the development of an online Solanaceae genome database, using 234 tomato resequencing data entries deposited in the NCBI SRA database. The 126 accessions of Solanum lycopersicum, 68 accessions of Solanum lycopersicum var. cerasiforme, and 33 accessions of Solanum pimpinellifolium, which are frequently used for breeding, and some wild-species tomato accessions were included in the analysis. To select tag-SNPs, we identified 29,504,960 SNPs in 234 tomatoes and then separated the SNPs in the genic and intergenic regions according to gene annotation. All tag-SNP were selected from non-synonymous SNPs among the SNPs present in the gene region and, as a result, we obtained tag-SNP from 13,845 genes. When there were no non-synonymous SNPs in the gene, the genes were selected from synonymous SNPs. The total number of tag-SNPs selected was 27,539. To increase the usefulness of the information, a Solanaceae genome database website, TGsol (http://tgsol. seeders.co.kr/), was constructed to allow users to search for detailed information on resources, SNPs, haplotype, and tag-SNPs. The user can search the tag-SNP and flanking sequences for each gene by searching for a gene name or gene position through the genome browser. This website can be used to efficiently search for genes related to traits or to develop molecular markers.

Estimation of Haplotype Proportions in Single Necleotide Polymorphism Group Using EM Algorithm (EM 알고리듬을 이용한 단일염기변이 (SNP;SINGLE NUCLEOTIDE POLYMORPHISM)군의 일배체형 (HAPLOTYPE) 비율 추정)

  • 김선우;김종원;이경아
    • The Korean Journal of Applied Statistics
    • /
    • v.16 no.2
    • /
    • pp.195-202
    • /
    • 2003
  • Haplotype analysis in SNP is very useful for the study of complex genetic disease due to low cost and high efficiency comparing to individual analysis of each SNP, and is functionally important in biological view. But, the gametic phase of haplotypes is usually unknown in SNP group, and it is difficult to predict haplotype proportions. In this study, haplotype proportions were estimated using EM algorithm from diploid data of SNP group in solid tumor group and normal group. From these results, linkage disequilibrium among SNPs was analyzed.

Effects of Single Nucleotide Polymorphism Marker Density on Haplotype Block Partition

  • Kim, Sun Ah;Yoo, Yun Joo
    • Genomics & Informatics
    • /
    • v.14 no.4
    • /
    • pp.196-204
    • /
    • 2016
  • Many researchers have found that one of the most important characteristics of the structure of linkage disequilibrium is that the human genome can be divided into non-overlapping block partitions in which only a small number of haplotypes are observed. The location and distribution of haplotype blocks can be seen as a population property influenced by population genetic events such as selection, mutation, recombination and population structure. In this study, we investigate the effects of the density of markers relative to the full set of all polymorphisms in the region on the results of haplotype partitioning for five popular haplotype block partition methods: three methods in Haploview (confidence interval, four gamete test, and solid spine), MIG++ implemented in PLINK 1.9 and S-MIG++. We used several experimental datasets obtained by sampling subsets of single nucleotide polymorphism (SNP) markers of chromosome 22 region in the 1000 Genomes Project data and also the HapMap phase 3 data to compare the results of haplotype block partitions by five methods. With decreasing sampling ratio down to 20% of the original SNP markers, the total number of haplotype blocks decreases and the length of haplotype blocks increases for all algorithms. When we examined the marker-independence of the haplotype block locations constructed from the datasets of different density, the results using below 50% of the entire SNP markers were very different from the results using the entire SNP markers. We conclude that the haplotype block construction results should be used and interpreted carefully depending on the selection of markers and the purpose of the study.

HapAnalyzer: Minimum Haplotype Analysis System for Association Studies

  • Jung, Ho-Youl;Park, Jung-Sun;Park, Yun-Ju;Kim, Young-Jin;Kimm, Kuchan;Koh, InSong
    • Genomics & Informatics
    • /
    • v.2 no.2
    • /
    • pp.107-109
    • /
    • 2004
  • HapAnalyzer is an analysis system that provides minimum analysis methods for the SNP-based association studies. It consists of Hardy-Weinberg equilibrium (HWE) test, linkage disequilibrium (LD) computation, haplotype reconstruction, and SNP (or haplotype)-phenotype association assessment. It is well suited to a case-control association study for the unrelated population.

Haplotype Assembly from Weighted SNP Fragments and Related Genotype Information (신뢰도를 가진 SNP 단편들과 유전자형으로부터 일배체형 조합)

  • Kang, Seung-Ho;Jeong, In-Seon;Choi, Mun-Ho;Lim, Hyeong-Seok
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.35 no.11
    • /
    • pp.509-516
    • /
    • 2008
  • The Minimum Letter Flips (MLF) model and the Weighted Minimum Letter Flips (WMLF) model are for solving the haplotype assembly problem. But these two models are effective only when the error rate in SNP fragments is low. In this paper, we first establish a new computational model that employs the related genotype information as an improvement of the WMLF model and show its NP-hardness, and then propose an efficient genetic algorithm to solve the haplotype assembly problem. The results of experiments on random data set and a real data set indicate that the introduction of genotype information to the WMLF model is quite effective in improving the reconstruction rate especially when the error rate in SNP fragments is high. And the results also show that genotype information increases the convergence speed of the genetic algorithm.

Development of Haplotype Reconstruction System Using Public Resources (공개용 리소스를 활용한 Haplotype 재조합 시스템 개발)

  • Kim, Ki-Bong
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.11 no.2
    • /
    • pp.720-726
    • /
    • 2010
  • Haplotype-based research has become increasingly important in the field of personalized medicine since the haplotype reflects a set of SNPs (Single Nucleotide Polymorphisms) that are genetically associated and inherited together. Currently, the most widely used application softwares available for haplotype reconstruction, based on in silico method, include PL-EM, Haplotyper, PHASE and HAP. PL-EM, Haplotyper and PHASE are command-line application running on LINUX or Unix system and HAP is a web-based client-server application. This paper deals with an integrated haplotype reconstruction system that have been developed with PL-EM and Haplotyper selected from the accuracy test with experimentally verified data on public application softwares. This integrated system is a kind of client-sever one with user friendly web interface and can provide end-users with a high quality of haplotype analysis. SNPs genotype data with a length of 5 derived from 5 people and SNPs genotype data with a length of 13 derived from 15 people were used to test the analysis results of Haplotyper and PL-EM respectively. As a result, this system has been confirmed to provide the systematic and easy-to-understand analysis results that consist of two main parts, i.e. individual haplotype information and haplotype pool information. In this respect, the integration system will be utilized as a useful tool for the discovery of disease related genes and the development of personalized drugs through facilitating the reconstruction of haplotype maps.

Effect of Single Nucleotide Polymorphisms of Acetyl-CoA Carboxylase α(ACACA) Gene on Carcass Traits in Hanwoo (Korean Cattle)

  • Shin, Sung-Chul;Heo, Jae-Pil;Chung, Eui-Ryong
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.24 no.6
    • /
    • pp.744-751
    • /
    • 2011
  • Meat production and quality traits in beef cattle are largely affected by genetic factors. Acetyl-Coenzyme A carboxylase-${\alpha}$ (ACACA) plays a key role in the regulation and metabolism of fatty acid biosynthesis in mammalian animals. The gene encoding ACACA enzyme was chosen as a candidate gene for carcass and meat traits. In this study, we investigated effects of single nucleotide polymorphisms (SNPs) in the ACACA gene on beef carcass and meat traits in Hanwoo (Korean cattle) populations. We have sequenced a fragment of intron I region of the Hanwoo ACACA gene and identified two SNPs. Genotyping of the two SNP markers (g.2344T>C and g.2447C>A) was carried out using PCR-SSCP analysis in 309 Hanwoo steers to evaluate their association with carcass and meat production traits. The g.2344C SNP marker showed a significant increasing effect on LW (p = 0.009) and CW (p = 0.017). Animals with the CC genotype had higher CW and LW compared with TT and TC genotypes (p<0.05). The g.2447A SNP marker was associated with higher MC (p = 0.019). Animals with the AA genotype had higher MC than animals with CC and CA genotypes (p<0.05). Although the degree of linkage disequilibrium (LD) was not strong between g.2344T>C and g.2447C>A in the LD analysis, four major haplotype classes were formed with two SNP information within the ACACA gene. We constructed haplotypes using the HaploView software package program and analyzed association between haplotypes and carcass traits. The haplotype of ACACA gene significantly affected the LW (p = 0.027), CW (p = 0.041) and MC (p = 0.036). The effect of h1 haplotype on LW and CW was larger than that of h3 haplotype. Animals with the h1 haplotype also had greater MC than did animals with h2 haplotype. Consequently, the ACACA gene could be useful as a DNA marker for meat production traits such as carcass yield and meat contents in Hanwoo.

SNP과 Haplotype 분석의 통계적 문제점들

  • Kim, Ho;Jo, Seong-Il;Seo, Yu-Sin;Hyeon, Sun-Ju;No, Jae-Jeong;Lee, Bok-Ju
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2002.11a
    • /
    • pp.203-207
    • /
    • 2002
  • Post-genome 시대를 맞이하여 인류는 전 유전체에서의 염기서열에 대한 정보를 가질 수 있게 되었다. 이러한 정보를 이용하여서 인간에게 나타나는 다양성을 설명하기 위해서 SNP(Single Nucleotide Polymorphism)의 연구가 활발히 되고 있다. 하지만 인간 체세포의 염색체는 2쌍으로 되어있기 때문에 이러한 정보가 어떠한 쌍의 조합(haplotype)으로 나타나는가를 고려하여야한다. 현재 실험적 방법으로 이를 고려하기에는 여러 가지 제약이 따르므로 통계적인 방법으로 이를 모형화하려는 노력(in silico haplotyping)이 시도되고 있다. 이 논문에서는 통계적으로 haplotype을 정하는 대표적인 알고리즘인 Clark's algorithm, E-M algorithm 등에 대한 고찰을 통하여 유전체통계학에 대한 소개를 하고자 한다.

  • PDF

LD-based tagSNP Selection System for Large-scale Haplotype and Genotype Datasets (대용량의 Haplotype과 Genotype데이터에 대한 LD기반의 tagSNP 선택 시스템)

  • Kim, Sang-Jun;Yeo, Sang-Soo;Kim, Sung-Kwon
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2004.11a
    • /
    • pp.279-285
    • /
    • 2004
  • In the disease association study, the tagSNP selection problem is important at the view of time and cost. We developed the new tagSNP selection system that has also facilities for the haplotype reconstruction and missing data processing. In our system, we improved biological meanings using LD coefficients as well as dynamic programming method. And our system has capability of processing large -scale dataset, such as the total SNPs on a chromosome. We have tested our system with various dataset from daly et al., patil et al., HapMap Project, artificial dataset, and so on.

  • PDF

Linkage Disequilibrium (LD) Mapping and Tagging SNP Selection of C-Fos Induced Growth Factor (Figf) Gene in Korean Population

  • Kim, Sook;Yoo, Yeon-Kyung;Jang, Hye-Yoon;Shin, Eun-Soon;Cho, Eun-Young;Kim, Eu-Gene;NamKung, Jung-Hyun;Yang, Jun-Mo;Lee, Jong-Eun
    • Molecular & Cellular Toxicology
    • /
    • v.2 no.1
    • /
    • pp.7-10
    • /
    • 2006
  • We performed comprehensive SNP validation and linkage disequilibrium (LD) analysis of the c-fos induced growth factor (Figf) gene in Korean population. Out of 32 SNPs, only 9 SNPs were polymorphic in Korean population. Validated SNPs formed a single extended haplotype block with strong LD through the entire length of the gene. Tagging SNP analysis picked only 2 SNPs to represent most of the genetic variation information of the Figf gene. Our results demonstrate the utility of LD block and tagging SNP analysis for an efficient way of performing a candidate gene based association study.