DOI QR코드

DOI QR Code

A Short Report on the Markov Property of DNA Sequences on 200-bp Genomic Units of Roadmap Genomics ChromHMM Annotations: A Computational Perspective

  • Park, Hyun-Seok (Bioinformatics Laboratory, ELTEC College of Engineering, Ewha Womans University)
  • Received : 2018.11.26
  • Accepted : 2018.12.13
  • Published : 2018.12.31

Abstract

The non-coding DNA in eukaryotic genomes encodes a language that programs chromatin accessibility, transcription factor binding, and various other activities. The objective of this study was to determine the effect of the primary DNA sequence on the epigenomic landscape across a 200-base pair of genomic units by integrating 127 publicly available ChromHMM BED files from the Roadmap Genomics project. Nucleotide frequency profiles of 127 chromatin annotations stratified by chromatin variability were analyzed and integrative hidden Markov models were built to detect Markov properties of chromatin regions. Our aim was to identify the relationship between DNA sequence units and their chromatin variability based on integrated ChromHMM datasets of different cell and tissue types.

Keywords

References

  1. ENCODE. ENCODE chromatin state segmentation by HMM from broad institute, MIT and MGH. Santa Cruz: UCSC Genome Bioinformatics, 2011. Accessed 2018 Dec 1. Available from: http://moma.ki.au.dk/genome-mirror/cgibin/hgTrackUi?db=hg18&g=wgEncodeBroadHmm.
  2. Roadmap Epigenomics Consortium, Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, et al. Integrative analysis of 111 reference human epigenomes. Nature 2015;518:317-330. https://doi.org/10.1038/nature14248
  3. Ward LD, Kellis M. HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants. Nucleic Acids Res 2012;40:D930-D934. https://doi.org/10.1093/nar/gkr917
  4. Ritchie GR, Dunham I, Zeggini E, Flicek P. Functional annotation of noncoding sequence variants. Nat Methods 2014;11:294-296. https://doi.org/10.1038/nmeth.2832
  5. Lu Q, Hu Y, Sun J, Cheng Y, Cheung KH, Zhao H. A statistical framework to predict functional non-coding regions in the human genome through integrated analysis of annotation data. Sci Rep 2015;5:10576. https://doi.org/10.1038/srep10576
  6. Zhou J, Troyanskaya OG. Predicting effects of noncoding variants with deep learning-based sequence model. Nat Methods 2015;12:931-934. https://doi.org/10.1038/nmeth.3547
  7. Kasowski M, Kyriazopoulou-Panagiotopoulou S, Grubert F, Zaugg JB, Kundaje A, Liu Y, et al. Extensive variation in chromatin states across humans. Science 2013;342:750-752. https://doi.org/10.1126/science.1242510
  8. Ernst J, Kellis M. ChromHMM: automating chromatin-state discovery and characterization. Nat Methods 2012;9:215-216. https://doi.org/10.1038/nmeth.1906
  9. Ernst J, Kheradpour P, Mikkelsen TS, Shoresh N, Ward LD, Epstein CB, et al. Mapping and analysis of chromatin state dynamics in nine human cell types. Nature 2011;473:43-49. https://doi.org/10.1038/nature09906
  10. Lee KE, Park HS. Preliminary testing for the Markov property of the fifteen chromatin states of the Broad Histone Track. Biomed Mater Eng 2015;26 Suppl 1:S1917-S1927.
  11. Wang Lab at Washington University in St. Louis. ROADMAP epigenomics project Chromatin state learning. St. Louis: Wang Lab, 2015. Accessed 2018 Sep 1. Available from: http://egg2.wustl.edu/roadmap/web_portal/chr_state_learning.html.